According to me no need to include transmission time
RTT=time taken by a packet from sender to receiver +time taken by acknowledgement from receiver to sender.
so by the link given, RTT=2*25=50 m sec
32 frames will take time of 32 m sec
so sender has to wait for time=RTT- 32 m sec = 50-32=18 msec