text A is dominated by the voice, when the voice is playing you cant really hear the other noises
in B you feel the crowd can add to what is happening in terms of the clip
A leaves a little more confusion over what is happening 