This is an auditory task. Without the audio, it is impossible to determine which pictures are correct.