Vocabulary learning through multimodal input