-
Visually Grounded Interaction and Language, NeurIPS 2019, NeurIPS 2018
-
Emergent Communication: Towards Natural Language, NeurIPS 2019
-
Workshop on Multimodal Understanding and Learning for Embodied Applications, ACM Multimedia 2019
-
Beyond Vision and Language: Integrating Real-World Knowledge, EMNLP 2019
-
The How2 Challenge: New Tasks for Vision & Language, ICML 2019
-
Visual Question Answering and Dialog, CVPR 2019, CVPR 2017
-
Multi-modal Learning from Videos, CVPR 2019
-
Multimodal Learning and Applications Workshop, CVPR 2019, ECCV 2018
-
Habitat: Embodied Agents Challenge and Workshop, CVPR 2019
-
Closing the Loop Between Vision and Language & LSMD Challenge, ICCV 2019
-
Multi-modal Video Analysis and Moments in Time Challenge, ICCV 2019
-
Cross-Modal Learning in Real World, ICCV 2019
-
Spatial Language Understanding and Grounded Communication for Robotics, NAACL 2019
-
YouTube-8M Large-Scale Video Understanding, ICCV 2019, ECCV 2018, CVPR 2017
-
Language and Vision Workshop, CVPR 2019, CVPR 2018, CVPR 2017, CVPR 2015
-
Sight and Sound, CVPR 2019, CVPR 2018
-
The Large Scale Movie Description Challenge (LSMDC), ICCV 2019, ICCV 2017
-
Wordplay: Reinforcement and Language Learning in Text-based Games, NeurIPS 2018
-
Interpretability and Robustness in Audio, Speech, and Language, NeurIPS 2018
-
Multimodal Robot Perception, ICRA 2018
-
WMT18: Shared Task on Multimodal Machine Translation, EMNLP 2018
-
Shortcomings in Vision and Language, ECCV 2018
-
Grand Challenge and Workshop on Human Multimodal Language, ACL 2018
-
Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, EMNLP 2018, EMNLP 2017, NAACL-HLT 2016, EMNLP 2015, ACL 2014, NAACL-HLT 2013
-
Visual Understanding Across Modalities, CVPR 2017
-
International Workshop on Computer Vision for Audio-Visual Media, ICCV 2017
-
Language Grounding for Robotics, ACL 2017
-
Computer Vision for Audio-visual Media, ECCV 2016
-
Language and Vision, ACL 2016, EMNLP 2015