- Computing Machinery and Intelligence (Alan Turing, 1950) 
- Anderson, He, Buehler, Teney, Johnson, Gould, Zhang, “Bottom-Up and Top-Down Attention”, CVPR 2018 
- Adiwardana, Luong, So, Hall, Fiedel, Thoppilan, Yang, Kulshreshtha, Nemade, Lu, Le, "Towards a Human-like Open-Domain Chatbot", https://arxiv.org/abs/2001.09977 
- Brown et al., “Language Models are Few-shot learners,” 2020.  
- Vaswani et al., "Attention is all you need." 2017 
- Fang, Gupta, Iandola, Srivastava, Deng, Dollar, Gao, He, et al., “From Captions to Visual Concepts and Back,” CVPR2015 
- Guo, Zhang, Hu, He, Gao, “MS-Celeb-1M”, ECCV 2016 
- He, Chen, He, Gao, Li, Deng, Ostendorf, “Deep Reinforcement Learning with a Natural Language Action Space,” ACL2016 
- Huang, He, Gao, Deng, Acero, Heck, “Deep Structured Semantic Model”, CIKM2013 
-  
      
      Liu et al., Mappa Mundi: An Interactive Artistic Mind Map Generator with Artificial Imagination, IJCAI 2019; 
     
-  
      
      Chen et al., MaLiang: An Emotion-Driven Chinese Calligraphy Artwork Composition System, ACM MM 2020 
     
-  
      
      Smith, Williamson, Shuster, Weston, Boureau, “Can You Put it All Together: Evaluating Conversational Agents' Ability to Blend Skills," ACL 2020 
     
-  
      
      Xu, Zhang, Huang, Zhang, Gan, Huang, He, “AttnGAN,” CVPR2018 
     
-  
      
      Yang, He, Gao, Deng, Smola, “Stacked Attention Networks,” CVPR 2016 
     
-  
      
      Yang, Yang, Dyer, He, Smola, Hovy, “Hierarchical Attention Networks”, NAACL2016 
     
-  
      
      Yang, Yih, He, Gao, Deng, “Embedding entitles and relations for learning and inference in knowledge bases”, ICLR2015 
     
-  
      
      Zhang, Yang, He, Deng, “Multimodal Intelligence: Representation Learning, Information Fusion, and Applications”, IEEE JSTSP, March 2020