视觉问答(Visual Question Answering,VQA),是一种涉及计算机视觉和自然语言处理的学习任务。这一任务的定义如下: A VQA system takes as input an image and a free-form, open-ended, natural-language question about the image and produces a natural-language answer as the output[1]。 翻译为中文:一个VQA系统以一张图片和一个关于这张图片形式自由、开放式的自然语言问题作为输入,以生成一条自然语言答案作为输出。简单来说,VQA就是给定的图片进行问答。
  1. Avi Singh's blog
  2. Visual Question Generation as Dual Task of Visual...
  3. Visual Question Answering with Memory-Augmented...
  4. Speech-Based Visual Question Answering
  5. Visual Question Answering Demo in Python Notebook
  6. The Color of the Cat is Gray: 1 Million Full-Sentences Visual...
  7. zhihu.com/lives
  8. Visual Question Answering – Aaditya Prakash...
  9. CVPR 2016 有什么值得关注的亮点? - 知乎
  10. 如何评价 Visual Studio Code? - 知乎
  11. 将错误的版本(有大量的 commits) push 到 Git...
  12. Python出现ValueError: need more than 1 value to unpack...
  13. zhihu.com/question/36701137
  14. 阅读笔记(Multimodal Compact Bilinear Pooling for Visual...
  15. Visual Studio 2017 如何编译 Visual Studio 2015 的项目? - 知乎
  16. Exploring Human-like Attention Supervision in Visual...
  17. Don't Just Assume; Look and Answer: Overcoming Priors for...
  18. Visual Question Answering - handong1587
  19. 知乎 - 发现更大的世界
  20. 怎么能下载到visual modelq? - 知乎
展开全文
参考链接
微信扫码咨询专知VIP会员