[VALSE Online Academic Talk 18-16: Announcement and How to Participate] - 专知


May 31, 2018 · VALSE

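Speaker: Lin Ma (马林), Tencent

Time: Wednesday, June 6, 2018, 20:00 (Beijing time)

Title: Image/video Captioning

Host: Yanli Ji (姬艳丽), University of Electronic Science and Technology of China

About the speaker: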

Lin Ma is a Principal Researcher with Tencent AI Lab, Shenzhen, China. Previously, he was a Researcher with Huawei Noah's Ark Lab, Hong Kong, from Aug. 2013 to Sep. 2016. He received his Ph.D. degree from the Department of Electronic Engineering at the Chinese University of Hong Kong (CUHK) in 2013. He received the B.E. and M.E. degrees from Harbin Institute of Technology, Harbin, China, in 2006 and 2008, respectively, both in computer science. His current research interests lie in deep learning and computer vision, especially multimodal deep learning between vision and language.

Homepage:

http://www.ee.cuhk.edu.hk/~lma/

Abstract:

Multimodal learning between vision and language, especially image/video captioning, has become a hot research topic. With the associated language information, a deeper understanding of images and videos can be achieved. I will give a brief introduction to our progress on image/video captioning. For image captioning, we propose learning to guide the decoding process. For video captioning, we propose an encoder-decoder-reconstructor framework that exploits bidirectional information, specifically video-to-text and text-to-video, and thereby boosts video captioning performance. Beyond video captioning, a novel task, dense video captioning, involves not only video localization but also captioning each localized video segment. We build a new end-to-end neural network that fully couples video localization and captioning.

References:

[1] Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present, X. Chen, L. Ma, W. Jiang, J. Yao, and W. Liu, CVPR 2018.

[2] Reconstruction Network for Video Captioning, B. Wang, L. Ma, W. Zhang, and W. Liu, CVPR 2018.

[3] Bidirectional Attentive Fusion with Context Gating for Dense Video Captioning, J. Wang, W. Jiang, L. Ma, W. Liu, and Y. Xu, CVPR 2018.

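How to join VALSE Online Academic Talk 18-16:

Long-press or scan the QR code below to follow the "VALSE" WeChat official account (valse_wechat), then reply "16期" in the account chat to receive the livestream link.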

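Special thanks to the main organizers of this Webinar:

VOOC responsible committee member: Fumin Shen (沈复民), University of Electronic Science and Technology of China

VODB coordinating director: Liang Lin (林倞), Sun Yat-sen University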

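How to participate:

1. VALSE Webinars are held on an online livestreaming platform. During the session the speaker uploads slides or shares their screen; the audience can see the slides, hear the speaker, and interact with the speaker through the chat function.

2. To take part, follow the VALSE WeChat official account (valse_wechat) or join a VALSE QQ group. Groups A, B, C, D, E, F, and G are currently full; apart from speakers and other guests, applicants can only join VALSE group H (group number: 701662399).

*Note: When applying to join a VALSE QQ group, you must provide your name, affiliation, and role; all three are required. After joining, set your group nickname to your real name, role, and affiliation. Role codes: T for university and research-institute staff; I for industry R&D staff; D for Ph.D. students; M for master's students.

3. About 5 minutes before the session starts, the speaker opens the livestream. Click the livestream link to join. Windows PCs, Macs, mobile phones, and other devices are supported.

4. During the session, please do not post off-topic remarks, so that the session can proceed normally.

5. If you cannot hear the audio or see the video during the session, leave and rejoin; this usually resolves the problem.

6. Join from a fast network connection, preferably a wired one.

7. Every Monday, the VALSE WeChat official account posts a summary and video of the previous week's Webinar (subject to the speaker's permission); every Thursday, it posts the announcement and livestream link for the following week's Webinar.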


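About VALSE

VALSE, launched in 2011, is short for Vision And Learning Seminar; the name also alludes to the waltz. It aims to provide an equal and open platform for academic exchange among young Chinese scholars worldwide working in computer vision, pattern recognition, machine learning, multimedia technology, and related fields. Official site: http://valser.org/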
