来自不同视频示范的节目制作 (Program Generation from Diverse Video Demonstrations)

The ability to use inductive reasoning to extract general rules from multiple observations is a vital indicator of intelligence. As humans, we use this ability to not only interpret the world around us, but also to predict the outcomes of the various interactions we experience. Generalising over multiple observations is a task that has historically presented difficulties for machines to grasp, especially when requiring computer vision. In this paper, we propose a model that can extract general rules from video demonstrations by simultaneously performing summarisation and translation. Our approach differs from prior works by framing the problem as a multi-sequence-to-sequence task, wherein summarisation is learnt by the model. This allows our model to utilise edge cases that would otherwise be suppressed or discarded by traditional summarisation techniques. Additionally, we show that our approach can handle noisy specifications without the need for additional filtering methods. We evaluate our model by synthesising programs from video demonstrations in the Vizdoom environment achieving state-of-the-art results with a relative increase of 11.75% program accuracy on prior works

翻译：利用感性推理从多重观测中提取一般规则的能力是一个重要的智慧指标。作为人类,我们不仅利用这种能力来解释我们周围的世界,而且预测我们所经历的各种互动的结果。概括多重观察是一项任务,在历史上给机器难于掌握,特别是在需要计算机视觉时,这一直是机器难于掌握的任务。在本文中,我们提出了一个模型,可以通过同时进行总结和翻译,从视频演示中提取一般规则。我们的方法不同于先前的工作,我们把问题描述成一个多序列到序列的任务,由模型来学习总结。这使我们的模型能够利用本来会被传统合成技术压制或抛弃的边缘案例。此外,我们表明我们的方法可以处理噪音规格,而不需要额外的过滤方法。我们通过综合Vizdoom环境中的视频演示程序来评估我们的模型,从而实现最新结果,相对提高先前工程11.75%的精确度。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日