导航图图示测试(NTT):学习评估类似人类的导航 (Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation)

A key challenge on the path to developing agents that learn complex human-like behavior is the need to quickly and accurately quantify human-likeness. While human assessments of such behavior can be highly accurate, speed and scalability are limited. We address these limitations through a novel automated Navigation Turing Test (ANTT) that learns to predict human judgments of human-likeness. We demonstrate the effectiveness of our automated NTT on a navigation task in a complex 3D environment. We investigate six classification models to shed light on the types of architectures best suited to this task, and validate them against data collected through a human NTT. Our best models achieve high accuracy when distinguishing true human and agent behavior. At the same time, we show that predicting finer-grained human assessment of agents' progress towards human-like behavior remains unsolved. Our work takes an important step towards agents that more effectively learn complex human-like behavior.

翻译：发展能学习复杂人类行为的代理人的道路上的一个关键挑战是需要迅速和准确地量化人类相似性。虽然人类对这种行为的评估可以非常准确,但速度和可缩放性有限。我们通过新颖的自动导航图象测试(ANTT)解决这些局限性,该测试学会预测人类相似性对人类的判断。我们展示了我们的自动NTT在复杂的三维环境中执行导航任务的有效性。我们调查了六种分类模型,以揭示最适合这项任务的建筑类型,并根据通过人类NTT收集的数据验证这些结构。我们的最佳模型在辨别真正的人类和代理人行为时达到了很高的准确性。同时,我们显示对代理人走向人类类似行为的进展的精细的人类评估仍未实现。我们的工作向更有效地学习复杂人类行为的代理人迈出了重要的一步。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

深度强化学习策略梯度教程，53页ppt

专知会员服务

184+阅读 · 2020年2月1日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

59+阅读 · 2020年1月25日

【AAAI2020-Oral】自监督时空学习的视频完形程序，Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning

专知会员服务

30+阅读 · 2020年1月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日