Large language models promise a broad set of capabilities, but when not given a specific objective, they default to milquetoast results such as drafting emails littered with clichés. We demonstrate that inferring the user's in-the-moment objective, then rapidly optimizing for that singular objective, enables LLMs to produce tools, interfaces, and responses that are more responsive and more desirable. We contribute an architecture for automatically inducing just-in-time (JIT) objectives by passively observing user behavior, then steering downstream AI systems through generation and evaluation against this objective. Inducing just-in-time objectives (e.g., "Clarify the abstract's research contribution") enables automatic generation of tools, e.g., tools that critique a draft based on relevant HCI methodologies, anticipate related researchers' reactions, or surface ambiguous terminology. In a series of experiments (N=14, N=205) on participants' own tasks, JIT objectives enable LLM outputs that achieve 66-86% win rates over outputs from typical LLMs, and in-person use sessions (N=17) confirm that JIT objectives produce specialized tools unique to each participant.