MS-LATTE: 完成待办任务地点和时间的数据集 (MS-LaTTE: A Dataset of Where and When To-do Tasks are Completed)

Tasks are a fundamental unit of work in the daily lives of people, who are increasingly using digital means to keep track of, organize, triage and act on them. These digital tools -- such as task management applications -- provide a unique opportunity to study and understand tasks and their connection to the real world, and through intelligent assistance, help people be more productive. By logging signals such as text, timestamp information, and social connectivity graphs, an increasingly rich and detailed picture of how tasks are created and organized, what makes them important, and who acts on them, can be progressively developed. Yet the context around actual task completion remains fuzzy, due to the basic disconnect between actions taken in the real world and telemetry recorded in the digital world. Thus, in this paper we compile and release a novel, real-life, large-scale dataset called MS-LaTTE that captures two core aspects of the context surrounding task completion: location and time. We describe our annotation framework and conduct a number of analyses on the data that were collected, demonstrating that it captures intuitive contextual properties for common tasks. Finally, we test the dataset on the two problems of predicting spatial and temporal task co-occurrence, concluding that predictors for co-location and co-time are both learnable, with a BERT fine-tuned model outperforming several other baselines. The MS-LaTTE dataset provides an opportunity to tackle many new modeling challenges in contextual task understanding and we hope that its release will spur future research in task intelligence more broadly.

翻译：任务是人们日常生活中一个基本的工作单位,人们越来越多地使用数字手段跟踪、组织、整理并采取行动。这些数字工具 -- -- 例如任务管理应用程序 -- -- 为研究和理解任务及其与现实世界的联系提供了独特的机会,并通过智能援助,帮助人们更有成效。通过记录文本、时间戳印信息和社会连通图等信号,我们越来越丰富和详细地了解如何创建和安排任务,什么使他们重要,谁在其中采取行动,可以逐步发展。然而,由于在现实世界中采取的行动与数字世界中记录的遥测系统之间基本脱节,这些数字工具(例如任务管理应用程序)为研究和理解任务及其与现实世界的联系提供了一个独特的机会。因此,我们在本文件中汇编和发布一个名为MS-LATTE的大规模数据集,它记录了完成任务的背景的两个核心方面:地点和时间。我们描述了我们的批注框架,并对所收集的数据进行了一些分析,表明它为共同任务收集了直观的背景属性。最后,我们测试了在现实世界中和在数字世界中进行远程测量的行动。我们更接近了在预测空间和时间定位任务基准中的两种问题。我们用一个可测算的日历来预测了未来任务,在预测空间和时间定位上将获得一个选择一个选择。