Specialized domain knowledge is often necessary to accurately annotate training sets for in-depth analysis, but can be burdensome and time-consuming to acquire from domain experts. This issue arises prominently in automated behavior analysis, in which agent movements or actions of interest are detected from video tracking data. To reduce annotation effort, we present TREBA: a method to learn annotation-sample efficient trajectory embedding for behavior analysis, based on multi-task self-supervised learning. The tasks in our method can be efficiently engineered by domain experts through a process we call "task programming", which uses programs to explicitly encode structured knowledge from domain experts. Total domain expert effort can be reduced by exchanging data annotation time for the construction of a small number of programmed tasks. We evaluate this trade-off using data from behavioral neuroscience, in which specialized domain knowledge is used to identify behaviors. We present experimental results in three datasets across two domains: mice and fruit flies. Using embeddings from TREBA, we reduce annotation burden by up to a factor of 10 without compromising accuracy compared to state-of-the-art features. Our results thus suggest that task programming and self-supervision can be an effective way to reduce annotation effort for domain experts.
翻译:专门领域知识对于准确说明用于深入分析的成套培训往往十分必要,但对于从域专家那里获取领域性知识来说可能十分繁琐和费时。这个问题在自动行为分析中特别突出,因为自动行为分析从视频跟踪数据中检测到有关物剂的移动或行动。为了减少批注努力,我们介绍了TREBA:一种在多任务自我监督的学习基础上,学习批注和展示有效轨迹的方法,用于进行行为分析。我们的方法中的任务可由域专家通过我们称之为“任务编程”的过程来有效地设计。这个程序使用程序来明确编码来自域专家的结构化知识。通过交换数据说明时间来构建少量的编程任务,可以减少全域专家的努力。我们利用行为神经科学的数据来评估这种权衡,其中专门领域知识用来识别行为。我们在以下两个领域的三个数据集中提出了实验结果:小鼠和水果苍蝇。我们从TREBA的嵌入中可以减少批量负担10倍,而不会影响准确性与州域规划工作相比,我们可以减少自己的任务。因此,我们建议可以减少一种自我定位任务。