With an increased interest in the production of personal health technologies designed to track user data (e.g., nutrient intake, step counts), there is now more opportunity than ever to surface meaningful behavioral insights to everyday users in the form of natural language. This knowledge can increase their behavioral awareness and allow them to take action to meet their health goals. It can also bridge the gap between the vast collection of personal health data and the summary generation required to describe an individual's behavioral tendencies. Previous work has focused on rule-based time-series data summarization methods designed to generate natural language summaries of interesting patterns found within temporal personal health data. We examine recurrent, convolutional, and Transformer-based encoder-decoder models to automatically generate natural language summaries from numeric temporal personal health data. We showcase the effectiveness of our models on real user health data logged in MyFitnessPal and show that we can automatically generate high-quality natural language summaries. Our work serves as a first step towards the ambitious goal of automatically generating novel and meaningful temporal summaries from personal health data.
翻译:随着对旨在跟踪用户数据的个人健康技术(例如营养摄入、步数计数等)的制作的兴趣增加,现在比以往有更多机会以自然语言的形式向日常用户展示有意义的行为洞察力。这种知识可以提高他们的行为意识,使他们能够采取行动实现健康目标。它还能够弥合广泛收集个人健康数据与描述个人行为倾向所需的即时生成之间的差距。以前的工作侧重于基于规则的时间序列数据汇总方法,旨在生成时间个人健康数据中发现的有趣模式的自然语言摘要。我们检查经常、动态和基于变换器的编码器解码模型,以便从数字个人健康数据中自动生成自然语言摘要。我们展示了我们在MyFitnessPal登录的实际用户健康数据模型的有效性,并表明我们可以自动生成高质量的自然语言摘要。我们的工作是朝着从个人健康数据中自动生成新颖和有意义的时间摘要的宏伟目标迈出的第一步。