Artificial Intelligence and digital health have the potential to transform global health. However, having access to representative data to test and validate algorithms in realistic production environments is essential. We introduce HealthSyn, an open-source synthetic data generator of user behavior for testing reinforcement learning algorithms in the context of mobile health interventions. The generator utilizes Markov processes to generate diverse user actions, with individual user behavioral patterns that can change in reaction to personalized interventions (i.e., reminders, recommendations, and incentives). These actions are translated into actual logs using an ML-purposed data schema specific to the mobile health application functionality included with HealthKit, and open-source SDK. The logs can be fed to pipelines to obtain user metrics. The generated data, which is based on real-world behaviors and simulation techniques, can be used to develop, test, and evaluate, both ML algorithms in research and end-to-end operational RL-based intervention delivery frameworks.
翻译:人工智能和数字健康具有改变全球健康的潜力。然而,获得具有代表性的数据以测试和验证现实生产环境中的算法至关重要。我们引入了HealthSyn,这是用户行为的一个开放源的合成数据生成器,用于在移动保健干预措施中测试强化学习算法。生成器利用Markov程序产生不同的用户行动,每个用户的行为模式可以因个人化干预措施(即催复通知、建议和激励措施)而改变。这些行动被转化为实际日志,使用与HealthKit和开放源SDK等移动保健应用功能特有的ML目的数据模型。这些日志可以输入管道,以获取用户的量度。生成的数据基于现实世界行为和模拟技术,可用于开发、测试和评价研究和终端到终端操作RL干预交付框架的ML算法。</s>