Access to longitudinal, individual-level data on work-life balance and wellbeing is limited by privacy, ethical, and logistical constraints. This poses challenges for reproducible research, methodological benchmarking, and education in domains such as stress modeling, behavioral analysis, and machine learning. We introduce FLOW, a synthetic longitudinal dataset designed to model daily interactions between workload, lifestyle behaviors, and wellbeing. FLOW is generated using a rule-based, feedback-driven simulation that produces coherent temporal dynamics across variables such as stress, sleep, mood, physical activity, and body weight. The dataset simulates 1{,}000 individuals over a two-year period with daily resolution and is released as a publicly available resource. In addition to the static dataset, we describe a configurable data generation tool that enables reproducible experimentation under adjustable behavioral and contextual assumptions. FLOW is intended as a controlled experimental environment rather than a proxy for observed human populations, supporting exploratory analysis, methodological development, and benchmarking where real-world data are inaccessible.
翻译:获取关于工作与生活平衡及幸福感的纵向个体层面数据受到隐私、伦理和实际条件的限制。这给压力建模、行为分析和机器学习等领域的可重复研究、方法基准测试和教育带来了挑战。我们提出了FLOW,这是一个合成纵向数据集,旨在模拟工作量、生活方式行为和幸福感之间的日常互动。FLOW采用基于规则的反馈驱动模拟生成,在压力、睡眠、情绪、体力活动和体重等变量之间产生连贯的时间动态。该数据集以每日分辨率模拟了1,000名个体在两年期间的情况,并作为公开资源发布。除了静态数据集外,我们还描述了一个可配置的数据生成工具,该工具能够在可调整的行为和情境假设下实现可重复的实验。FLOW旨在作为一个受控的实验环境,而非观测人群的替代品,以支持在无法获取真实世界数据时的探索性分析、方法开发和基准测试。