Digital sensors are increasingly being used to monitor the change over time of physiological processes in biological health and disease, often using wearable devices. This generates very large amounts of digital sensor data, for which, a consensus on a common storage, exchange and archival data format standard, has yet to be reached. To address this gap, we propose Time Series Data Format (TSDF): a unified, standardized format for storing all types of physiological sensor data, across diverse disease areas. We pose a series of format design criteria and review in detail current storage and exchange formats. When judged against these criteria, we find these current formats lacking, and propose a very simple, intuitive standard for both numerical sensor data and metadata, based on raw binary data and JSON-format text files, for sensor measurements/timestamps and metadata, respectively. By focusing on the common characteristics of diverse biosensor data, we define a set of necessary and sufficient metadata fields for storing, processing, exchanging, archiving and reliably interpreting, multi-channel biological time series data. Our aim is for this standardized format to increase the interpretability and exchangeability of data, thereby contributing to scientific reproducibility in studies where digital biosensor data forms a key evidence base.
翻译:为了弥补这一差距,我们提议采用时间序列数据格式(TSDF):一个统一、标准化的格式,用于储存各种疾病地区的所有类型的生理传感器数据;我们提出了一系列格式设计标准,并详细审查目前的储存和交换格式;在根据这些标准判断时,我们发现这些格式缺乏数字传感器数据和元数据,并提出了一个非常简单、直观的标准,其依据是原始的二进制数据和JSON-格式文本文件,分别用于传感器测量/时间戳和元数据,从而帮助进行生物数据研究,从而在生物数据的关键形式中提供科学证据。