Respiratory rate (RR) is a clinical metric used to assess overall health and physical fitness. An individual's RR can change from their baseline due to chronic illness symptoms (e.g., asthma, congestive heart failure), acute illness (e.g., breathlessness due to infection), and over the course of the day due to physical exhaustion during heightened exertion. Remote estimation of RR can offer a cost-effective method to track disease progression and cardio-respiratory fitness over time. This work investigates a model-driven approach to estimate RR from short audio segments obtained after physical exertion in healthy adults. Data was collected from 21 individuals using microphone-enabled, near-field headphones before, during, and after strenuous exercise. RR was manually annotated by counting perceived inhalations and exhalations. A multi-task Long-Short Term Memory (LSTM) network with convolutional layers was implemented to process mel-filterbank energies, estimate RR in varying background noise conditions, and predict heavy breathing, indicated by an RR of more than 25 breaths per minute. The multi-task model performs both classification and regression tasks and leverages a mixture of loss functions. It was observed that RR can be estimated with a concordance correlation coefficient (CCC) of 0.76 and a mean squared error (MSE) of 0.2, demonstrating that audio can be a viable signal for approximating RR.
翻译:呼吸机率(RR)是用来评估总体健康和身体健康的一种临床衡量标准,一个人的呼吸机率可因慢性疾病症状(如哮喘、心肺衰竭)、急性疾病(如感染导致的窒息)和在高压期间因体力疲竭而全天期间发生的急性疾病(如感染导致的失眠)而改变其基线;对RR的远距离估计可提供一种低成本有效的方法,用以跟踪疾病进展和长期的心血管呼吸机能;这项工作调查一种由模型驱动的方法,从健康成年人身体施用后获得的短音频段估计RR;在长时间的锻炼之前、期间和之后,利用麦克风、近地耳耳机、急性疾病(如因感染而失去呼吸能力),从21个人收集数据;对RRR进行人工加注,以计算认为的吸入和体力疲竭;对RRR(LSTM)网络进行多任务性长、长和心肺功能的远程记忆(LSTM)网络可以处理中度过滤器能量,在各种背景噪音条件下估计RRRR,并预测沉重的呼吸间隔,每分钟超过25个呼吸信号显示呼吸信号信号信号信号信号信号信号信号信号信号,而观察到的BRBRBRB 。多式模型的模型和正值的计算,可测算算算。