Contemporary artificial neural networks (ANN) are trained end-to-end, jointly learning both features and classifiers for the task of interest. Though enormously effective, this paradigm imposes significant costs in assembling annotated task-specific datasets and training large-scale networks. We propose to decouple feature learning from downstream lung ultrasound tasks by introducing an auxiliary pre-task of visual biomarker classification. We demonstrate that one can learn an informative, concise, and interpretable feature space from ultrasound videos by training models for predicting biomarker labels. Notably, biomarker feature extractors can be trained from data annotated with weak video-scale supervision. These features can be used by a variety of downstream Expert models targeted for diverse clinical tasks (Diagnosis, lung severity, S/F ratio). Crucially, task-specific expert models are comparable in accuracy to end-to-end models directly trained for such target tasks, while being significantly lower cost to train.
翻译:当代人造神经网络(ANN)是经过培训的端对端,共同学习有关任务的特点和分类方法,这一模式虽然非常有效,但在收集附加说明的任务特定数据集和培训大型网络方面成本巨大,我们提议通过引入视觉生物标志分类的辅助前期任务,将特征学习与下游肺超声波任务脱钩。我们证明,通过生物标志标签预测培训模型,可以从超声波视频中学习一个内容丰富、简洁和可解释的特征空间。值得注意的是,生物标志特征提取器可以从附加说明的数据中培训,而视频规模监督薄弱。这些特征可以被各种针对不同临床任务的下游专家模型(诊断、肺重度、S/F比率)使用。关键是,具体任务专家模型在准确性方面可以与直接为此类目标任务培训的端对端模型相比,同时培训成本要低得多。