Assessing breast cancer risk from imaging remains a subjective process, in which radiologists employ computer aided detection (CAD) systems or qualitative visual assessment to estimate breast percent density (PD). More advanced machine learning (ML) models have become the most promising way to quantify breast cancer risk for early, accurate, and equitable diagnoses, but training such models in medical research is often restricted to small, single-institution data. Since patient demographics and imaging characteristics may vary considerably across imaging sites, models trained on single-institution data tend not to generalize well. In response to this problem, MammoDL is proposed, an open-source software tool that leverages UNet architecture to accurately estimate breast PD and complexity from digital mammography (DM). With the Open Federated Learning (OpenFL) library, this solution enables secure training on datasets across multiple institutions. MammoDL is a leaner, more flexible model than its predecessors, boasting improved generalization due to federation-enabled training on larger, more representative datasets.
翻译:评估成像的乳腺癌风险仍是一个主观过程,放射科医生采用计算机辅助检测系统或定性直观评估来估计乳腺密度(PD),更先进的机器学习模型已成为对早期、准确和公平诊断的乳腺癌风险进行量化的最有希望的方法,但是在医学研究中培训这种模型往往局限于小型的单一机构数据,因为病人人口和成像特征在成像地点之间可能有很大差异,因此,经过培训的单一机构数据模型往往不会很好地概括。 针对这一问题,提议了MammoDL,这是一个开放源软件工具,利用UNet结构来精确估计乳腺PD和数字乳房X线照相(DM)的复杂程度。与开放联邦学习图书馆(Open Flex)一起,这一解决方案能够确保多家机构进行数据集培训。MammoDL是一个较精干、较灵活的模型,它比其前几家更灵活,由于在更大、更具代表性的数据集上进行联邦化的培训而改进了通用性。