Assessing breast cancer risk from imaging remains a subjective process, in which radiologists employ simple computer aided detection (CAD) systems or qualitative visual assessment to estimate breast percent density (PD). Machine learning (ML) models have become the most promising way to quantify breast cancer risk for early, accurate, and equitable diagnoses, but training such models in medical research is often restricted to small, single-institution data. Since patient demographics and imaging characteristics may vary considerably across imaging sites, models trained on single-institution data tend not to generalize well. In response to this problem, MammoDL is proposed, an open-source software tool that leverages a U-Net architecture to accurately estimate breast PD and complexity from mammography. With the Open Federated Learning (OpenFL) library, this solution enables secure training on datasets across multiple institutions. MammoDL is a leaner, more flexible model than its predecessors, boasting improved generalization due to federation-enabled training on larger, more representative datasets.
翻译:评估成像的乳腺癌风险仍是一个主观过程,放射学家采用简单的计算机辅助检测系统或定性直观评估来估计乳腺密度(PD),机器学习模式已成为最有希望的方法,可以对早期、准确和公平诊断的乳腺癌风险进行量化,但医学研究培训模式往往局限于小型的单一机构数据。由于病人人口和成像特征在成像地点之间可能有很大差异,因此,经过培训的单一机构数据模型往往不会很好地概括。针对这一问题,提出了MammoDL(MammoDL)(MammoDL)(MammoDL),这是一个开放源软件工具,利用一个U-Net结构来精确估计乳腺PD和乳房X光摄影的复杂程度。与开放联邦学习(OpenFL)图书馆(OpenFL)(Open Fl(Ople)图书馆(Open Development)一起,这一解决方案能够确保多个机构对数据集进行安全的培训。MmmoDL(Onal)是一个较精细的模型,比其前身更灵活的模型,它更灵活,并称,由于在更大、更具代表性的数据集上联邦辅助的训练而改进了通用的通用的通用的通用培训而改进了一般化。