Machine learning is expected to fuel significant improvements in medical care. To ensure that fundamental principles such as beneficence, respect for human autonomy, prevention of harm, justice, privacy, and transparency are respected, medical machine learning applications must be developed responsibly. In this paper, we survey the technical challenges involved in creating medical machine learning systems responsibly and in conformity with existing regulations, as well as possible solutions to address these challenges. We begin by providing a brief overview of existing regulations affecting medical machine learning, showing that properties such as safety, robustness, reliability, privacy, security, transparency, explainability, and nondiscrimination are all demanded already by existing law and regulations - albeit, in many cases, to an uncertain degree. Next, we discuss the underlying technical challenges, possible ways for addressing them, and their respective merits and drawbacks. We notice that distribution shift, spurious correlations, model underspecification, and data scarcity represent severe challenges in the medical context (and others) that are very difficult to solve with classical black-box deep neural networks. Important measures that may help to address these challenges include the use of large and representative datasets and federated learning as a means to that end, the careful exploitation of domain knowledge wherever feasible, the use of inherently transparent models, comprehensive model testing and verification, as well as stakeholder inclusion.
翻译:为了保证现行法律和条例已经要求安全、稳健、可靠、隐私、安全、透明、可解释性和不歧视等基本条件,尽管在许多情况下,程度不确定。接下来,我们讨论潜在的技术挑战、可能的应对方法及其各自的优点和缺点。我们指出,我们注意到,在医疗方面,分配变化、矛盾的相互关系、模型分类和数据短缺是医学(和其他)方面的严重挑战,很难与古典黑箱深层神经网络解决。 可能有助于应对这些挑战的重要措施包括使用大型和有代表性的数据集和硬化学习,作为全面核查的内在手段,作为全面核查的内在工具。