Specialized documentation techniques have been developed to communicate key facts about machine-learning (ML) systems and the datasets and models they rely on. Techniques such as Datasheets, FactSheets, and Model Cards have taken a mainly descriptive approach, providing various details about the system components. While the above information is essential for product developers and external experts to assess whether the ML system meets their requirements, other stakeholders might find it less actionable. In particular, ML engineers need guidance on how to mitigate potential shortcomings in order to fix bugs or improve the system's performance. We survey approaches that aim to provide such guidance in a prescriptive way. We further propose a preliminary approach, called Method Cards, which aims to increase the transparency and reproducibility of ML systems by providing prescriptive documentation of commonly-used ML methods and techniques. We showcase our proposal with an example in small object detection, and demonstrate how Method Cards can communicate key considerations for model developers. We further highlight avenues for improving the user experience of ML engineers based on Method Cards.
翻译:开发了专门的文件技术,以交流机器学习系统及其依赖的数据集和模型的关键事实; 数据表、事实喜好和模型卡等技术主要采取描述性办法,提供关于系统组成部分的各种细节; 虽然上述信息对于产品开发者和外部专家评估ML系统是否符合其要求至关重要,但其他利益攸关方可能发现其行动不那么可行; 特别是,ML工程师需要指导如何减少潜在缺陷,以便纠正错误或改进系统性能; 我们调查旨在以规范方式提供这种指导的方法; 我们进一步提出一种初步办法,称为方法卡,目的是通过提供常用ML方法和技术的规范性文件,提高ML系统的透明度和可复制性; 我们以小型物体探测为例,展示我们的提案,并展示方法卡如何向模型开发者传达关键考虑因素; 我们进一步强调改进ML工程师基于方法卡的用户经验的途径。