An Experience Report on Machine Learning Reproducibility: Guidance for Practitioners and TensorFlow Model Garden Contributors

Vishnu Banna, Akhil Chinnakotla, Zhengxin Yan, Anirudh Vegesana, Naveen Vivek, Kruthi Krishnappa, Wenxin Jiang, Yung-Hsiang Lu, George K. Thiruvathukal, James C. Davis

From arXiv, Technical Report

Machine learning techniques are becoming a fundamental tool for scientific and engineering progress. These techniques are applied in contexts as diverse as astronomy and spam filtering. However, correctly applying these techniques requires careful engineering. Much attention has been paid to the technical potential; relatively little attention has been paid to the software engineering process required to bring research-based machine learning techniques into practical utility. Technology companies have supported the engineering community through machine learning frameworks such as TensorFlow and PyTorch, but the details of how to engineer complex machine learning models in these frameworks have remained hidden. To promote best practices within the engineering community, academic institutions and Google have partnered to launch a Special Interest Group on Machine Learning Models (SIGMODELS) whose goal is to develop exemplary implementations of prominent machine learning models in community locations such as the TensorFlow Model Garden (TFMG). The purpose of this report is to define a process for reproducing a state-of-the-art machine learning model at a level of quality suitable for inclusion in the TFMG. We define the engineering process and elaborate on each step, from paper analysis to model release. We report on our experiences implementing the YOLO model family with a team of 26 student researchers, share the tools we developed, and describe the lessons we learned along the way.

