We introduce Mephisto, a framework to make crowdsourcing for research more reproducible, transparent, and collaborative. Mephisto provides abstractions that cover a broad set of task designs and data collection workflows, and provides a simple user experience to make best-practices easy defaults. In this whitepaper we discuss the current state of data collection and annotation in ML research, establish the motivation for building a shared framework to enable researchers to create and open-source data collection and annotation tools as part of their publication, and outline a set of suggested requirements for a system to facilitate these goals. We then step through our resolution in Mephisto, explaining the abstractions we use, our design decisions around the user experience, and share implementation details and where they align with the original motivations. We also discuss current limitations, as well as future work towards continuing to deliver on the framework's initial goals. Mephisto is available as an open source project, and its documentation can be found at www.mephisto.ai.
翻译:我们引入了Mephisto(Mephisto)这个让众包更可复制、更透明、更协作的研究框架。Mephisto提供了包含广泛任务设计和数据收集工作流程的抽象信息,并提供了简单用户经验,使最佳做法容易违约。在本白皮书中,我们讨论了数据收集和ML研究注释的现状,确定了建立共同框架的动机,使研究人员能够创建和开放源数据收集和批注工具,作为其出版物的一部分,并概述了促进这些目标的系统的一系列拟议要求。我们随后在Mephisto(Mephisto)中通过我们的决议,解释了我们使用的抽象概念、我们围绕用户经验的设计决定,并分享了实施细节及其与原始动机保持一致之处。我们还讨论了当前的局限性,以及今后继续实现框架初始目标的工作。Mephisto作为开放源项目提供,其文件可在www.mephisto.ai查阅。