As the computational requirements for machine learning systems and the size and complexity of machine learning frameworks increases, essential framework innovation has become challenging. While computational needs have driven recent compiler, networking, and hardware advancements, utilization of those advancements by machine learning tools is occurring at a slower pace. This is in part due to the difficulties involved in prototyping new computational paradigms with existing frameworks. Large frameworks prioritize machine learning researchers and practitioners as end users and pay comparatively little attention to systems researchers who can push frameworks forward -- we argue that both are equally important stakeholders. We introduce Flashlight, an open-source library built to spur innovation in machine learning tools and systems by prioritizing open, modular, customizable internals and state-of-the-art, research-ready models and training setups across a variety of domains. Flashlight allows systems researchers to rapidly prototype and experiment with novel ideas in machine learning computation and has low overhead, competing with and often outperforming other popular machine learning frameworks. We see Flashlight as a tool enabling research that can benefit widely used libraries downstream and bring machine learning and systems researchers closer together.
翻译:随着机器学习系统的计算要求以及机器学习框架的规模和复杂性的增加,基本的框架创新变得具有挑战性。虽然计算需求驱动了最近的汇编者、网络和硬件进步,但通过机器学习工具利用这些进步的速度较慢。部分原因是在现有框架下新计算模式的原型存在困难。大型框架优先考虑机器学习研究人员和从业人员作为终端用户,相对较少注意能够推动框架向前推进的系统研究人员 -- -- 我们认为两者都是同样重要的利益攸关方。我们引入了闪光灯,这是一个开源图书馆,其建立的目的是通过将开放、模块化、可定制的内部和最先进的、准备研究的模型和训练设置置于不同领域的优先位置,促进机器学习工具和系统的创新。闪光灯使系统研究人员能够在机器学习计算方面快速进行原型和实验,其管理效率较低,与其他广受欢迎的机器学习框架竞争,而且往往比其他广受欢迎。我们认为闪光是一种工具,有助于开展研究,使广泛使用的下游图书馆受益,并使机器学习和系统研究人员更加接近。