通过自动在线实验选择生产系统模型 (Model Selection for Production System via Automated Online Experiments)

A challenge that machine learning practitioners in the industry face is the task of selecting the best model to deploy in production. As a model is often an intermediate component of a production system, online controlled experiments such as A/B tests yield the most reliable estimation of the effectiveness of the whole system, but can only compare two or a few models due to budget constraints. We propose an automated online experimentation mechanism that can efficiently perform model selection from a large pool of models with a small number of online experiments. We derive the probability distribution of the metric of interest that contains the model uncertainty from our Bayesian surrogate model trained using historical logs. Our method efficiently identifies the best model by sequentially selecting and deploying a list of models from the candidate set that balance exploration-exploitation. Using simulations based on real data, we demonstrate the effectiveness of our method on two different tasks.

翻译：工业界的机器学习实践者所面临的一项挑战是选择生产中部署的最佳模式。由于模型往往是生产系统的一个中间组成部分,A/B测试等在线控制实验对整个系统的有效性作出最可靠的估计,但由于预算限制,只能对两个或几个模型进行比较。我们提议一个自动化在线试验机制,通过少量在线实验,从大量模型中有效地进行模型选择。我们从利用历史日志培训的Bayesian替代模型中得出包含模型不确定性的利息指标的概率分布。我们的方法有效地确定了最佳模型,按顺序从候选人集中选择和部署一个能够平衡勘探-开发的模型清单。我们利用基于真实数据的模拟,展示了我们方法在两种不同任务上的有效性。

相关内容

Automator

关注 5

Automator是苹果公司为他们的Mac OS X系统开发的一款软件。 只要通过点击拖拽鼠标等操作就可以将一系列动作组合成一个工作流，从而帮助你自动的（可重复的）完成一些复杂的工作。Automator还能横跨很多不同种类的程序，包括：查找器、Safari网络浏览器、iCal、地址簿或者其他的一些程序。它还能和一些第三方的程序一起工作，如微软的Office、Adobe公司的Photoshop或者Pixelmator等。

【KDD2020】图模型信息融合

专知会员服务

39+阅读 · 2020年10月15日

【DeepMind】强化学习教程，83页ppt

专知会员服务

158+阅读 · 2020年8月7日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

专知会员服务

38+阅读 · 2020年5月30日