Most supervised learning models are trained for full automation. However, their predictions are sometimes worse than those by human experts on some specific instances. Motivated by this empirical observation, our goal is to design classifiers that are optimized to operate under different automation levels. More specifically, we focus on convex margin-based classifiers and first show that the problem is NP-hard. Then, we further show that, for support vector machines, the corresponding objective function can be expressed as the difference of two functions f = g - c, where g is monotone, non-negative and {\gamma}-weakly submodular, and c is non-negative and modular. This representation allows a recently introduced deterministic greedy algorithm, as well as a more efficient randomized variant of the algorithm, to enjoy approximation guarantees at solving the problem. Experiments on synthetic and real-world data from several applications in medical diagnosis illustrate our theoretical findings and demonstrate that, under human assistance, supervised learning models trained to operate under different automation levels can outperform those trained for full automation as well as humans operating alone.
翻译:最受监督的学习模式是完全自动化的训练。然而,它们的预测有时比人类专家在某些特定情况下的预测更差。根据这一经验观察,我们的目标是设计最优化的分类方法,在不同自动化水平下运作。更具体地说,我们把重点放在基于边际的分类方法上,首先显示问题在于NP-硬性。然后,我们进一步表明,对于辅助矢量机,相应的客观功能可以表现为两个功能的区别f=g-c,其中g为单体酮、非阴性、微弱的亚模块和c为非负式和模块化。这种表述方式使得最近引入的确定性贪婪算法以及更高效的随机变式能够在解决问题时享有近似保证。医学诊断中若干应用的合成和真实世界数据实验证明了我们的理论结论,并表明,在人类帮助下,经过培训在不同自动化水平下运作的受监督的学习模式可以超越经过充分自动化训练的人,以及单独操作的人。