Many real-world prediction tasks have outcome variables that have characteristic heavy-tail distributions. Examples include copies of books sold, auction prices of art pieces, demand for commodities in warehouses, etc. By learning heavy-tailed distributions, "big and rare" instances (e.g., the best-sellers) will have accurate predictions. Most existing approaches are not dedicated to learning heavy-tailed distribution; thus, they heavily under-predict such instances. To tackle this problem, we introduce Learning to Place (L2P), which exploits the pairwise relationships between instances for learning. In its training phase, L2P learns a pairwise preference classifier: is instance A > instance B? In its placing phase, L2P obtains a prediction by placing the new instance among the known instances. Based on its placement, the new instance is then assigned a value for its outcome variable. Experiments on real data show that L2P outperforms competing approaches in terms of accuracy and ability to reproduce heavy-tailed outcome distribution. In addition, L2P provides an interpretable model by placing each predicted instance in relation to its comparable neighbors. Interpretable models are highly desirable when lives and treasure are at stake.
翻译:许多现实世界的预测任务都有典型的重尾分发结果变量,例如出售书籍的复制件、艺术品拍卖价格、仓库商品需求等。通过学习重尾分发、“大和稀有”事例(例如,最佳卖方)将有准确的预测。大多数现有办法不是专门用来学习重尾分发;因此,它们严重不足预测;为了解决这一问题,我们引入了“学习到地点”(L2P),利用了各种学习实例之间的对比关系。在培训阶段,L2P学习了对比优惠分类:例A > 例B?在其设置阶段,L2P通过将新事例放在已知事例中而获得预测。根据其位置,新事例随后被赋予了成果变量的价值。对真实数据的实验表明,L2P在准确性和复制重尾发成果分布的能力方面优于相互竞争的方法。此外,L2P提供了一种可解释的模型,将每个预测实例放在与可比较的邻居关系中。