Curriculum learning (CL) is a commonly used machine learning training strategy. However, we still lack a clear theoretical understanding of CL's benefits. In this paper, we study the benefits of CL in the multitask linear regression problem under both structured and unstructured settings. For both settings, we derive the minimax rates for CL with the oracle that provides the optimal curriculum and without the oracle, where the agent has to adaptively learn a good curriculum. Our results reveal that adaptive learning can be fundamentally harder than the oracle learning in the unstructured setting, but it merely introduces a small extra term in the structured setting. To connect theory with practice, we provide justification for a popular empirical method that selects tasks with highest local prediction gain by comparing its guarantees with the minimax rates mentioned above.
翻译:课程学习(CL)是一种常用的机器学习培训策略。 但是,我们仍然缺乏对CL的好处的明确理论理解。 在本文中,我们研究了CL在结构化和非结构化环境下多任务线性回归问题中的好处。 对于这两种环境,我们用提供最佳课程的甲骨文和没有甲骨文的甲骨文来计算CL的迷你运算率,因为代理人必须适应性地学习良好的课程。我们的结果表明,适应性学习可能比在非结构化环境中的甲骨文学习要难得多,但只是在结构化环境中增加了一个小的术语。为了将理论与实践联系起来,我们提供了一种普遍的经验方法的理由,通过比较其保证与上述微麦克斯率来选择地方最高预测收益的任务。