教授有差异的在职学习者 (Teaching an Active Learner with Contrastive Examples)

We study the problem of active learning with the added twist that the learner is assisted by a helpful teacher. We consider the following natural interaction protocol: At each round, the learner proposes a query asking for the label of an instance $x^q$, the teacher provides the requested label $\{x^q, y^q\}$ along with explanatory information to guide the learning process. In this paper, we view this information in the form of an additional contrastive example ($\{x^c, y^c\}$) where $x^c$ is picked from a set constrained by $x^q$ (e.g., dissimilar instances with the same label). Our focus is to design a teaching algorithm that can provide an informative sequence of contrastive examples to the learner to speed up the learning process. We show that this leads to a challenging sequence optimization problem where the algorithm's choices at a given round depend on the history of interactions. We investigate an efficient teaching algorithm that adaptively picks these contrastive examples. We derive strong performance guarantees for our algorithm based on two problem-dependent parameters and further show that for specific types of active learners (e.g., a generalized binary search learner), the proposed teaching algorithm exhibits strong approximation guarantees. Finally, we illustrate our bounds and demonstrate the effectiveness of our teaching framework via two numerical case studies.

翻译：我们研究积极学习的问题,因为学习者得到一名有帮助的教师的帮助。我们考虑以下自然互动协议:在每回合中,学习者提出一个询问,要求标出美元xQ$的标签,教师提供要求的标签$xqqq,yqQ$,以及解释性信息,以指导学习过程。在本文件中,我们以另一个对比性实例($xxxc, yc ⁇ $)的形式看待这一信息,其中从一个受美元限制的套件(例如,与同一标签不同的例子)。我们的重点是设计一种教学算法,该算法能够向学习者提供一系列内容丰富的对比性实例,以加速学习过程。我们表明,这会导致一个具有挑战性的序列优化问题,因为一个特定回合的算法的选择取决于互动的历史。我们研究了一种适应性强的教学算法。我们根据两个依赖问题的参数(例如,与同一标签不同的例子不同的例子)来获得很强的业绩保证。我们的重点是设计一种教学方法,通过强健的学习模式(e.g.