Building classifiers on multiple domains is a practical problem in the real life. Instead of building classifiers one by one, multi-domain learning (MDL) simultaneously builds classifiers on multiple domains. MDL utilizes the information shared among the domains to improve the performance. As a supervised learning problem, the labeling effort is still high in MDL problems. Usually, this high labeling cost issue could be relieved by using active learning. Thus, it is natural to utilize active learning to reduce the labeling effort in MDL, and we refer this setting as multi-domain active learning (MDAL). However, there are only few works which are built on this setting. And when the researches have to face this problem, there is no off-the-shelf solutions. Under this circumstance, combining the current multi-domain learning models and single-domain active learning strategies might be a preliminary solution for MDAL problem. To find out the potential of this preliminary solution, a comparative study over 5 models and 4 selection strategies is made in this paper. To the best of our knowledge, this is the first work provides the formal definition of MDAL. Besides, this is the first comparative work for MDAL problem. From the results, the Multinomial Adversarial Networks (MAN) model with a simple best vs second best (BvSB) uncertainty strategy shows its superiority in most cases. We take this combination as our off-the-shelf recommendation for the MDAL problem.
翻译:在多个领域建立分类系统是现实生活中的一个实际问题。 多领域学习( MDL) 并不是逐个建立分类系统,而是同时在多个领域建立分类系统。 MDL 利用各领域共享的信息来提高绩效。 作为一个监管学习问题, 标签工作在MDL问题中仍然很突出。 通常, 使用积极学习可以缓解这个高标签成本问题。 因此, 利用积极学习来减少MDL 标签的标签工作是自然的, 我们把这个设置称为多领域积极学习。 但是, 仅在此背景下建立很少的工作。 当研究必须面对这一问题时, 没有现成的解决方案。 在此情况下, 将目前的多领域学习模式和单领域积极学习战略结合起来, 可能是MDLL问题的初步解决办法。 要找到这一初步解决方案的潜力, 将5种模式和4种选择战略进行比较研究, 本文中我们最了解的是, 这是第一个正式定义MDAL- 的组合。 此外,, 将MDSB 的首个比较性战略作为我们MAL 的第二个模式案例。