The development of neural networks and pretraining techniques has spawned many sentence-level tagging systems that achieved superior performance on typical benchmarks. However, a relatively less discussed topic is what if more context information is introduced into current top-scoring tagging systems. Although several existing works have attempted to shift tagging systems from sentence-level to document-level, there is still no consensus conclusion about when and why it works, which limits the applicability of the larger-context approach in tagging tasks. In this paper, instead of pursuing a state-of-the-art tagging system by architectural exploration, we focus on investigating when and why the larger-context training, as a general strategy, can work. To this end, we conduct a thorough comparative study on four proposed aggregators for context information collecting and present an attribute-aided evaluation method to interpret the improvement brought by larger-context training. Experimentally, we set up a testbed based on four tagging tasks and thirteen datasets. Hopefully, our preliminary observations can deepen the understanding of larger-context training and enlighten more follow-up works on the use of contextual information.
翻译:神经网络和训练前技术的发展已产生了许多在典型基准上取得优异业绩的判刑级标记系统,然而,一个相对较少讨论的议题是,如果将更多的背景信息引入目前的最高分级标记系统,那么,如果将更多的背景信息引入目前的最高分级标记系统,那么,如果现有的一些工作试图将标记系统从判决一级转移到文件一级,那么对于它何时和为什么起作用还没有达成共识,从而限制了在标记任务中采用较大型文本方法。在本文中,我们不是通过建筑勘探来采用最先进的标记系统,而是侧重于调查作为一般战略的较高级文本培训何时和为什么能够奏效。为此,我们就四个拟议的背景信息收集聚合器进行了彻底的比较研究,并提出了一种有属性的辅助评价方法,以解释更大分级培训带来的改进。实验时,我们根据四个标记任务和13个数据集建立了一个试验台。希望我们的初步观察能够加深对较大型文字培训的理解,并启发更多关于背景信息使用的后续工作。