We introduce a family of deep-learning architectures for inter-sentence relation extraction, i.e., relations where the participants are not necessarily in the same sentence. We apply these architectures to an important use case in the biomedical domain: assigning biological context to biochemical events. In this work, biological context is defined as the type of biological system within which the biochemical event is observed. The neural architectures encode and aggregate multiple occurrences of the same candidate context mentions to determine whether it is the correct context for a particular event mention. We propose two broad types of architectures: the first type aggregates multiple instances that correspond to the same candidate context with respect to event mention before emitting a classification; the second type independently classifies each instance and uses the results to vote for the final class, akin to an ensemble approach. Our experiments show that the proposed neural classifiers are competitive and some achieve better performance than previous state of the art traditional machine learning methods without the need for feature engineering. Our analysis shows that the neural methods particularly improve precision compared to traditional machine learning classifiers and also demonstrates how the difficulty of inter-sentence relation extraction increases as the distance between the event and context mentions increase.
翻译:我们引入了一套用于相互判决关系提取的深层次学习结构,即参与者不一定在同一个句子中的关系。我们将这些结构应用于生物医学领域的一个重要用途案例:生物化学事件的生物背景。在这项工作中,生物背景被定义为观察生化事件的生物系统类型。同一候选背景的神经结构编码和累积多发事件,用来确定它是否属于某个特定事件的正确背景。我们建议了两种广泛的结构类型:第一类综合了与发布分类之前提到的事件对应的候选背景的多个实例;第二类独立分类了每个案例并使用结果来投票最终类别,类似于共同方法。我们的实验表明,拟议的神经分类器具有竞争力,有些比以往的艺术传统机器学习方法取得更好的性能,而不需要特征工程。我们的分析表明,与传统的机器学习分类器相比,神经方法尤其更加精确,并且还表明,由于事件和背景背景之间距离的增加,相互判决的提取的难度是如何增加的。