Social media has enabled people to circulate information in a timely fashion, thus motivating people to post messages seeking help during crisis situations. These messages can contribute to the situational awareness of emergency responders, who have a need for them to be categorised according to information types (i.e. the type of aid services the messages are requesting). We introduce a transformer-based multi-task learning (MTL) technique for classifying information types and estimating the priority of these messages. We evaluate the effectiveness of our approach with a variety of metrics by submitting runs to the TREC Incident Streams (IS) track: a research initiative specifically designed for disaster tweet classification and prioritisation. The results demonstrate that our approach achieves competitive performance in most metrics as compared to other participating runs. Subsequently, we find that an ensemble approach combining disparate transformer encoders within our approach helps to improve the overall effectiveness to a significant extent, achieving state-of-the-art performance in almost every metric. We make the code publicly available so that our work can be reproduced and used as a baseline for the community for future work in this domain.
翻译:社交媒体使人们能够及时传播信息,从而激励人们在危机局势中发布寻求帮助的信息,这些信息有助于应急反应人员了解情况,他们需要根据信息类型(即信息所要求的援助服务类型)进行分类。我们采用了基于变压器的多任务学习技术,对信息类型进行分类,并估计这些信息的优先次序。我们通过向TREC事件流轨道(IS)提交数据来评估我们的方法的有效性:专门设计用于灾害推特分类和优先排序的研究举措。结果显示,我们的方法在大多数指标上取得了与其他参与运行的参数相比的竞争性业绩。随后,我们发现,将不同变压器编码器结合到我们的方法中,有助于大大提高总体效力,几乎在每一指标中都实现最先进的业绩。我们公开提供代码,以便复制我们的工作,并用作社区今后在这一领域工作的基线。