The CrisisFACTS Track aims to tackle challenges such as multi-stream fact-finding in the domain of event tracking; participants' systems extract important facts from several disaster-related events while incorporating the temporal order. We propose a combination of retrieval, reranking, and the well-known Integer Linear Programming (ILP) and Maximal Marginal Relevance (MMR) frameworks. In the former two modules, we explore various methods including an entity-based baseline, pre-trained and fine-tuned Question Answering systems, and ColBERT. We then use the latter module as an extractive summarization component by taking diversity and novelty criteria into account. The automatic scoring runs show strong results across the evaluation setups but also reveal shortcomings and challenges.
翻译:" 危机分析工具跟踪 " 旨在应对挑战,如事件跟踪领域的多流实况调查;参与者系统从若干与灾害相关的事件中提取重要事实,同时纳入时间顺序;我们提议将检索、重新排序和众所周知的整数线性编程(ILP)和最大边际关联(MMMR)框架结合起来;在前两个模块中,我们探索各种方法,包括基于实体的基准、预先培训和经过微调的问答系统以及ColBERT。然后,我们通过考虑多样性和新颖标准,将后一个模块用作采掘总结部分。自动评分记录显示各评价组合的有力结果,但也揭示了缺点和挑战。