Automatic Text Summarization (ATS) is becoming relevant with the growth of textual data; however, with the popularization of public large-scale datasets, some recent machine learning approaches have focused on dense models and architectures that, despite producing notable results, usually turn out in models difficult to interpret. Given the challenge behind interpretable learning-based text summarization and the importance it may have for evolving the current state of the ATS field, this work studies the application of two modern Generalized Additive Models with interactions, namely Explainable Boosting Machine and GAMI-Net, to the extractive summarization problem based on linguistic features and binary classification.
翻译:随着文本数据的增长,自动文本摘要(ATS)变得具有相关性;然而,随着公共大规模数据集的普及,最近一些机器学习方法侧重于密集模式和结构,尽管这些模式和结构产生了显著的成果,但往往在难以解释的模型中出现。鉴于可解释的基于学习的文本摘要(ATS)的挑战及其对苯丙胺类兴奋剂领域当前状态演变的重要性,这项工作研究了两种现代通用Additive模型与互动的应用,即可解释的推力机和GAMI-Net,对基于语言特征和二分法分类的采掘合成问题的应用。