Conventional works generally employ a two-phase model in which a generator selects the most important pieces, followed by a predictor that makes predictions based on the selected pieces. However, such a two-phase model may incur the degeneration problem where the predictor overfits to the noise generated by a not yet well-trained generator and in turn, leads the generator to converge to a sub-optimal model that tends to select senseless pieces. To tackle this challenge, we propose Folded Rationalization (FR) that folds the two phases of the rationale model into one from the perspective of text semantic extraction. The key idea of FR is to employ a unified encoder between the generator and predictor, based on which FR can facilitate a better predictor by access to valuable information blocked by the generator in the traditional two-phase model and thus bring a better generator. Empirically, we show that FR improves the F1 score by up to 10.3% as compared to state-of-the-art methods.
翻译:常规工程通常使用两阶段模型,让发电机选择最重要的部件,然后用预测器根据选定的部件作出预测。然而,这种两阶段模型可能会产生退化问题,因为预测器与尚未受过良好训练的发电机产生的噪音相适应,而后又导致发电机聚集到一个往往选择无意义的部件的亚最佳模型上。为了应对这一挑战,我们提议以文字语义提取为视角,将理论模型的两个阶段折叠成一个阶段。FR的关键想法是,在发电机和预测器之间使用一个统一的编码器,使FR能够利用传统的两阶段模型中发电机堵塞的宝贵信息促进更好的预测,从而带来更好的生成器。我们巧妙地表明,FR将F1的得分提高到10.3 %, 与最先进的方法相比,将F1的得分提高到10.3 % 。