Recent advances in Generative Adversarial Networks (GANs) have resulted in its widespread applications to multiple domains. A recent model, IRGAN, applies this framework to Information Retrieval (IR) and has gained significant attention over the last few years. In this focused work, we critically analyze multiple components of IRGAN, while providing experimental and theoretical evidence of some of its shortcomings. Specifically, we identify issues with the constant baseline term in the policy gradients optimization and show that the generator harms IRGAN's performance. Motivated by our findings, we propose two models influenced by self-contrastive estimation and co-training which outperform IRGAN on two out of the three tasks considered.
翻译:最近Generation Adversarial Networks(GANs)的进展导致其广泛应用于多个领域。最近的一个模型,IRGAN,将这一框架应用于信息检索(IR),并在过去几年中引起了很大的注意。在这一重点工作中,我们严格分析IRGAN的多个组成部分,同时提供实验和理论证据,说明其某些缺点。具体地说,我们查明了政策梯度优化中存在固定基准期的问题,并表明产生器伤害了IRGAN的性能。根据我们的调查结果,我们提出了两个受自调估计和联合培训影响的模型,这些模型在所考虑的三项任务中比IRGAN高出了两项。