Despite recent explosion in research interests, in-context learning and the precise impact of the quality of demonstrations remain elusive. While, based on current literature, it is expected that in-context learning shares a similar mechanism to supervised learning, Min et al. (2022) recently reported that, surprisingly, input-label correspondence is less important than other aspects of prompt demonstrations. Inspired by this counter-intuitive observation, we re-examine the importance of ground truth labels on in-context learning from diverse and statistical points of view. With the aid of the newly introduced metrics, i.e., Ground-truth Label Effect Ratio (GLER), demo-gain, and label sensitivity, we find that the impact of the correct input-label matching can vary according to different configurations. Expanding upon the previous key finding on the role of demonstrations, the complementary and contrastive results suggest that one might need to take more care when estimating the impact of each component in in-context learning demonstrations.
翻译:尽管最近出现了研究兴趣的爆炸,但内文学习和示威质量的确切影响仍然难以捉摸,虽然根据目前的文献,预期内文学习与监督学习有着类似的机制,但Min等人(2022年)最近报告说,令人惊讶的是,输入标签的通信不如迅速示威的其他方面重要。在这种反直觉的观察的启发下,我们重新审查了从不同和统计角度学习内文的地面真相标签的重要性。在新引入的参数(即地面真相拉贝尔效应比率(GLER)、暂停竞争和标签敏感性)的帮助下,我们发现正确的输入标签匹配的影响可能因不同的配置而不同。根据以前关于示威作用的关键调查结果,补充和对比的结果表明,在估计文字学习演示中每个组成部分的影响时,可能需要更加谨慎。