In Grammatical Error Correction, systems are evaluated by the number of errors they correct. However, no one has assessed whether all error types are equally important. We provide and apply a method to quantify the importance of different grammatical error types to humans. We show that some rare error types are considered disturbing, while some common ones are not. This finding bears on how both systems and their evaluation might be improved.