Preparing exact and comprehensive explanations of word meanings is one of the key steps in writing a monolingual dictionary. In the standard methodology, the explanations require an expert lexicographer, who spends a substantial amount of time checking the consistency between the descriptive text and corpus evidence. In the following text, we present a new tool that derives explanations automatically from collective information in very large corpora, particularly from word sketches. We also propose a quantitative evaluation of the constructed explanations, concentrating on explanations of nouns. The methodology is to a certain extent language independent; however, the presented verification is limited to Czech and English. We show that the presented approach makes it possible to create explanations that contain data useful for understanding the word meaning in approximately 90% of cases. However, in many cases, the result requires post-editing to remove redundant information.