HeidelTime is one of the most widespread and successful tools for detecting temporal expressions in texts. Since HeidelTime's pattern matching system is based on regular expression, it can be extended in a convenient way. We present such an extension for the German resources of HeidelTime: HeidelTime-EXT . The extension has been brought about by means of observing false negatives within real world texts and various time banks. The gain in coverage is 2.7% or 8.5%, depending on the admitted degree of potential overgeneralization. We describe the development of HeidelTime-EXT, its evaluation on text samples from various genres, and share some linguistic observations. HeidelTime ext can be obtained from https://github.com/texttechnologylab/heideltime.
翻译:海德尔时报是发现文本中时间表达方式的最广泛和最成功的工具之一。 由于海德尔时报模式匹配系统是以正常表达方式为基础的, 它可以以方便的方式扩展。 我们为海德尔时报的德国资源展示了这样的扩展: 海德尔时报- EXT 。 该扩展是通过在现实世界文本和各种时间库中观测虚假的负值来实现的。 覆盖面的增益为2. 7 % 或8.5%, 取决于所承认可能的过度概括程度。 我们描述了海德尔时报的开发情况, 它对各族的文本样本的评估, 并分享一些语言观察。 海德尔时报的转机可从 https://github.com/texttechlab/hedeltime获得。