Machine transliteration, as defined in this paper, is a process of automatically transforming written script of words from a source alphabet into words of another target alphabet within the same language, while preserving their meaning, as well as pronunciation. The main goal of this paper is to present a machine transliteration tool between three common scripts used in low-resource Uzbek language: the old Cyrillic, currently official Latin, and newly announced New Latin alphabets. The tool has been created using a combination of rule-based and fine-tuning approaches. The created tool is available as an open-source Python package, as well as a web-based application including a public API. To our knowledge, this is the first machine transliteration tool that supports the newly announced Latin alphabet of the Uzbek language.
翻译:本文定义的机器转写是一个过程,将文字的文字文字从源字母自动转换成同一语言内另一种目标字母的文字,同时保留其含义和发音。本文的主要目标是在乌兹别克低资源语言中使用的三种通用文字(旧西里尔语、目前正式拉丁语和新宣布的新拉丁字母)之间展示一个机器转写工具。该工具是使用基于规则的和精细调整方法的组合创建的。创建的工具作为开放源Python软件包以及包括公共API在内的网上应用程序提供。据我们所知,这是第一个支持新宣布的乌兹别克语拉丁字母的机器转写工具。
Alphabet is mostly a collection of companies. This newer Google is a bit slimmed down, with the companies that are pretty far afield of our main internet products contained in Alphabet instead.https://abc.xyz/