This work presents our approach to train a neural network to detect hate-speech texts in Hindi and Bengali. We also explore how transfer learning can be applied to learning these languages, given that they have the same origin and thus, are similar to some extend. Even though the whole experiment was conducted with low computational power, the obtained result is comparable to the results of other, more expensive, models. Furthermore, since the training data in use is relatively small and the two languages are almost entirely unknown to us, this work can be generalized as an effort to demystify lost or alien languages that no human is capable of understanding.
翻译:这项工作展示了我们培训神经网络以检测印地语和孟加拉语中仇恨言论的方法。 我们还探索了如何将传导学习应用于学习这些语言,因为这些语言的来源相同,因此与某些语言相似。尽管整个实验的计算能力较低,但所获得的结果与其他更昂贵的模式类似。此外,由于使用的培训数据相对较少,而且这两种语言几乎完全不为我们所知,这项工作可以被广泛推广,以努力解开人类无法理解的迷思或外来语言。