Suggesting similar questions for a user query has many applications ranging from reducing search time of users on e-commerce websites, training of employees in companies to holistic learning for students. The use of Natural Language Processing techniques for suggesting similar questions is prevalent over the existing architecture. Mainly two approaches are studied for finding text similarity namely syntactic and semantic, however each has its draw-backs and fail to provide the desired outcome. In this article, a self-learning combined approach is proposed for determining textual similarity that introduces a robust weighted syntactic and semantic similarity index for determining similar questions from a predetermined database, this approach learns the optimal combination of the mentioned approaches for a database under consideration. Comprehensive analysis has been carried out to justify the efficiency and efficacy of the proposed approach over the existing literature.
翻译:建议用户查询的类似问题有许多应用,从减少电子商务网站用户的搜索时间、公司雇员培训到学生整体学习等,在现有结构中,普遍使用自然语言处理技术提出类似问题,主要研究两种方法,以寻找案文相似性,即综合和语义,但每种方法都有缺点,不能提供预期结果。在本条中,提议采用自学综合方法,确定文本相似性,采用一种强有力的加权综合和语义相似性指数,从预定的数据库中确定类似问题,这一方法学习了所提到方法的最佳组合,供审议的数据库使用。已进行全面分析,说明拟议方法对现有文献的效率和效果。