Literary artefacts are generally indexed and searched based on titles, meta data and keywords over the years. This searching and indexing works well when user/reader already knows about that particular creative textual artefact or document. This indexing and search hardly takes into account interest and emotional makeup of readers and its mapping to books. When a person is looking for a literary textual artefact, he/she might be looking for not only information but also to seek the joy of reading. In case of literary artefacts, progression of emotions across the key events could prove to be the key for indexing and searching. In this paper, we establish clusters among literary artefacts based on computational relationships among sentiment progressions using intelligent text analysis. We have created a database of 1076 English titles + 20 Marathi titles and also used database http://www.cs.cmu.edu/~dbamman/booksummaries.html with 16559 titles and their summaries. We have proposed Sentiment Progression based Indexing for searching and recommending books. This can be used to create personalized clusters of book titles of interest to readers. The analysis clearly suggests better searching and indexing when we are targeting book lovers looking for a particular type of book or creative artefact. This indexing and searching can find many real-life applications for recommending books.
翻译:文艺作品通常根据这些年来的职称、元数据和关键词进行索引和搜索。当用户/阅读者已经知道特定创造性的文字艺术或文档时,这种搜索和索引工作效果很好。这种索引和搜索几乎没有考虑到读者的兴趣和情感构成以及书籍的映像。当一个人寻找文学文字艺术时,他/她可能不仅寻找信息,而且还寻找阅读的乐趣。在文学手工艺品的情况下,关键活动之间的情感进展可以证明是索引和搜索的关键。在本文中,我们利用智能文本分析建立基于情绪发展之间计算关系的文学作品集群。我们创建了一个1076个英文标题+20 Marathi标题的数据库,并且还使用了数据库 http://www.cs.cmu.edu/~dbamman/bookhackhacket.html 以及16559个标题及其摘要。我们提出了基于搜索和建议书籍的Sentiment Strement Previcion 索引。这可用于为读者创建个人化的书籍标题分类组合。我们的分析清楚地表明了搜索和索引,当我们搜索书籍时,可以找到一个针对读者的真爱和索引。