With the rising applications implemented in different domains, it is inevitable to require databases to adopt corresponding appropriate data models to store and exchange data derived from various sources. To handle these data models in a single platform, the community of databases introduces a multi-model database. And many vendors are improving their products from supporting a single data model to being multi-model databases. Although this brings benefits, spending lots of enthusiasm to master one of the multi-model query languages for exploring a database is unfriendly to most users. Therefore, we study using keyword searches as an alternative way to explore and query multi-model databases. In this paper, we attempt to utilize quantum physics's probabilistic formalism to bring the problem into vector spaces and represent events (e.g., words) as subspaces. Then we employ a density matrix to encapsulate all the information over these subspaces and use density matrices to measure the divergence between query and candidate answers for finding top-\textit{k} the most relevant results. In this process, we propose using pattern mining to identify compounds for improving accuracy and using dimensionality reduction for reducing complexity. Finally, empirical experiments demonstrate the performance superiority of our approaches over the state-of-the-art approaches.
翻译:随着不同领域应用的不断增多,不可避免地要求数据库采用相应的适当数据模型来储存和交换来自不同来源的数据。为了在一个平台上处理这些数据模型,数据库群引入了一个多模式数据库。许多供应商正在改进其产品,从支持一个单一数据模型到成为多模式数据库。虽然这样做带来了好处,但花大量热情来掌握一个用于探索数据库的多模式查询语言对于大多数用户来说是不友好的。因此,我们研究使用关键词搜索作为探索和查询多模式数据库的替代方法。在本文件中,我们试图利用量子物理的概率化形式主义将问题带入矢量空间并代表事件(如文字)作为子空间。然后我们用一个密度矩阵来汇总这些子空间上的所有信息,并利用密度矩阵来衡量查询和候选答案之间的差异,以找到最相关的结果。在此过程中,我们建议使用模式采矿来查明化合物,以提高准确性,并使用量子化来减少复杂性。最后,我们的经验实验显示了我们方法在状态上的表现优势。