We present Thistle, a fully functional vector database. Thistle is an entry into the domain of latent knowledge use in answering search queries, an ongoing research topic at both start-ups and search engine companies. We implement Thistle with several well-known algorithms, and benchmark results on the MS MARCO dataset. Results help clarify the latent knowledge domain as well as the growing Rust ML ecosystem.
翻译:我们介绍了Thistle,一个完全功能的向量数据库。Thistle是回答搜索查询中的潜在知识使用的一个入口,这是一直以来的研究话题,不仅在初创公司和搜索引擎公司中,我们使用几个知名算法实现了Thistle,并在MS MARCO数据集上进行了基准测试。结果有助于澄清潜在知识领域以及不断发展的Rust ML生态系统。