Apache Lucene is a widely popular information retrieval library used to provide search functionality in an extremely wide variety of applications. Naturally, it has to efficiently index and search large number of documents. With non-volatile memory in DIMM form factor (NVDIMM), software now has access to durable, byte-addressable memory with write latency within an order of magnitude of DRAM write latency. In this preliminary article, we present the first reported work on the impact of using NVDIMM on the performance of committing, searching, and near-real time searching in Apache Lucene. We show modest improvements by using NVM but, our empirical study suggests that bigger impact requires redesigning Lucene to access NVM as byte-addressable memory using loads and stores, instead of accessing NVM via the file system.
翻译:Apache Lucene是一个广受欢迎的信息检索图书馆,用于在极其广泛的应用中提供搜索功能。 当然,它必须高效地索引和搜索大量文件。由于DIMM形式要素(NDIM)中的非挥发性内存(NVDIM),软件现在可以在DRAM写缓存的幅度范围内获得耐久的、字节性内存和写缓存。在这个初步文章中,我们首次报告了使用NDIMM对在Apache Lucene进行、搜索和近实时搜索的绩效的影响。我们通过使用NVM来显示一些微小的改进。但是,我们的实证研究表明,更大的影响需要重新设计Lucene,以便使用载荷和仓库,而不是通过档案系统访问NVM。