Automatically configuring storage systems is hard: parameter spaces are large and conditions vary across workloads, deployments, and versions. Heuristic and ML tuners are often system specific, require manual glue, and degrade under changes. Recent LLM-based approaches help but usually treat tuning as a single-shot, system-specific task, which limits cross-system reuse, constrains exploration, and weakens validation. We present StorageXTuner, an LLM agent-driven auto-tuning framework for heterogeneous storage engines. StorageXTuner separates concerns across four agents - Executor (sandboxed benchmarking), Extractor (performance digest), Searcher (insight-guided configuration exploration), and Reflector (insight generation and management). The design couples an insight-driven tree search with layered memory that promotes empirically validated insights and employs lightweight checkers to guard against unsafe actions. We implement a prototype and evaluate it on RocksDB, LevelDB, CacheLib, and MySQL InnoDB with YCSB, MixGraph, and TPC-H/C. Relative to out-of-the-box settings and to ELMo-Tune, StorageXTuner reaches up to 575% and 111% higher throughput, reduces p99 latency by as much as 88% and 56%, and converges with fewer trials.
翻译:自动配置存储系统具有挑战性:参数空间庞大,且工作负载、部署环境和系统版本的条件各异。基于启发式或机器学习的调优器通常针对特定系统设计,需要人工干预进行适配,并在环境变化时性能下降。近期基于大语言模型(LLM)的方法有所改进,但通常将调优视为单次、系统特定的任务,这限制了跨系统复用性,约束了参数探索空间,并削弱了验证有效性。本文提出StorageXTuner,一种基于LLM智能体驱动的异构存储引擎自动调优框架。StorageXTuner通过四个智能体实现关注点分离:执行器(沙盒化基准测试)、提取器(性能摘要生成)、搜索器(基于洞察的配置探索)和反思器(洞察生成与管理)。该设计将洞察驱动的树搜索与分层记忆机制相结合,促进经验验证的洞察积累,并采用轻量级检查器防止不安全操作。我们实现了原型系统,并在RocksDB、LevelDB、CacheLib和MySQL InnoDB上使用YCSB、MixGraph及TPC-H/C工作负载进行评估。相较于默认配置和ELMo-Tune方法,StorageXTuner最高可提升575%和111%的吞吐量,将p99延迟降低达88%和56%,且以更少的调优轮次实现收敛。