StorageXTuner：一种基于LLM智能体驱动的异构存储系统自动调优框架 (StorageXTuner: An LLM Agent-Driven Automatic Tuning Framework for Heterogeneous Storage Systems)

Automatically configuring storage systems is hard: parameter spaces are large and conditions vary across workloads, deployments, and versions. Heuristic and ML tuners are often system specific, require manual glue, and degrade under changes. Recent LLM-based approaches help but usually treat tuning as a single-shot, system-specific task, which limits cross-system reuse, constrains exploration, and weakens validation. We present StorageXTuner, an LLM agent-driven auto-tuning framework for heterogeneous storage engines. StorageXTuner separates concerns across four agents - Executor (sandboxed benchmarking), Extractor (performance digest), Searcher (insight-guided configuration exploration), and Reflector (insight generation and management). The design couples an insight-driven tree search with layered memory that promotes empirically validated insights and employs lightweight checkers to guard against unsafe actions. We implement a prototype and evaluate it on RocksDB, LevelDB, CacheLib, and MySQL InnoDB with YCSB, MixGraph, and TPC-H/C. Relative to out-of-the-box settings and to ELMo-Tune, StorageXTuner reaches up to 575% and 111% higher throughput, reduces p99 latency by as much as 88% and 56%, and converges with fewer trials.

翻译：自动配置存储系统具有挑战性：参数空间庞大，且工作负载、部署环境和系统版本的条件各异。基于启发式或机器学习的调优器通常针对特定系统设计，需要人工干预进行适配，并在环境变化时性能下降。近期基于大语言模型（LLM）的方法有所改进，但通常将调优视为单次、系统特定的任务，这限制了跨系统复用性，约束了参数探索空间，并削弱了验证有效性。本文提出StorageXTuner，一种基于LLM智能体驱动的异构存储引擎自动调优框架。StorageXTuner通过四个智能体实现关注点分离：执行器（沙盒化基准测试）、提取器（性能摘要生成）、搜索器（基于洞察的配置探索）和反思器（洞察生成与管理）。该设计将洞察驱动的树搜索与分层记忆机制相结合，促进经验验证的洞察积累，并采用轻量级检查器防止不安全操作。我们实现了原型系统，并在RocksDB、LevelDB、CacheLib和MySQL InnoDB上使用YCSB、MixGraph及TPC-H/C工作负载进行评估。相较于默认配置和ELMo-Tune方法，StorageXTuner最高可提升575%和111%的吞吐量，将p99延迟降低达88%和56%，且以更少的调优轮次实现收敛。