Agents are now used widely in the process of software development, but building production-ready software engineering agents is a complex task. Deploying software agents effectively requires flexibility in implementation and experimentation, reliable and secure execution, and interfaces for users to interact with agents. In this paper, we present the OpenHands Software Agent SDK, a toolkit for implementing software development agents that satisfy these desiderata. This toolkit is a complete architectural redesign of the agent components of the popular OpenHands framework for software development agents, which has 64k+ GitHub stars. To achieve flexibility, we design a simple interface for implementing agents that requires only a few lines of code in the default case, but is easily extensible to more complex, full-featured agents with features such as custom tools, memory management, and more. For security and reliability, it delivers seamless local-to-remote execution portability, integrated REST/WebSocket services. For interaction with human users, it can connect directly to a variety of interfaces, such as visual workspaces (VS Code, VNC, browser), command-line interfaces, and APIs. Compared with existing SDKs from OpenAI, Claude, and Google, OpenHands uniquely integrates native sandboxed execution, lifecycle control, model-agnostic multi-LLM routing, and built-in security analysis. Empirical results on SWE-Bench Verified and GAIA benchmarks demonstrate strong performance. Put together, these elements allow the OpenHands Software Agent SDK to provide a practical foundation for prototyping, unlocking new classes of custom applications, and reliably deploying agents at scale.
翻译:智能体当前已广泛应用于软件开发流程中,但构建生产就绪的软件工程智能体是一项复杂的任务。有效部署软件智能体需要实现与实验的灵活性、可靠且安全的执行环境,以及用户与智能体交互的接口。本文提出OpenHands软件智能体SDK,这是一个用于实现满足上述需求的软件开发智能体的工具包。该工具包是对广受欢迎的OpenHands软件开发智能体框架(在GitHub上拥有超过6.4万星标)中智能体组件的完整架构重设计。为实现灵活性,我们设计了一个简洁的智能体实现接口,默认情况下仅需数行代码即可完成,同时可轻松扩展为支持自定义工具、内存管理等功能的复杂全功能智能体。在安全性与可靠性方面,该SDK提供无缝的本地到远程执行可移植性、集成的REST/WebSocket服务。为支持与人类用户交互,它能直接连接多种接口,如可视化工作空间(VS Code、VNC、浏览器)、命令行界面及API。与OpenAI、Claude和Google的现有SDK相比,OpenHands独特地集成了原生沙箱执行、生命周期控制、模型无关的多LLM路由及内置安全分析功能。在SWE-Bench Verified和GAIA基准测试上的实证结果展现了卓越性能。综合而言,这些特性使OpenHands软件智能体SDK能够为原型设计、解锁新型定制化应用及大规模可靠部署智能体提供实用基础。