This paper aims to explore fundamental questions in the era when AI coding assistants like GitHub Copilot are widely adopted: what do developers truly value and criticize in AI coding assistants, and what does this reveal about their needs and expectations in real-world software development? Unlike previous studies that conduct observational research in controlled and simulated environments, we analyze extensive, first-hand user reviews of AI coding assistants, which capture developers' authentic perspectives and experiences drawn directly from their actual day-to-day work contexts. We identify 1,085 AI coding assistants from the Visual Studio Code Marketplace. Although they only account for 1.64% of all extensions, we observe a surge in these assistants: over 90% of them are released within the past two years. We then manually analyze the user reviews sampled from 32 AI coding assistants that have sufficient installations and reviews to construct a comprehensive taxonomy of user concerns and feedback about these assistants. We manually annotate each review's attitude when mentioning certain aspects of coding assistants, yielding nuanced insights into user satisfaction and dissatisfaction regarding specific features, concerns, and overall tool performance. Built on top of the findings-including how users demand not just intelligent suggestions but also context-aware, customizable, and resource-efficient interactions-we propose five practical implications and suggestions to guide the enhancement of AI coding assistants that satisfy user needs.
翻译:本文旨在探讨AI编程助手(如GitHub Copilot)被广泛采用的时代中的基本问题:开发者究竟看重和批评AI编程助手的哪些方面?这又揭示了他们在实际软件开发中的哪些需求和期望?与以往在受控模拟环境中进行观察性研究不同,我们分析了大量关于AI编程助手的一手用户评论,这些评论直接捕捉了开发者从其真实日常工作场景中获得的真实观点与体验。我们从Visual Studio Code市场中识别出1,085个AI编程助手。尽管它们仅占所有扩展的1.64%,但我们观察到此类助手数量激增:其中超过90%是在过去两年内发布的。随后,我们从32个安装量和评论数充足的AI编程助手中抽样选取用户评论进行人工分析,构建了关于这些助手的用户关注点与反馈的全面分类体系。我们手动标注了每条评论在提及编程助手特定方面时所持的态度,从而获得了关于用户对具体功能、关注点及整体工具性能的满意与不满意情况的细致洞察。基于研究发现——包括用户不仅需要智能建议,还要求情境感知、可定制且资源高效的交互——我们提出了五项实践启示与建议,以指导改进AI编程助手,从而更好地满足用户需求。