This paper presents an AI-driven IoT robotic teleoperation system designed for real-time remote manipulation and intelligent visual monitoring, tailored for smart city applications. The architecture integrates a Flutter-based cross-platform mobile interface with MQTT-based control signaling and WebRTC video streaming via the LiveKit framework. A YOLOv11-nano model is deployed for lightweight object detection, enabling real-time perception with annotated visual overlays delivered to the user interface. Control commands are transmitted via MQTT to an ESP8266-based actuator node, which coordinates multi-axis robotic arm motion through an Arduino Mega2560 controller. The backend infrastructure is hosted on DigitalOcean, ensuring scalable cloud orchestration and stable global communication. Latency evaluations conducted under both local and international VPN scenarios (including Hong Kong, Japan, and Belgium) demonstrate actuator response times as low as 0.2 seconds and total video latency under 1.2 seconds, even across high-latency networks. This low-latency dual-protocol design ensures responsive closed-loop interaction and robust performance in distributed environments. Unlike conventional teleoperation platforms, the proposed system emphasizes modular deployment, real-time AI sensing, and adaptable communication strategies, making it well-suited for smart city scenarios such as remote infrastructure inspection, public equipment servicing, and urban automation. Future enhancements will focus on edge-device deployment, adaptive routing, and integration with city-scale IoT networks to enhance resilience and scalability.
翻译:本文提出一种专为智慧城市应用设计的AI驱动的物联网机器人遥操作系统,用于实现实时远程操控与智能视觉监控。该架构将基于Flutter的跨平台移动界面与基于MQTT的控制信令、通过LiveKit框架实现的WebRTC视频流传输相结合。系统部署了YOLOv11-nano模型进行轻量化目标检测,实现带标注视觉叠加层的实时感知并推送至用户界面。控制指令通过MQTT传输至基于ESP8266的执行器节点,由Arduino Mega2560控制器协调多轴机械臂运动。后端基础设施托管于DigitalOcean平台,确保可扩展的云编排与稳定的全球通信。在本地及国际VPN场景(涵盖香港、日本和比利时)下的延迟评估表明,执行器响应时间最低可达0.2秒,视频总延迟低于1.2秒,即使在高延迟网络中仍保持稳定。这种低延迟双协议设计保障了分布式环境下响应灵敏的闭环交互与鲁棒性能。与传统遥操作平台相比,本系统强调模块化部署、实时AI感知及自适应通信策略,使其特别适用于远程基础设施巡检、公共设备维护及城市自动化等智慧城市场景。未来改进将聚焦于边缘设备部署、自适应路由及与城市级物联网网络的集成,以提升系统韧性与可扩展性。