Despite the numerous successes of machine learning over the past decade (image recognition, decision-making, NLP, image synthesis), self-driving technology has not yet followed the same trend. In this paper, we study the history, composition, and development bottlenecks of the modern self-driving stack. We argue that the slow progress is caused by approaches that require too much hand-engineering, an over-reliance on road testing, and high fleet deployment costs. We observe that the classical stack has several bottlenecks that preclude the necessary scale needed to capture the long tail of rare events. To resolve these problems, we outline the principles of Autonomy 2.0, an ML-first approach to self-driving, as a viable alternative to the currently adopted state-of-the-art. This approach is based on (i) a fully differentiable AV stack trainable from human demonstrations, (ii) closed-loop data-driven reactive simulation, and (iii) large-scale, low-cost data collections as critical solutions towards scalability issues. We outline the general architecture, survey promising works in this direction and propose key challenges to be addressed by the community in the future.
翻译:尽管在过去十年中机械学习取得了许多成功(图像识别、决策、NLP、图像合成),自驾技术尚未跟上同样的趋势。我们研究了现代自驾堆的历史、构成和发展瓶颈。我们争辩说,进展缓慢是由于一些方法,这些方法需要太多手动工程、过分依赖道路测试和车队部署费用高昂。我们注意到,古典堆叠有一些瓶颈,无法达到捕捉稀有事件长期尾声所需的必要规模。为了解决这些问题,我们概述了自动驱动2.0的原则,这是ML第一种自行驱动方法,是目前采用的最新技术的可行替代方法。这一方法的基础是:(一) 完全不同于人类演示的AV堆;(二) 闭路数据驱动的被动模拟,以及(三) 大规模低成本数据收集,作为适应可扩展性问题的关键解决办法。我们概述了总体架构,调查有朝此方向发展的前景,并提出未来社区要解决的关键挑战。