As deep neural networks (DNNs) prove their importance and feasibility, more and more DNN-based apps, such as detection and classification of objects, have been developed and deployed on autonomous vehicles (AVs). To meet their growing expectations and requirements, AVs should "optimize" use of their limited onboard computing resources for multiple concurrent in-vehicle apps while satisfying their timing requirements (especially for safety). That is, real-time AV apps should share the limited on-board resources with other concurrent apps without missing their deadlines dictated by the frame rate of a camera that generates and provides input images to the apps. However, most, if not all, of existing DNN solutions focus on enhancing the concurrency of their specific hardware without dynamically optimizing/modifying the DNN apps' resource requirements, subject to the number of running apps, owing to their high computational cost. To mitigate this limitation, we propose DynaMIX (Dynamic MIXed-precision model construction), which optimizes the resource requirement of concurrent apps and aims to maximize execution accuracy. To realize a real-time resource optimization, we formulate an optimization problem using app performance profiles to consider both the accuracy and worst-case latency of each app. We also propose dynamic model reconfiguration by lazy loading only the selected layers at runtime to reduce the overhead of loading the entire model. DynaMIX is evaluated in terms of constraint satisfaction and inference accuracy for a multi-tasking system and compared against state-of-the-art solutions, demonstrating its effectiveness and feasibility under various environmental/operating conditions.
翻译:由于深层神经网络(DNN)证明了其重要性和可行性,越来越多的DNN软件,例如物体的探测和分类,已经开发并安装在自主车辆(AV)上。为满足其日益增长的期望和要求,AV应“优化”其有限的机载计算资源,用于多个同时的车辆应用程序,同时满足其时间要求(特别是安全方面),即实时AV应用程序应与其他同时应用程序共享有限的机载资源,同时不错过其最后期限,同时不错过其他同时应用程序,因为相机的定位率为应用程序生成和提供输入图像。然而,如果不是全部的话,大多数现有的DNNN解决方案侧重于加强具体硬件的组合,而不以动态优化/调整DNN应用程序的资源需求,但取决于运行应用程序的数量,因为其计算成本高。为减轻这一限制,我们建议DynamMIX系统(D制式模型-全精度模型)与其他应用程序共享,以展示其资源需求,并力求最大限度地实现执行的准确性。为了在最坏的机载成本水平上实现最精确性,我们还考虑在最差的机型系统上下对机载系统进行最精确性调整,在最精确的机尾的机尾的优化中进行优化,我们仅考虑在最精确的机尾的机尾的机尾的机尾部优化地评估。