Deep Neural Networks (DNNs) have achieved great success in a massive number of artificial intelligence (AI) applications by delivering high-quality computer vision, natural language processing, and virtual reality applications. However, these emerging AI applications also come with increasing computation and memory demands, which are challenging to handle especially for the embedded systems where limited computation/memory resources, tight power budgets, and small form factors are demanded. Challenges also come from the diverse application-specific requirements, including real-time responses, high-throughput performance, and reliable inference accuracy. To address these challenges, we will introduce a series of effective design methods in this book chapter to enable efficient algorithms, compilers, and various optimizations for embedded systems.
翻译:深神经网络(DNNs)通过提供高质量的计算机视觉、自然语言处理和虚拟现实应用,在大量人工智能应用方面取得了巨大成功,然而,这些新兴的人工智能应用也伴随着越来越多的计算和记忆需求,这些需求尤其对内部系统具有挑战性,因为内部系统需要有限的计算/模拟资源、紧凑的电力预算和小形式因素。挑战还来自各种具体应用要求,包括实时反应、高通量性能和可靠的推论准确性。为了应对这些挑战,我们将在本书章中引入一系列有效的设计方法,使内部系统能够高效的算法、编集器和各种优化。