The remarkable progress in computer vision over the last few years is, by and large, attributed to deep learning, fueled by the availability of huge sets of labeled data, and paired with the explosive growth of the GPU paradigm. While subscribing to this view, this book criticizes the supposed scientific progress in the field and proposes the investigation of vision within the framework of information-based laws of nature. Specifically, the present work poses fundamental questions about vision that remain far from understood, leading the reader on a journey populated by novel challenges resonating with the foundations of machine learning. The central thesis is that for a deeper understanding of visual computational processes, it is necessary to look beyond the applications of general purpose machine learning algorithms and focus instead on appropriate learning theories that take into account the spatiotemporal nature of the visual signal.
翻译:过去几年中,计算机愿景的显著进步,基本上归功于深层次的学习,由大量贴标签的数据的提供所推动,与GPU范式的爆炸性增长相配合。这本书在支持这一观点的同时,批评了该领域的假定科学进步,并提议在以信息为基础的自然法则框架内对愿景进行调查。具体地说,目前的工作提出了关于愿景的基本问题,但远未被理解,使读者走上一条充满与机器学习基础相一致的新挑战的旅程。核心论点是,为了更深入地了解视觉计算过程,有必要超越一般目的机器学习算法的应用,而侧重于考虑到视觉信号的波长性质的适当学习理论。