In digital cameras, we find a major limitation: the image and video form inherited from a film camera obstructs it from capturing the rapidly changing photonic world. Here, we present vidar, a bit sequence array where each bit represents whether the accumulation of photons has reached a threshold, to record and reconstruct the scene radiance at any moment. By employing only consumer-level CMOS sensors and integrated circuits, we have developed a vidar camera that is 1,000x faster than conventional cameras. By treating vidar as spike trains in biological vision, we have further developed a spiking neural network-based machine vision system that combines the speed of the machine and the mechanism of biological vision, achieving high-speed object detection and tracking 1,000x faster than human vision. We demonstrate the utility of the vidar camera and the super vision system in an assistant referee and target pointing system. Our study is expected to fundamentally revolutionize the image and video concepts and related industries, including photography, movies, and visual media, and to unseal a new spiking neural network-enabled speed-free machine vision era.
翻译:在数字相机中,我们发现一个主要的限制:从电影相机中继承的图像和视频形式阻碍了它捕捉迅速变化的光学世界。在这里,我们展示了维达尔,一个位数的顺序阵列,每个位数代表光子的累积是否达到临界值,以便随时记录并重建场景的亮度。我们仅使用消费者一级的 CMOS 传感器和集成电路,就开发了一个维达尔摄像头的摄像头,比常规摄像头快1 000x。我们把维达尔作为生物视觉中的峰值列列,进一步开发了一个基于神经网络的机器视觉系统,将机器的速度和生物视觉机制结合起来,实现高速天体探测并跟踪比人类视觉更快的1000x。我们展示了维达尔摄像头和超级视觉系统的实用性,一个助理裁判和瞄准系统。我们的研究有望从根本上改革图像和视频概念及相关产业,包括摄影、电影和视觉媒体。我们的研究还进一步开发了一个新的神经网络快速无线的机器视觉时代。