Learning general-purpose representations from multisensor data produced by the omnipresent sensing systems (or IoT in general) has numerous applications in diverse use cases. Existing purely supervised end-to-end deep learning techniques depend on the availability of a massive amount of well-curated data, acquiring which is notoriously difficult but required to achieve a sufficient level of generalization on a task of interest. In this work, we leverage the self-supervised learning paradigm towards realizing the vision of continual learning from unlabeled inputs. We present a generalized framework named Sense and Learn for representation or feature learning from raw sensory data. It consists of several auxiliary tasks that can learn high-level and broadly useful features entirely from unannotated data without any human involvement in the tedious labeling process. We demonstrate the efficacy of our approach on several publicly available datasets from different domains and in various settings, including linear separability, semi-supervised or few shot learning, and transfer learning. Our methodology achieves results that are competitive with the supervised approaches and close the gap through fine-tuning a network while learning the downstream tasks in most cases. In particular, we show that the self-supervised network can be utilized as initialization to significantly boost the performance in a low-data regime with as few as 5 labeled instances per class, which is of high practical importance to real-world problems. Likewise, the learned representations with self-supervision are found to be highly transferable between related datasets, even when few labeled instances are available from the target domains. The self-learning nature of our methodology opens up exciting possibilities for on-device continual learning.
翻译:从全景感应系统(或一般的IoT)产生的多传感器数据中学习一般目的的表示方式在多种使用案例中有许多应用。现有的纯受监督的端到端深学习技术取决于能否获得大量精密的数据,而获得这种数据是臭名昭著的困难,但为了在感兴趣的任务中实现足够程度的概括化而需要获得这些数据。在这项工作中,我们利用自我监督的学习模式,以实现从无标签投入中不断学习的愿景。我们提出了一个名为Sense and Learning的通用框架,用于从原始感应数据中学习代表或特征学习。它包含一些辅助性任务,这些辅助任务完全可以从无注释的数据中学习高层次和广泛有用的特征,而没有人类参与任何繁琐的标签过程。我们展示了我们在不同领域和不同环境中公开提供的若干数据集,包括线性可辨性、半超强或少短镜头的学习,以及转移学习。我们的方法取得了与监督性学习方法相竞争的结果,通过精细的网络调整而缩小差距,同时在多数情况下学习下游任务中可以从无注释性的数据中学习,我们作为高级可追溯性数据的高级数据,我们利用的高级数据,在高等级中学习的轨道上进行自我分析。在高层次上进行自我分析。我们所利用的轨道进行自我分析,在高等级研究,在高层次上进行自我研究,在高层次上进行自我研究。在低层次研究。在高层次研究。作为利用的轨道上进行自我研究。在高层次研究,作为利用的轨道上,在高层次研究。