Recent advances in unsupervised learning have highlighted the possibility of learning to reconstruct signals from noisy and incomplete linear measurements alone. These methods play a key role in medical and scientific imaging and sensing, where ground truth data is often scarce or difficult to obtain. However, in practice, measurements are not only noisy and incomplete but also quantized. Here we explore the extreme case of learning from binary observations and provide necessary and sufficient conditions on the number of measurements required for identifying a set of signals from incomplete binary data. Our results are complementary to existing bounds on signal recovery from binary measurements. Furthermore, we introduce a novel self-supervised learning approach, which we name SSBM, that only requires binary data for training. We demonstrate in a series of experiments with real datasets that SSBM performs on par with supervised learning and outperforms sparse reconstruction methods with a fixed wavelet basis by a large margin.
翻译:在未受监督的近期学习进展中,突出地显示了学习从噪音和不完整线性测量中重建信号的可能性,这些方法在医疗和科学成像和遥感中发挥着关键作用,因为地面真实数据往往很少或难以获得,但在实践中,测量不仅噪音和不完整,而且量化。在这里,我们探讨了从二进制观测中学习的极端案例,并就从不完整的二进制数据中确定一组信号所需的测量数量提供了必要和充分的条件。我们的结果与从二进制测量中恢复信号的现有界限是相辅相成的。此外,我们采用了一种新型的自我监督学习方法,我们称之为SSBM,这只需要二进制数据来进行培训。我们在一系列实验中用真实的数据集展示了SSBM在受监督的学习中演化,在大边缘固定波盘的基础上超越稀有的重建方法。</s>