The adoption of "human-in-the-loop" paradigms in computer vision and machine learning is leading to various applications where the actual data acquisition (e.g., human supervision) and the underlying inference algorithms are closely interwined. While classical work in active learning provides effective solutions when the learning module involves classification and regression tasks, many practical issues such as partially observed measurements, financial constraints and even additional distributional or structural aspects of the data typically fall outside the scope of this treatment. For instance, with sequential acquisition of partial measurements of data that manifest as a matrix (or tensor), novel strategies for completion (or collaborative filtering) of the remaining entries have only been studied recently. Motivated by vision problems where we seek to annotate a large dataset of images via a crowdsourced platform or alternatively, complement results from a state-of-the-art object detector using human feedback, we study the "completion" problem defined on graphs, where requests for additional measurements must be made sequentially. We design the optimization model in the Fourier domain of the graph describing how ideas based on adaptive submodularity provide algorithms that work well in practice. On a large set of images collected from Imgur, we see promising results on images that are otherwise difficult to categorize. We also show applications to an experimental design problem in neuroimaging.
翻译:在计算机视觉和机器学习中采用“人到人到行”的范式通常不属于这种处理的范围。例如,随着在计算机视觉和机器学习中采用“人到行”范式,实际数据获取(例如,人到监督)和基本推算算法密切交织在一起的各种应用正在导致实际数据获取(例如,人到监督)和基本推算算法之间的相互交织。虽然在积极学习方面的典型工作提供了有效的解决办法,但当学习模块涉及分类和回归任务时,许多实际问题,例如部分观测到的测量、财政限制,甚至数据的额外分布或结构方面通常不属于这种处理的范围。例如,随着对显示为矩阵(或阵列)的数据的局部测量的顺序获取,最近才对其余条目的完成(或协作过滤)新颖战略进行了研究。由于我们试图通过众到源平台来说明大量图像数据集的图像数据集,我们从适应性子进化的图像中提供了新的算法。我们从一个大系列的实验性图像到一个我们所收集到一个有希望的图像的模型。