While uniform sampling has been widely studied in the matrix completion literature, CUR sampling approximates a low-rank matrix via row and column samples. Unfortunately, both sampling models lack flexibility for various circumstances in real-world applications. In this work, we propose a novel and easy-to-implement sampling strategy, coined Cross-Concentrated Sampling (CCS). By bridging uniform sampling and CUR sampling, CCS provides extra flexibility that can potentially save sampling costs in applications. In addition, we also provide a sufficient condition for CCS-based matrix completion. Moreover, we propose a highly efficient non-convex algorithm, termed Iterative CUR Completion (ICURC), for the proposed CCS model. Numerical experiments verify the empirical advantages of CCS and ICURC against uniform sampling and its baseline algorithms, on both synthetic and real-world datasets.
翻译:虽然在矩阵完成文献中对统一抽样进行了广泛研究,但CUR抽样通过行和柱抽样大致接近低级矩阵,不幸的是,两种抽样模型都缺乏适应现实世界应用中各种情况的灵活性,在这项工作中,我们提出了一个创新的、易于执行的抽样战略,催生了交叉集中抽样(CCS),通过连接统一抽样和CUR抽样,CCS提供了额外的灵活性,有可能节省应用中的抽样成本。此外,我们还为CCS基矩阵的完成提供了充分的条件。此外,我们提议对拟议的CCS模型采用一种高效的非凝解算法,称为循环CUR完成法(ICUR)。数字实验核查CCS和ICURC在合成和现实世界数据集方面对统一取样及其基线算法的经验优势。