We consider the statistical inference for noisy incomplete binary (or 1-bit) matrix. Despite the importance of uncertainty quantification to matrix completion, most of the categorical matrix completion literature focuses on point estimation and prediction. This paper moves one step further toward the statistical inference for binary matrix completion. Under a popular nonlinear factor analysis model, we obtain a point estimator and derive its asymptotic normality. Moreover, our analysis adopts a flexible missing-entry design that does not require a random sampling scheme as required by most of the existing asymptotic results for matrix completion. Under reasonable conditions, the proposed estimator is statistically efficient and optimal in the sense that the Cramer-Rao lower bound is achieved asymptotically for the model parameters. Two applications are considered, including (1) linking two forms of an educational test and (2) linking the roll call voting records from multiple years in the United States Senate. The first application enables the comparison between examinees who took different test forms, and the second application allows us to compare the liberal-conservativeness of senators who did not serve in the Senate at the same time.
翻译:我们考虑对噪音不全的二元(或一比位)矩阵的统计推断。尽管不确定性量化对矩阵完成十分重要,但大多数绝对矩阵完成文献都侧重于点估测和预测。本文件进一步向二元矩阵完成的统计推断迈进了一步。在流行的非线性要素分析模型下,我们获得了一个点估量器,并得出其无症状的正常性。此外,我们的分析采用了灵活的缺失输入设计,该设计不要求按照大多数现有无症状结果的随机抽样方法来完成矩阵完成。在合理条件下,拟议的估算器在统计上是有效和最佳的,因为在模型参数方面,低频线-射线下线的受限在统计推算中是轻度完成的。考虑了两种应用,包括:(1) 将两种教育测试形式联系起来,(2) 将美国参议院多年的滚动投票记录联系起来。第一个应用使采用不同测试表的受访者能够进行比较,而第二个应用使我们能够比较在参议院没有同时任职的参议员的自由保守性。