This article presents a novel, general, and effective simulation-inspired approach, called {\it repro samples method}, to conduct statistical inference. The approach studies the performance of artificial samples, referred to as {\it repro samples}, obtained by mimicking the true observed sample to achieve uncertainty quantification and construct confidence sets for parameters of interest with guaranteed coverage rates. Both exact and asymptotic inferences are developed. An attractive feature of the general framework developed is that it does not rely on the large sample central limit theorem and is likelihood-free. As such, it is thus effective for complicated inference problems which we can not solve using the large sample central limit theorem. The proposed method is applicable to a wide range of problems, including many open questions where solutions were previously unavailable, for example, those involving discrete or non-numerical parameters. To reduce the large computational cost of such inference problems, we develop a unique matching scheme to obtain a data-driven candidate set. Moreover, we show the advantages of the proposed framework over the classical Neyman-Pearson framework. We demonstrate the effectiveness of the proposed approach on various models throughout the paper and provide a case study that addresses an open inference question on how to quantify the uncertainty for the unknown number of components in a normal mixture model. To evaluate the empirical performance of our repro samples method, we conduct simulations and study real data examples with comparisons to existing approaches. Although the development pertains to the settings where the large sample central limit theorem does not apply, it also has direct extensions to the cases where the central limit theorem does hold.
翻译:文章提出了一个创新、一般和有效的模拟激励方法,称为 prit repro explex 方法}, 以进行统计推断。 这种方法研究人工样本的性能,称为 prit repro exampers}, 模拟真实观察到的样本,以实现不确定性的量化, 并为利益参数建立信任套件, 保证覆盖率。 开发了精确和无孔不入的推论。 开发的总框架的一个有吸引力的特征是, 它不依赖大型样本中央限制主点, 并且没有可能性。 因此, 这种方法对复杂的推断问题有效, 我们无法用大型样本中心限制主点来解决这个问题。 提议的方法适用于一系列广泛的问题, 包括许多以前没有解决办法的开放问题, 例如那些涉及离散或非数字参数的问题。 为了降低这类推断问题的巨大计算成本, 我们开发了一个独特的匹配方案, 以获得数据驱动的候选数据集。 此外, 我们展示了拟议框架对古典 Neyman- Pearson 框架的优势, 我们无法用大样本中大中心点限制 。 我们展示了常规样本的精确度分析方法的有效性, 在各种模型中, 我们用一个未知的模型的模型中, 我们用一个未知的模型的模型的模型的模型的模拟分析分析方法的模型中, 提供了一个未知的模型的模型的模型的精确度分析方法的精确性分析。