In this work, we study how to efficiently obtain perfect samples from a discrete distribution $\mathcal{D}$ given access only to pairwise comparisons of elements of its support. Specifically, we assume access to samples $(x, S)$, where $S$ is drawn from a distribution over sets $\mathcal{Q}$ (indicating the elements being compared), and $x$ is drawn from the conditional distribution $\mathcal{D}_S$ (indicating the winner of the comparison) and aim to output a clean sample $y$ distributed according to $\mathcal{D}$. We mainly focus on the case of pairwise comparisons where all sets $S$ have size 2. We design a Markov chain whose stationary distribution coincides with $\mathcal{D}$ and give an algorithm to obtain exact samples using the technique of Coupling from the Past. However, the sample complexity of this algorithm depends on the structure of the distribution $\mathcal{D}$ and can be even exponential in the support of $\mathcal{D}$ in many natural scenarios. Our main contribution is to provide an efficient exact sampling algorithm whose complexity does not depend on the structure of $\mathcal{D}$. To this end, we give a parametric Markov chain that mixes significantly faster given a good approximation to the stationary distribution. We can obtain such an approximation using an efficient learning from pairwise comparisons algorithm (Shah et al., JMLR 17, 2016). Our technique for speeding up sampling from a Markov chain whose stationary distribution is approximately known is simple, general and possibly of independent interest.
翻译:在这项工作中,我们研究如何高效率地从离散分配中获取完美样本 $\ mathcal{D} 美元,而获得的只是其支持要素的配对比较。具体地说,我们假设能够获取样本 $(x,S) 美元,其中美元是从每组的配送中抽取的 $(mathcal) 美元(表明所比较的因素) 美元,而美元是从有条件分配中抽取的 $(mathcal{D} 美元)(表明比较的胜者), 目的是输出一个更清洁的样本 $y 的比值 。 我们主要关注的是所有设置为 $(x,S) 美元规模的配对比 。 我们设计了一个马克夫 链, 它的固定分配与 $(macalcal cal) 相匹配 。 然而, 这个算法的精度取决于所分配的结构 $\ mathcal cal {D}, 并且甚至可以在支持 $\ cal calal levelylearal 美元 中 。