We introduce methods to bound the mean of a discrete distribution (or finite population) based on sample data, for random variables with a known set of possible values. In particular, the methods can be applied to categorical data with known category-based values. For small sample sizes, we show how to leverage the knowledge of the set of possible values to compute bounds that are stronger than for general random variables such as standard concentration inequalities.
翻译:我们引入了方法来约束基于抽样数据的离散分布(或有限人口)的平均值,包括已知一组可能值的随机变量。 特别是,这些方法可以适用于已知类别值的绝对数据。 对于小样本大小,我们展示了如何利用一组可能值的知识来计算比标准浓度不平等等一般随机变量更强的界限。