The number of non-negative integer matrices with given row and column sums appears in a variety of problems in mathematics and statistics but no closed-form expression for it is known, so we rely on approximations of various kinds. Here we describe a new such approximation, motivated by consideration of the statistics of matrices with non-integer numbers of columns. This estimate can be evaluated in time linear in the size of the matrix and returns results of accuracy as good as or better than existing linear-time approximations across a wide range of settings. We also use this new estimate as the starting point for an improved numerical method for either counting or sampling matrices using sequential importance sampling. Code implementing our methods is provided.
翻译:带有给定行和列总和的非负整数矩阵的数量出现在数学和统计的多种问题中,但没有确切的表示方式,因此我们依赖各种近似值。这里我们描述一种新的近似值,其动机是考虑带有非整数列的矩阵统计数据。这一估计值可以按时间线性评价矩阵大小,并返回准确性结果,好于或优于现有线性近似值。我们还将这一新的估计值作为利用顺序重要性抽样进行计算或抽样矩阵的改进数字方法的起点。提供了执行我们方法的代码。