The behaviour of statistical relational representations across differently sized domains has become a focal area of research from both a modelling and a complexity viewpoint. Recently, projectivity of a family of distributions emerged as a key property, ensuring that marginal inference is independent of the domain size and that under certain additional requirements parameter estimation from random samples is statistically consistent. However, the currently used formalisation assumes that the domain is characterised only by its size. This contribution extends the notion of projectivity from families of distributions indexed by domain size to functors taking extensional data from a database. This makes projectivity available for the large range of applications taking structured input. We transfer the known results on projective families of distributions to the new setting. This includes statistical consistency of learning, a characterisation of projective fragments in different statistical relational formalisms as well as a general representation theorem for projective families of distributions. Furthermore, we prove a correspondence between projectivity and distributions on countably infinite domains, which we use to unify and generalise earlier work on statistical relational representations in infinite domains. Finally, we use the extended notion of projectivity to define a further strengthening, which we call $\sigma$-projectivity, and which allows the use of the same representation in different modes while retaining projectivity.
翻译:从建模和复杂的角度来看,不同规模领域的统计关系代表行为已成为研究的重点领域。最近,分布式大家庭的预测性成为关键属性,确保边际推论独立于领域大小,根据某些额外要求,随机抽样的参数估计在统计上是一致的。然而,目前使用的正规化假设域的特征仅以其大小为特征。这一贡献扩大了按域大小指数的分布式家庭与从数据库中取用扩展数据的真菌体的分布式家庭之间的投影性概念。这为大量应用结构化投入提供了投影性。我们把分布式分布式预测式大家庭的已知结果转移到新的环境。这包括学习的统计一致性、不同统计关系形式主义中投影性碎片的特征化以及分布式分布式分布式组合的一般代表性。此外,我们证明在可计量的有限范围内的预测性和分布性之间是对应的,我们用来统一和概括早先关于统计关系表象的工作。最后,我们使用扩大的投影率概念来界定不同统计形式,同时使用不同的比例性项目,我们称之为保留的模型。