翻译标题：保证连接查询结果的统一抽样和结果大小估计的Õ(AGM/OUT)运行时间翻译摘要：我们提出了一种新方法，用于估计大型数据库D中小型连接查询Q的答案数 OUT，以及连接查询的统一抽样。我们的方法是第一个满足以下所有语句的方法。-支持任意Q，它可以是非循环或循环的，包含二进制和非二进制关系。-保证任意小的错误，并始终以\~O(AGM/OUT)的时间成本高概率成立，其中AGM是 OUT 的 AGM 上限（OUT的上限），而\~O隐藏了输入大小的对数因子。我们还在一个统一的框架中解释了以前的连接大小估计器。所有方法都依赖于D中关系上的某些索引，这些索引在线性时间内构建。此外，我们使用广义超树分解（GHD）扩展了我们的方法，以在OUT较小时获得比Õ(AGM/OUT)更低的复杂度，并介绍了用于提高估计效率和准确性的优化技术。 (Guaranteeing the Õ(AGM/OUT) Runtime for Uniform Sampling and OUT Size Estimation over Joins)

翻译：翻译标题：保证连接查询结果的统一抽样和结果大小估计的Õ(AGM/OUT)运行时间翻译摘要：我们提出了一种新方法，用于估计大型数据库D中小型连接查询Q的答案数 OUT，以及连接查询的统一抽样。我们的方法是第一个满足以下所有语句的方法。-支持任意Q，它可以是非循环或循环的，包含二进制和非二进制关系。-保证任意小的错误，并始终以\~O(AGM/OUT)的时间成本高概率成立，其中AGM是 OUT 的 AGM 上限（OUT的上限），而\~O隐藏了输入大小的对数因子。我们还在一个统一的框架中解释了以前的连接大小估计器。所有方法都依赖于D中关系上的某些索引，这些索引在线性时间内构建。此外，我们使用广义超树分解（GHD）扩展了我们的方法，以在OUT较小时获得比Õ(AGM/OUT)更低的复杂度，并介绍了用于提高估计效率和准确性的优化技术。

Kyoungmin Kim,Jaehyun Ha,George Fletcher,Wook-Shin Han

from arxiv, 19 pages

We propose a new method for estimating the number of answers OUT of a small join query Q in a large database D, and for uniform sampling over joins. Our method is the first to satisfy all the following statements. - Support arbitrary Q, which can be either acyclic or cyclic, and contain binary and non-binary relations. - Guarantee an arbitrary small error with a high probability always in \~O(AGM/OUT) time, where AGM is the AGM bound OUT (an upper bound of OUT), and \~O hides the polylogarithmic factor of input size. We also explain previous join size estimators in a unified framework. All methods including ours rely on certain indexes on relations in D, which take linear time to build offline. Additionally, we extend our method using generalized hypertree decompositions (GHDs) to achieve a lower complexity than \~O(AGM/OUT) when OUT is small, and present optimization techniques for improving estimation efficiency and accuracy.

翻译：