Quantifying the Loss of Acyclic Join Dependencies - 专知 (Zhuanzhi) Papers


Redundancy · Loss · Association · KL divergence · Lower bound

April 10, 2023

Quantifying the Loss of Acyclic Join Dependencies


Batya Kenig, Nir Weinberger

From arXiv; to appear in PODS 2023

Acyclic schemes possess known benefits for database design, speeding up queries, and reducing space requirements. An acyclic join dependency (AJD) is lossless with respect to a universal relation if joining the projections associated with the schema results in the original universal relation. An intuitive and standard measure of loss entailed by an AJD is the number of redundant tuples generated by the acyclic join. Recent work has shown that the loss of an AJD can also be characterized by an information-theoretic measure. Motivated by the problem of automatically fitting an acyclic schema to a universal relation, we investigate the connection between these two characterizations of loss. We first show that the loss of an AJD is captured using the notion of KL-divergence. We then show that the KL-divergence can be used to bound the number of redundant tuples. We prove a deterministic lower bound on the percentage of redundant tuples. For an upper bound, we propose a random database model, and establish a high probability bound on the percentage of redundant tuples, which coincides with the lower bound for large databases.
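To make the two notions of loss concrete, here is a minimal Python sketch for the simplest acyclic schema, two bags {XY, XZ} over a relation with attributes (X, Y, Z). It is an illustration under stated assumptions, not the paper's construction: the relations `R` and `LOSSLESS` and all function names are hypothetical. The information-theoretic measure used here is the conditional mutual information I(Y;Z|X) under the empirical distribution (uniform over the tuples of the relation), which for this two-bag case equals the KL-divergence between the empirical distribution and its factorization P(x)P(y|x)P(z|x).

```python
from collections import Counter
from math import log2

# Hypothetical toy relation over attributes (X, Y, Z); not data from the paper.
R = {(0, "a", "p"), (0, "b", "q"), (1, "c", "r")}
# A relation that the schema {XY, XZ} decomposes without loss.
LOSSLESS = {(0, "a", "p"), (0, "a", "q"), (0, "b", "p"), (0, "b", "q")}


def spurious_fraction(rows):
    """Redundant tuples produced by joining R[XY] with R[XZ], relative to |R|."""
    r_xy = {(x, y) for x, y, _ in rows}
    r_xz = {(x, z) for x, _, z in rows}
    # Natural join of the two projections on the shared attribute X.
    joined = {(x, y, z) for x, y in r_xy for x2, z in r_xz if x == x2}
    return (len(joined) - len(rows)) / len(rows)


def entropy(rows, idx):
    """Entropy (in bits) of the attributes at positions idx, with each tuple
    of the relation weighted equally (the empirical distribution)."""
    n = len(rows)
    counts = Counter(tuple(row[i] for i in idx) for row in rows)
    return -sum(c / n * log2(c / n) for c in counts.values())


def cond_mutual_information(rows):
    """I(Y;Z|X) = H(XY) + H(XZ) - H(XYZ) - H(X)."""
    return (entropy(rows, (0, 1)) + entropy(rows, (0, 2))
            - entropy(rows, (0, 1, 2)) - entropy(rows, (0,)))


rho = spurious_fraction(R)
j = cond_mutual_information(R)
print(f"rho = {rho:.3f}, J = {j:.3f} bits, 2^J - 1 = {2 ** j - 1:.3f}")
# prints: rho = 0.667, J = 0.667 bits, 2^J - 1 = 0.587
```

In this toy instance the join produces 5 tuples from 3 originals, so the fraction of redundant tuples is 2/3, and the information-theoretic loss is 2/3 bits. The values satisfy 2^J - 1 ≤ rho, which is consistent with the KL-divergence giving a lower bound on the redundant-tuple fraction; for `LOSSLESS` both quantities are zero. The exact form of the bounds proved in the paper is not stated in the abstract, so treat the inequality as an observation about this example only.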

