Statistical independence and conditional independence are two fundamental concepts in statistics and machine learning. Copula Entropy is a mathematical concept defined by Ma and Sun for multivariate statistical independence measuring and testing, and also proved to be closely related to conditional independence (or transfer entropy). As the unified framework for measuring both independence and causality, CE has been applied to solve several related statistical or machine learning problems, including association discovery, structure learning, variable selection, and causal discovery. The nonparametric methods for estimating copula entropy and transfer entropy were also proposed previously. This paper introduces copent, the R package which implements these proposed methods for estimating copula entropy and transfer entropy. The implementation detail of the package is introduced. Three examples with simulated data and real-world data on variable selection and causal discovery are also presented to demonstrate the usage of this package. The examples on variable selection and causal discovery show the strong ability of copent on testing (conditional) independence compared with the related packages. The copent package is available on the Comprehensive R Archive Network (CRAN) and also on GitHub at https://github.com/majianthu/copent.
翻译:统计独立和有条件独立是统计和机器学习的两个基本概念。Copula Entropy是一个数学概念,由Ma和Sun为多变量统计独立度和测试而定义,也证明与有条件独立(或转移酶)密切相关。作为衡量独立性和因果关系的统一框架,CE已经用于解决若干相关的统计或机器学习问题,包括协会发现、结构学习、变量选择和因果发现。以前也曾提出过估算相交录和传输酶的非参数性方法。本文介绍了用于估算相交录的R软件,即采用这些估算相交录和传输酶的拟议方法的R软件包。介绍了该软件包的实施细节。还介绍了三个模拟数据和真实世界变量选择和因果发现数据的例子,以证明该软件包的使用情况。关于可变选择和因果发现的例子表明,与相关软件包相比,共同测试(有条件)独立度的能力很强。CR档案综合网络(CRAAN)和GitHububub(https://github.com/majiant/copent/copent) 。