The direpack package aims to establish a set of modern statistical dimension reduction techniques into the Python universe as a single, consistent package. The dimension reduction methods included resort into three categories: projection pursuit based dimension reduction, sufficient dimension reduction, and robust M estimators for dimension reduction. As a corollary, regularized regression estimators based on these reduced dimension spaces are provided as well, ranging from classical principal component regression up to sparse partial robust M regression. The package also contains a set of classical and robust pre-processing utilities, including generalized spatial signs, as well as dedicated plotting functionality and cross-validation utilities. Finally, direpack has been written consistent with the scikit-learn API, such that the estimators can flawlessly be included into (statistical and/or machine) learning pipelines in that framework.
翻译:粒子包装包旨在将一套现代统计层面的减少技术作为一个单一、一致的包件进入皮顿宇宙中,该包件旨在将一套现代统计层面的减少技术作为一套单一、一致的包件;该包件的减少方法包括分为三类:投影追求的尺寸减少、足够的尺寸减少和对尺寸减少的稳健估计;作为必然结果,还提供了基于这些较小尺寸空间的正规回归估计器,从经典主要组成部分倒退到稀薄的部分稳健M型回归;该包包包还包含一套传统和稳健的预处理工具,包括通用空间标志,以及专用绘图功能和交叉验证工具;最后,该包件的写法与Scikit-learn API一致,因此,估计器可以完美地纳入(统计和/或机器)在该框架中的学习管道中。