Partial least squares (PLS) is a dimensionality reduction technique used as an alternative to ordinary least squares (OLS) in situations where the data is colinear or high dimensional. Both PLS and OLS provide mean based estimates, which are extremely sensitive to the presence of outliers or heavy tailed distributions. In contrast, quantile regression is an alternative to OLS that computes robust quantile based estimates. In this work, the multivariate PLS is extended to the quantile regression framework, obtaining a theoretical formulation of the problem and a robust dimensionality reduction technique that we call fast partial quantile regression (fPQR), that provides quantile based estimates. An efficient implementation of fPQR is also derived, and its performance is studied through simulation experiments and the chemometrics well known biscuit dough dataset, a real high dimensional example.
翻译:部分最小方(PLS) 是一种在数据为共线或高维的情况下作为普通最小方(OLS)的替代物而使用的维度减少技术。 PLS 和 OLS 都提供了平均的基于估算值,这些估算值对外部线的存在极为敏感,或者对严重尾部分布极为敏感。相反,四分位回归是计算稳健的量基估算值的 OLS 的替代物。在这项工作中,多变量 PLS 扩展至四分位回归框架, 获得问题的理论配方, 并获得一种强力的维度减少技术, 我们称之为快速的局部回归(fPQR ), 提供基于定量的估算值。 FPQR 的高效实施也被推算出来, 其性能通过模拟实验和广为人知的双项数据集来研究, 一个真正的高维示例。