Pre-smoothing is a technique aimed at increasing the signal-to-noise ratio in data to improve subsequent estimation and model selection in regression problems. Motivated by the many scientific applications in which multi-response regression problems arise, particularly when the number of responses is large, we propose here to extend pre-smoothing methods to the multiple outcomne setting. Specifically, we introduce and study a simple technique for pre-smoothing based on low-rank approximation. We establish theoretical results on the performance of the proposed methodology, which show that in the large-response setting, the proposed technique outperforms ordinary least squares estimation with the mean squared error criterion, whilst being computationally more efficient than alternative approaches such as reduced rank regression. We quantify our estimator's benefit empirically in a number of simulated experiments. We also demonstrate our proposed low-rank pre-smoothing technique on real data arising from the environmental and biological sciences.
翻译:暂无翻译