Distributional data analysis, concerned with statistical analysis and modeling for data objects consisting of random probability density functions (PDFs) in the framework of functional data analysis (FDA), has received considerable interest in recent years. However, many important aspects remain unexplored, such as outlier detection and robustness. Existing functional outlier detection methods are mainly used for ordinary functional data and usually perform poorly when applied to PDFs. To fill this gap, this study focuses on PDF-valued outlier detection, as well as its application in robust distributional regression. Similar to ordinary functional data, detecting the shape outlier masked by the "curve net" formed by the bulk of the PDFs is the major challenge in PDF-outlier detection. To this end, we propose a tree-structured transformation system for feature extraction as well as converting the shape outliers to easily detectable magnitude outliers, relevant outlier detectors are designed for the specific transformed data. A multiple detection strategy is also proposed to account for detection uncertainties and to combine different detectors to form a more reliable detection tool. Moreover, we propose a distributional-regression-based approach for detecting the abnormal associations of PDF-valued two-tuples. As a specific application, the proposed outlier detection methods are applied to robustify a distribution-to-distribution regression method, and we develop a robust estimator for the regression operator by downweighting the detected outliers. The proposed methods are validated and evaluated by extensive simulation studies or real data applications. Relevant comparative studies demonstrate the superiority of the developed outlier detection method with other competitors in distributional outlier detection.
翻译:分布式数据分析涉及在功能性数据分析(FDA)框架内对随机概率密度函数(PDFs)构成的数据对象进行统计分析和建模,近年来引起了相当大的兴趣;然而,许多重要方面仍未探索,例如外星探测和稳健性。现有的功能外星探测方法主要用于普通功能数据,在应用PDFs时通常表现不佳。为填补这一空白,本研究侧重于PDF估值的外星探测及其在稳健分布式回归中的应用。与普通功能数据类似,发现由大部分PDFs比较应用构成的“偏差网”遮盖的外形外体是PDFs外探测的主要挑战。为此,我们建议采用树形结构外外星探测系统,以及将外形外星转换为易于检测的外星体外星系。还提出了多种探测战略,以探测不确定性和将不同的探测器组合组合成更可靠的检测工具。此外,我们提议采用一种分布式反偏差的比值分析方法来探测超常的比级探测方法。