Outlier detection is a significant area in data mining. It can be either used to pre-process the data prior to an analysis or post the processing phase (before visualization) depending on the effectiveness of the outlier and its importance. Outlier detection extends to several fields such as detection of credit card fraud, network intrusions, machine failure prediction, potential terrorist attacks, and so on. Outliers are those data points with characteristics considerably different. They deviate from the data set causing inconsistencies, noise and anomalies during analysis and result in modification of the original points However, a common misconception is that outliers have to be immediately eliminated or replaced from the data set. Such points could be considered useful if analyzed separately as they could be obtained from a separate mechanism entirely making it important to the research question. This study surveys the different methods of outlier detection for spatial analysis. Spatial data or geospatial data are those that exhibit geographic properties or attributes such as position or areas. An example would be weather data such as precipitation, temperature, wind velocity, and so on collected for a defined region.
翻译:外线探测是数据挖掘的一个重要领域,既可用于在分析前预处理数据,也可用于在处理阶段(可视化前)之前处理数据,这取决于外线的有效性和重要性。外线探测延伸到若干领域,例如发现信用卡欺诈、网络入侵、机器故障预测、潜在恐怖袭击等。外线是具有显著不同特征的数据点。这些数据点不同于数据集,在分析过程中造成不一致、噪音和异常,并导致原始点的修改。然而,一个常见的错误观念是,外线必须立即从数据集中消除或替换。如果从一个对研究问题具有完全重要性的单独机制中单独分析,这些点可以被视为有用。这项研究调查了空间分析的外向探测方法。空间数据或地理空间数据是具有地理属性或位置或区域等特征的数据。一个实例是降雨、温度、风速等天气数据,为特定区域收集的数据就是一例。