sift:principal_component_analysis:outlier_detection_for_pca

Differences

This shows you the differences between two versions of the page.

sift:principal_component_analysis:outlier_detection_for_pca [2024/08/28 19:02] – [Reference] wikisysop
sift:principal_component_analysis:outlier_detection_for_pca [2024/08/28 19:04] (current) – [Reference] wikisysop
Line 98:
 For many KDD applications, such as detecting criminal activities in E-commerce, finding the rare instances or the outliers, can be more interesting than finding the common patterns. Existing work in outlier detection regards being an outlier as a binary property. In this paper, we contend that for many scenarios, it is more meaningful to assign to each object a degree of being an outlier. This degree is called the local outlier factor (LOF) of an object. It is local in that the degree depends on how isolated the object is with respect to the surrounding neighborhood. We give a detailed formal analysis showing that LOF enjoys many desirable properties. Using real-world datasets, we demonstrate that LOF can be used to find outliers which appear to be meaningful, but can otherwise not be identified with existing approaches. Finally, a careful performance evaluation of our algorithm confirms we show that our approach of finding local outliers can be practical.
  
 +----
  
-Markus M. Breunig, Hans-Peter Kriegel, Raymond T. Ng, and Jörg Sander. 2000. LOF: identifying density-based local outliers. SIGMOD Rec. 29, 2 (June 2000), 93–104. https://doi.org/10.1145/335191.335388
+Slišković, Dražen, Ratko Grbić, and Željko Hocenski. "Multivariate statistical process monitoring." Tehnički vjesnik 19.(2012): 33-41.
  
 **Abstract**
sift/principal_component_analysis/outlier_detection_for_pca.1724871753.txt.gz · Last modified: 2024/08/28 19:02 by wikisysop