Efficient Pruning Schemes for Distance-Based Outlier Detection
N. Vu and V. Gopalkrishnan.
Machine Learning and Knowledge Discovery in Databases (2009)

Outlier detection finds many applications, especially in domains that have scope for abnormal behavior. In this paper, we present a new technique for detecting distance-based outliers, aimed at reducing execution time associated with the detection process. Our approach operates in two phases and employs three pruning rules. In the first phase, we partition the data into clusters, and make an early estimate on the lower bound of outlier scores. Based on this lower bound, the second phase then processes relevant clusters using the traditional block nested-loop algorithm. Here two efficient pruning rules are utilized to quickly discard more non-outliers and reduce the search space. Detailed analysis of our approach shows that the additional overhead of the first phase is offset by the reduction in cost of the second phase. We also demonstrate the superiority of our approach over existing distance-based outlier detection methods by extensive empirical studies on real datasets.
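
To illustrate the general idea of using a lower bound on outlier scores to discard non-outliers in a nested loop, here is a minimal Python sketch. It is not the paper's method: it omits the clustering phase and the three pruning rules, and shows only a generic cutoff-based pruning step. The function name `top_n_outliers`, the parameters `k` and `n`, and the choice of score (distance to the k-th nearest neighbor) are assumptions made for this example; the abstract does not define them.

```python
import heapq
import math


def top_n_outliers(points, k, n):
    """Return up to n (score, index) pairs with the largest scores.

    Assumption for this sketch: a point's outlier score is the
    distance to its k-th nearest neighbor.
    """
    top = []      # min-heap of (score, index) holding the best n candidates
    cutoff = 0.0  # lower bound a score must exceed to enter the top n

    for i, p in enumerate(points):
        nearest = []  # max-heap (negated) of the k smallest distances so far
        pruned = False
        for j, q in enumerate(points):
            if i == j:
                continue
            d = math.dist(p, q)
            if len(nearest) < k:
                heapq.heappush(nearest, -d)
            elif d < -nearest[0]:
                heapq.heapreplace(nearest, -d)
            # With k neighbors known, -nearest[0] is an upper bound on the
            # final score. If it is below the cutoff, p cannot be in the top n.
            if len(nearest) == k and -nearest[0] < cutoff:
                pruned = True
                break
        if pruned or len(nearest) < k:
            continue
        score = -nearest[0]
        if len(top) < n:
            heapq.heappush(top, (score, i))
        elif score > top[0][0]:
            heapq.heapreplace(top, (score, i))
        if len(top) == n:
            cutoff = top[0][0]  # weakest score among the current top n

    return sorted(top, reverse=True)


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10)]
    print(top_n_outliers(data, k=2, n=1))
```

The cutoff only rises as better candidates are found, so the inner loop tends to terminate earlier for points in dense regions; an early estimate of this bound, as the abstract describes for its first phase, would let pruning start sooner.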

URL

http://dx.doi.org/10.1007/978-3-642-04174-7_11


@article{nguyen2009efficient,
  abstract = {Outlier detection finds many applications, especially in domains that have scope for abnormal behavior. In this paper, we present a new technique for detecting distance-based outliers, aimed at reducing execution time associated with the detection process. Our approach operates in two phases and employs three pruning rules. In the first phase, we partition the data into clusters, and make an early estimate on the lower bound of outlier scores. Based on this lower bound, the second phase then processes relevant clusters using the traditional block nested-loop algorithm. Here two efficient pruning rules are utilized to quickly discard more non-outliers and reduce the search space. Detailed analysis of our approach shows that the additional overhead of the first phase is offset by the reduction in cost of the second phase. We also demonstrate the superiority of our approach over existing distance-based outlier detection methods by extensive empirical studies on real datasets.},
  added-at = {2010-05-04T08:55:46.000+0200},
  author = {Vu, Nguyen and Gopalkrishnan, Vivekanand},
  biburl = {https://puma.uni-kassel.de/bibtex/2b33d7b9133cc3d81e507f4366658fb56/folke},
  interhash = {e219b7e66b466cc39f44520b37f91a61},
  intrahash = {b33d7b9133cc3d81e507f4366658fb56},
  journal = {Machine Learning and Knowledge Discovery in Databases},
  keywords = {2009 clustering detection ecml outlier pkdd},
  pages = {160--175},
  timestamp = {2010-05-04T08:55:48.000+0200},
  title = {Efficient Pruning Schemes for Distance-Based Outlier Detection},
  url = {http://dx.doi.org/10.1007/978-3-642-04174-7_11},
  year = 2009
}

%0 Journal Article
%1 nguyen2009efficient
%A Vu, Nguyen
%A Gopalkrishnan, Vivekanand
%D 2009
%J Machine Learning and Knowledge Discovery in Databases
%K 2009 clustering detection ecml outlier pkdd
%P 160--175
%T Efficient Pruning Schemes for Distance-Based Outlier Detection
%U http://dx.doi.org/10.1007/978-3-642-04174-7_11
%X Outlier detection finds many applications, especially in domains that have scope for abnormal behavior. In this paper, we present a new technique for detecting distance-based outliers, aimed at reducing execution time associated with the detection process. Our approach operates in two phases and employs three pruning rules. In the first phase, we partition the data into clusters, and make an early estimate on the lower bound of outlier scores. Based on this lower bound, the second phase then processes relevant clusters using the traditional block nested-loop algorithm. Here two efficient pruning rules are utilized to quickly discard more non-outliers and reduce the search space. Detailed analysis of our approach shows that the additional overhead of the first phase is offset by the reduction in cost of the second phase. We also demonstrate the superiority of our approach over existing distance-based outlier detection methods by extensive empirical studies on real datasets.
