Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficient Pruning Schemes for Distance-Based Outlier Detection

N. Vu, und V. Gopalkrishnan. Machine Learning and Knowledge Discovery in Databases (2009)

Zusammenfassung

Outlier detection finds many applications, especially in domains that have scope for abnormal behavior. In this paper, we present a new technique for detecting distance-based outliers, aimed at reducing execution time associated with the detectionprocess. Our approach operates in two phases and employs three pruning rules. In the first phase, we partition the data intoclusters, and make an early estimate on the lower bound of outlier scores. Based on this lower bound, the second phase thenprocesses relevant clusters using the traditional block nested-loop algorithm. Here two efficient pruning rules are utilizedto quickly discard more non-outliers and reduce the search space. Detailed analysis of our approach shows that the additionaloverhead of the first phase is offset by the reduction in cost of the second phase. We also demonstrate the superiority ofour approach over existing distance-based outlier detection methods by extensive empirical studies on real datasets.

Links und Ressourcen

URL:

http://dx.doi.org/10.1007/978-3-642-04174-7_11

BibTeX-Schlüssel:

nguyen2009efficient

Suchen auf:

Kommentare und Rezensionen
(0)

Es gibt bisher keine Rezension oder Kommentar. Sie können eine schreiben!

Zitieren Sie diese Publikation

@article{nguyen2009efficient,
  abstract = {Outlier detection finds many applications, especially in domains that have scope for abnormal behavior. In this paper, we
present a new technique for detecting distance-based outliers, aimed at reducing execution time associated with the detectionprocess. Our approach operates in two phases and employs three pruning rules. In the first phase, we partition the data intoclusters, and make an early estimate on the lower bound of outlier scores. Based on this lower bound, the second phase thenprocesses relevant clusters using the traditional block nested-loop algorithm. Here two efficient pruning rules are utilizedto quickly discard more non-outliers and reduce the search space. Detailed analysis of our approach shows that the additionaloverhead of the first phase is offset by the reduction in cost of the second phase. We also demonstrate the superiority ofour approach over existing distance-based outlier detection methods by extensive empirical studies on real datasets.},
  added-at = {2010-05-04T08:55:46.000+0200},
  author = {Vu, Nguyen and Gopalkrishnan, Vivekanand},
  biburl = {https://puma.uni-kassel.de/bibtex/2b33d7b9133cc3d81e507f4366658fb56/folke},
  interhash = {e219b7e66b466cc39f44520b37f91a61},
  intrahash = {b33d7b9133cc3d81e507f4366658fb56},
  journal = {Machine Learning and Knowledge Discovery in Databases},
  keywords = {2009 clustering detection ecml outlier pkdd},
  pages = {160--175},
  timestamp = {2010-05-04T08:55:48.000+0200},
  title = {Efficient Pruning Schemes for Distance-Based Outlier Detection},
  url = {http://dx.doi.org/10.1007/978-3-642-04174-7_11},
  year = 2009
}

%0 Journal Article
%1 nguyen2009efficient
%A Vu, Nguyen
%A Gopalkrishnan, Vivekanand
%D 2009
%J Machine Learning and Knowledge Discovery in Databases
%K 2009 clustering detection ecml outlier pkdd
%P 160--175
%T Efficient Pruning Schemes for Distance-Based Outlier Detection
%U http://dx.doi.org/10.1007/978-3-642-04174-7_11
%X Outlier detection finds many applications, especially in domains that have scope for abnormal behavior. In this paper, we
present a new technique for detecting distance-based outliers, aimed at reducing execution time associated with the detectionprocess. Our approach operates in two phases and employs three pruning rules. In the first phase, we partition the data intoclusters, and make an early estimate on the lower bound of outlier scores. Based on this lower bound, the second phase thenprocesses relevant clusters using the traditional block nested-loop algorithm. Here two efficient pruning rules are utilizedto quickly discard more non-outliers and reduce the search space. Detailed analysis of our approach shows that the additionaloverhead of the first phase is offset by the reduction in cost of the second phase. We also demonstrate the superiority ofour approach over existing distance-based outlier detection methods by extensive empirical studies on real datasets.

PUMA

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficient Pruning Schemes for Distance-Based Outlier Detection

Zusammenfassung

Links und Ressourcen

Kommentare und Rezensionen
(0)

Tags

Zitieren Sie diese Publikation

Metadaten

Community

Tags (@folkes Tags hervorgehoben)

PUMA

KopierenLöschenDiese Publikation zur Ablage hinzufügenCommunity-EintragVersionsverlauf dieses EintragsURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Efficient Pruning Schemes for Distance-Based Outlier Detection

Zusammenfassung

Links und Ressourcen

Kommentare und Rezensionen (0)

Tags

Zitieren Sie diese Publikation

Metadaten

Community

Tags (@folkes Tags hervorgehoben)

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficient Pruning Schemes for Distance-Based Outlier Detection

Kommentare und Rezensionen
(0)