Principal Direction Divisive Partitioning
D. Boley.
Data Mining and Knowledge Discovery (1997)

We propose a new algorithm capable of partitioning a set of documents or other samples based on an embedding in a high dimensional Euclidean space (i.e. in which every document is a vector of real numbers). The method is unusual in that it is divisive, as opposed to agglomerative, and operates by repeatedly splitting clusters into smaller clusters. The splits are not based on any distance or similarity measure. The documents are assembled in to a matrix which is very sparse. It is this sparsity that permits the algorithm to be very efficient. The performance of the method is illustrated with a set of text documents obtained from the World Wide Web. Some possible extensions are proposed for further investigation.

Suchen auf

Diese Publikation wurde noch nicht bewertet.

Bewertungsverteilung

Durchschnittliche Benutzerbewertung0,0 von 5.0 auf Grundlage von 0 Rezensionen

Bitte melden Sie sich an um selbst Rezensionen oder Kommentare zu erstellen.

@article{Boley97principaldirection,
  abstract = {We propose a new algorithm capable of partitioning a set of documents or other samples based on an embedding in a high dimensional Euclidean space (i.e. in which every document is a vector of real numbers). The method is unusual in that it is divisive, as opposed to agglomerative, and operates by repeatedly splitting clusters into smaller clusters. The splits are not based on any distance or similarity measure. The documents are assembled in to a matrix which is very sparse. It is this sparsity that permits the algorithm to be very efficient. The performance of the method is illustrated with a set of text documents obtained from the World Wide Web. Some possible extensions are proposed for further investigation.},
  added-at = {2010-05-04T08:55:46.000+0200},
  author = {Boley, Daniel},
  biburl = {https://puma.uni-kassel.de/bibtex/2bca740460f14035af773f665887b6fa4/folke},
  interhash = {281afd06bd3e21ec3ef212da4ec18ee0},
  intrahash = {bca740460f14035af773f665887b6fa4},
  journal = {Data Mining and Knowledge Discovery},
  keywords = {clustering community detection divisive svd},
  pages = {325--344},
  timestamp = {2010-05-04T08:55:49.000+0200},
  title = {Principal Direction Divisive Partitioning},
  volume = 2,
  year = 1997
}

%0 Journal Article
%1 Boley97principaldirection
%A Boley, Daniel
%D 1997
%J Data Mining and Knowledge Discovery
%K clustering community detection divisive svd
%P 325--344
%T Principal Direction Divisive Partitioning
%V 2
%X We propose a new algorithm capable of partitioning a set of documents or other samples based on an embedding in a high dimensional Euclidean space (i.e. in which every document is a vector of real numbers). The method is unusual in that it is divisive, as opposed to agglomerative, and operates by repeatedly splitting clusters into smaller clusters. The splits are not based on any distance or similarity measure. The documents are assembled in to a matrix which is very sparse. It is this sparsity that permits the algorithm to be very efficient. The performance of the method is illustrated with a set of text documents obtained from the World Wide Web. Some possible extensions are proposed for further investigation.

PUMA

Principal Direction Divisive Partitioning
D. Boley.
Data Mining and Knowledge Discovery (1997)

Tags

Nutzer

Kommentare und Rezensionen

Zitieren Sie diese Publikation

PUMA

Principal Direction Divisive PartitioningD. Boley. Data Mining and Knowledge Discovery (1997)

Tags

Nutzer

Kommentare und Rezensionen

Zitieren Sie diese Publikation

Principal Direction Divisive Partitioning
D. Boley.
Data Mining and Knowledge Discovery (1997)