Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reading Tea Leaves: How Humans Interpret Topic Models

J. Chang, J. Boyd-Graber, S. Gerrish, C. Wang, und D. Blei. NIPS, Seite 288--296. Curran Associates, Inc., (2009)

Zusammenfassung

Probabilistic topic models are a popular tool for the unsupervised analysis of text, providing both a predictive model of future text and a latent topic representation of the corpus. Practitioners typically assume that the latent space is semantically meaningful. It is used to check models, summarize the corpus, and guide exploration of its contents. However, whether the latent space is interpretable is in need of quantitative evaluation. In this paper, we present new quantitative methods for measuring semantic meaning in inferred topics. We back these measures with large-scale user studies, showing that they capture aspects of the model that are undetected by previous measures of model quality based on held-out likelihood. Surprisingly, topic models which perform better on held-out likelihood may infer less semantically meaningful topics.

Links und Ressourcen

URL:

http://books.nips.cc/papers/files/nips22/NIPS2009_0125.pdf

BibTeX-Schlüssel:

chang2009reading

Suchen auf:

Kommentare und Rezensionen
(0)

Es gibt bisher keine Rezension oder Kommentar. Sie können eine schreiben!

Zitieren Sie diese Publikation

@inproceedings{chang2009reading,
  abstract = {Probabilistic topic models are a popular tool for the unsupervised analysis of text, providing both a predictive model of future text and a latent topic representation of the corpus. Practitioners typically assume that the latent space is semantically meaningful. It is used to check models, summarize the corpus, and guide exploration of its contents. However, whether the latent space is interpretable is in need of quantitative evaluation. In this paper, we present new quantitative methods for measuring semantic meaning in inferred topics. We back these measures with large-scale user studies, showing that they capture aspects of the model that are undetected by previous measures of model quality based on held-out likelihood. Surprisingly, topic models which perform better on held-out likelihood may infer less semantically meaningful topics.},
  added-at = {2012-06-01T09:20:37.000+0200},
  author = {Chang, Jonathan and Boyd-Graber, Jordan L. and Gerrish, Sean and Wang, Chong and Blei, David M.},
  biburl = {https://puma.uni-kassel.de/bibtex/2cd4cf8ff8a676ca7bbc4201ddbc2d024/jaeschke},
  booktitle = {NIPS},
  editor = {Bengio, Yoshua and Schuurmans, Dale and Lafferty, John D. and Williams, Christopher K. I. and Culotta, Aron},
  interhash = {48210cee941ee21e6282798e28270a6d},
  intrahash = {cd4cf8ff8a676ca7bbc4201ddbc2d024},
  isbn = {9781615679119},
  keywords = {cirg collective intelligence model topic},
  pages = {288--296},
  publisher = {Curran Associates, Inc.},
  timestamp = {2012-06-07T12:16:24.000+0200},
  title = {Reading Tea Leaves: How Humans Interpret Topic Models},
  url = {http://books.nips.cc/papers/files/nips22/NIPS2009_0125.pdf},
  year = 2009
}

%0 Conference Paper
%1 chang2009reading
%A Chang, Jonathan
%A Boyd-Graber, Jordan L.
%A Gerrish, Sean
%A Wang, Chong
%A Blei, David M.
%B NIPS
%D 2009
%E Bengio, Yoshua
%E Schuurmans, Dale
%E Lafferty, John D.
%E Williams, Christopher K. I.
%E Culotta, Aron
%I Curran Associates, Inc.
%K cirg collective intelligence model topic
%P 288--296
%T Reading Tea Leaves: How Humans Interpret Topic Models
%U http://books.nips.cc/papers/files/nips22/NIPS2009_0125.pdf
%X Probabilistic topic models are a popular tool for the unsupervised analysis of text, providing both a predictive model of future text and a latent topic representation of the corpus. Practitioners typically assume that the latent space is semantically meaningful. It is used to check models, summarize the corpus, and guide exploration of its contents. However, whether the latent space is interpretable is in need of quantitative evaluation. In this paper, we present new quantitative methods for measuring semantic meaning in inferred topics. We back these measures with large-scale user studies, showing that they capture aspects of the model that are undetected by previous measures of model quality based on held-out likelihood. Surprisingly, topic models which perform better on held-out likelihood may infer less semantically meaningful topics.
%@ 9781615679119

PUMA

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reading Tea Leaves: How Humans Interpret Topic Models

Zusammenfassung

Links und Ressourcen

Kommentare und Rezensionen
(0)

Tags

Zitieren Sie diese Publikation

Metadaten

Community

Tags (@jaeschkes Tags hervorgehoben)

PUMA

KopierenLöschenDiese Publikation zur Ablage hinzufügenCommunity-EintragVersionsverlauf dieses EintragsURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Reading Tea Leaves: How Humans Interpret Topic Models

Zusammenfassung

Links und Ressourcen

Kommentare und Rezensionen (0)

Tags

Zitieren Sie diese Publikation

Metadaten

Community

Tags (@jaeschkes Tags hervorgehoben)

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reading Tea Leaves: How Humans Interpret Topic Models

Kommentare und Rezensionen
(0)