Tang‌, L. & Liu‌, H.: Community Detection and Mining in Social Media. 2010
[Volltext] [Kurzfassung] [BibTeX]

The past decade has witnessed the emergence of participatory Web and social media, bringing people together in many creative ways. Millions of users are playing, tagging, working, and socializing online, demonstrating new forms of collaboration, communication, and intelligence that were hardly imaginable just a short time ago. Social media also helps reshape business models, sway opinions and emotions, and opens up numerous possibilities to study human interaction and collective behavior in an unparalleled scale. This lecture, from a data mining perspective, introduces characteristics of social media, reviews representative tasks of computing with social media, and illustrates associated challenges. It introduces basic concepts, presents state-of-the-art algorithms with easy-to-understand examples, and recommends effective evaluation methods. In particular, we discuss graph-based community detection techniques and many important extensions that handle dynamic, heterogeneous networks in social media. We also demonstrate how discovered patterns of communities can be used for social media mining. The concepts, algorithms, and methods presented in this lecture can help harness the power of social media and support building socially-intelligent systems. This book is an accessible introduction to the study of community detection and mining in social media. It is an essential reading for students, researchers, and practitioners in disciplines and applications where social media is a key source of data that piques our curiosity to understand, manage, innovate, and excel. This book is supported by additional materials, including lecture slides, the complete set of figures, key references, some toy data sets used in the book, and the source code of representative algorithms. The readers are encouraged to visit the book website for the latest information. Table of Contents: Social Media and Social Computing / Nodes, Ties, and Influence / Community Detection and Evaluation / Communities in Heterogeneous Networks / Social Media Mining

Manning, C. D.; Raghavan, P. & Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, 2008
[Volltext] [BibTeX]

Lowd, D. & Meek, C.: Good Word Attacks on Statistical Spam Filters. (2005),
[Volltext] [BibTeX]

Boykin, P. & Roychowdhury, V.: Personal Email Networks: An Effective Anti-Spam Tool. , 2004
[Volltext] [BibTeX]

Carstensen, K.-U.; Eber, C.; Endriss, C.; Jekat, S.; Klabunde, R. & Langer, H. (Hrsg.): Computerlinguistik und Sprachtechnologie. Eine Einführung. Heidelberg: Spektrum Akademischer Verlag, 2004
[Volltext] [BibTeX]

Ferber, R.: Information Retrieval: Suchmodelle und Data-Mining-Verfahren für Textsammlungen und das Web. Heidelberg: dpunkt Verlag, 2003
[Volltext] [BibTeX]

MacKay, D. J. C.: Information Theory, Inference, and Learning Algorithms. (2003),
[Volltext] [BibTeX]

Berkhin, P.: Survey Of Clustering Data Mining Techniques. San Jose, CA, 2002
[Volltext] [BibTeX]

Sarwar, B. M.; Karypis, G.; Konstan, J. A. & Riedl, J.: Item-based collaborative filtering recommendation algorithms.. WWW. 2001, S. 285-295
[Volltext] [BibTeX]

Baeza-Yates, R. A. & Ribeiro-Neto, B. A.: Modern Information Retrieval. ACM Press / Addison-Wesley, 1999
[Volltext] [BibTeX]

Manning, C. D. & Schütze, H.: Foundations of Statistical Natural Language Processing. Cambridge, Massachusetts: The MIT Press, 1999
[Volltext] [BibTeX]

Witten, I. H.; Moffat, A. & Bell, T. C.: Managing Gigabytes: Compressing and Indexing Documents and Images, Second Edition. Morgan Kaufmann, 1999
[Volltext] [BibTeX]

Ng, R. T.; Lakshmanan, L. V. S.; Han, J. & Pang, A.: Exploratory Mining and Pruning Optimizations of Constrained Association Rules.. SIGMOD Conference. 1998, S. 13-24
[Volltext] [BibTeX]

Charniak, E.: Statistical Techniques for Natural Language Parsing. In: AI Magazine 18 (1997), Nr. 4, S. 33-44
[Volltext] [BibTeX]

Brachman, R. J. & Anand, T.: The Process of Knowledge Discovery in Databases.. Advances in Knowledge Discovery and Data Mining. 1996, S. 37-57
[Volltext] [BibTeX]

Fayyad, U. M.; Piatetsky-Shapiro, G.; Smyth, P. & Uthurusamy, R. (Hrsg.): Advances in Knowledge Discovery and Data Mining.. AAAI/MIT Press, 1996
[Volltext] [BibTeX]

Park, J.; Chen, M. & Yu, P.: An effective hash-based algorithm for mining association rules. In: Proceedings of the 1995 ACM SIGMOD international conference on Management of data (1995), S. 175-186
[BibTeX]

Agrawal, R. & Srikant, R.: Fast Algorithms for Mining Association Rules in Large Databases. VLDB '94: Proceedings of the 20th International Conference on Very Large Data Bases. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc., 1994, S. 487-499
[BibTeX]

van Rijsbergen, C. J.: Information retrieval. 2. Aufl. London: Butterworths, 1979
[Volltext] [BibTeX]

Dempster, A.; Laird, N. & Rubin, D.: Maximum likelihood from incomplete data via the EM algorithm.. In: J. Royal Statistical Society, Series B 39 (1977), Nr. 1, S. 1-38
[BibTeX]