20 Newsgroups
Abstract
This data set consists of 20000 messages taken from 20 Usenet newsgroups.
Information files:
description of the data
Data files:
20_newsgroups.tar.gz (17.3M; 61.6M uncompressed)
mini_newsgroups.tar.gz A subset composed of 100 articles from each newsgroup. (1.9M; 6.2M uncompressed)
TIR 2010
7th International Workshop on Text-based Information Retrieval
in conjunction with DEXA 2010
University of Deusto
Bilbao, Spain
30 August - 3 September 2010
M. Sanderson, und W. Croft. Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'99, Seite 206--213. (1999)
S. Staab, und A. Hotho. Intelligent Information Processing and Web Mining, Proceedings of the International IIS: IIPWM'03 Conference held in Zakopane, Seite 451-452. (2003)
A. Hotho, S. Staab, und G. Stumme. Proc. of the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD, Volume 2838 von LNCS, Seite 217-228. (2003)
B. Lauser, und A. Hotho. Proc. of the 7th European Conference in Research and Advanced Technology for Digital Libraries, ECDL 2003, Volume 2769 von LNCS, Seite 140-151. Springer, (2003)
P. Cimiano, A. Hotho, und S. Staab. Proceedings of the Conference on Languages Resources and Evaluation (LREC), Lisbon, Portugal, ELRA - European Language Ressources Association, (Mai 2004)
A. Hotho, und G. Stumme. Proceedings of FGML Workshop, Seite 37-45. Special Interest Group of German Informatics Society (FGML --- Fachgruppe Maschinelles Lernen der GI e.V.), (2002)
A. Hotho, A. Maedche, und S. Staab. ICDM '01: Proceedings of the 2001 IEEE International Conference on Data Mining, Seite 607--608. Washington, DC, USA, IEEE Computer Society, (2001)
L. Baker, und A. McCallum. Proceedings of SIGIR-98, 21st ACM International Conference on Research and Development in Information Retrieval, Seite 96--103. Melbourne, AU, ACM Press, New York, US, (1998)
G. Ifrim, M. Theobald, und G. Weikum. Proceedings of the 22nd International Conference on Machine Learning - Learning in Web Search (LWS 2005), Seite 18--26. Bonn, Germany, (2005)