Publications
The Wikipedia XML Corpus
Denoyer, L. & Gallinari, P.
SIGIR Forum (2006)
RCV1: A New Benchmark Collection for Text Categorization Research
Lewis, D. D.; Yang, Y.; Rose, T. G. & Li, F.
Journal of Machine Learning Research, 5(Apr) 361-397 (2004)
Corpus-Based Knowledge Representation
Halevy, A. Y. & Madhavan, J.
Gottlob, G. & Walsh, T., eds., 'IJCAI-03, Proceedings of the Eighteenth International Joint Conference
on Artificial Intelligence, Acapulco, Mexico, August 9-15, 2003', Morgan Kaufmann, 1567-1572 (2003)
Semantic similarity based on corpus statistics and lexical taxonomy
Jiang, J. J. & Conrath, D. W.
CoRR, cmp-lg/9709008 (1997)