Denoyer, L. & Gallinari, P. (2006), 'The Wikipedia XML Corpus', SIGIR Forum .
Lewis, D. D.; Yang, Y.; Rose, T. G. & Li, F. (2004), 'RCV1: A New Benchmark Collection for Text Categorization Research', Journal of Machine Learning Research 5 (Apr), 361--397.
Halevy, A. Y. & Madhavan, J. (2003),
Corpus-Based Knowledge Representation, in
Georg Gottlob & Toby Walsh, ed.,
'IJCAI-03, Proceedings of the Eighteenth International Joint Conference
on Artificial Intelligence, Acapulco, Mexico, August 9-15, 2003'
, Morgan Kaufmann,
, pp. 1567-1572
.
Jiang, J. J. & Conrath, D. W. (1997), 'Semantic similarity based on corpus statistics and lexical taxonomy', CoRR cmp-lg/9709008 .