TY  - JOUR
AU  - Denoyer, Ludovic
AU  - Gallinari, Patrick
T1  - The Wikipedia XML Corpus
JO  - SIGIR Forum
PY  - 2006/
VL  - 
IS  - 
SP  - 
EP  - 
UR  - http://www-connex.lip6.fr/~denoyer/wikipediaXML/
DO  - 
KW  - data
KW  - dm
KW  - mining
KW  - xml
KW  - corpus
KW  - ml
KW  - wikipedia
L1  - 
SN  - 
N1  - Wikipedia XML Corpus
N1  - 
AB  - 
ER  - 

TY  - JOUR
AU  - Lewis, D. D.
AU  - Yang, Y.
AU  - Rose, T. G.
AU  - Li, F.
T1  - RCV1: A New Benchmark Collection for Text Categorization Research
JO  - Journal of Machine Learning Research
PY  - 2004/
VL  - 5
IS  - Apr
SP  - 361
EP  - 397
UR  - http://www.jmlr.org/papers/volume5/lewis04a/lewis04a.pdf
DO  - 
KW  - benchmark
KW  - text
KW  - classification
KW  - RCV1
KW  - reuters
KW  - corpus
L1  - 
SN  - 
N1  - 
N1  - 
AB  - 
ER  - 

TY  - CONF
AU  - Halevy, Alon Y.
AU  - Madhavan, Jayant
A2  - Gottlob, Georg
A2  - Walsh, Toby
T1  - Corpus-Based Knowledge Representation
T2  - IJCAI-03, Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, Acapulco, Mexico, August 9-15, 2003
PB  - Morgan Kaufmann
C1  - 
PY  - 2003/
CY  - 
VL  - 
IS  - 
SP  - 1567
EP  - 1572
UR  - 
DO  - 
KW  - representation
KW  - knowledge
KW  - based
KW  - corpus
L1  - 
SN  - 
N1  - 
N1  - 
AB  - 
ER  - 

TY  - JOUR
AU  - Jiang, Jay J.
AU  - Conrath, David W.
T1  - Semantic similarity based on corpus statistics and lexical taxonomy
JO  - CoRR
PY  - 1997/
VL  - cmp-lg/9709008
IS  - 
SP  - 
EP  - 
UR  - 
DO  - 
KW  - corpus
KW  - semantic
KW  - similarity
KW  - wordnet
L1  - 
SN  - 
N1  - 
N1  - 
AB  - 
ER  - 