Rubin, T. N., Chambers, A., Smyth, P. & Steyvers, M. (2011). Statistical Topic Models for Multi-Label Document Classification (cite arxiv:1107.2462)

Carpena, P., Bernaola-Galván, P., Hackenberg, M., Coronado, A. V. & Oliver, J. L. (2009). Level statistics of words: Finding keywords in literary texts and symbolic sequences. Physical Review E (Statistical, Nonlinear, and Soft Matter Physics), 79, 035102. doi: 10.1103/PhysRevE.79.035102

Huang, A., Milne, D. N., Frank, E. & Witten, I. H. (2009). Clustering Documents Using a Wikipedia-Based Concept Representation.. In T. Theeramunkong, B. Kijsirikul, N. Cercone & T. B. Ho (eds.), PAKDD (p./pp. 628-636), : Springer. ISBN: 978-3-642-01306-5

Heyer, G., Quasthoff, U.,, Wittig, T. (2008). Text Mining: Wissensrohstoff Text. Herdecke ; Bochum: W3L-Verl.. ISBN: 978-3-937137-30-8

Berendt, B., Hotho, A., Mladenic, D. & Semeraro, G. (eds.) (2007). From Web to Social Web: Discovering and Deploying User and Content Profiles (Vol. 4736). Springer. ISBN: 978-3-540-74950-9

Feldman, R., Sanger, J. (2007). The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data. Cambridge University Press. ISBN: 0521836573

Colas, F. & Brazdil, P. (2006). On the Behavior of SVM and Some Older Algorithms in Binary Text Classification Tasks. Text, Speech and Dialogue, , 45--52.

Crane, G. (2006). What Do You Do with a Million Books?. D-Lib Magazine, 12. doi: 10.1045/march2006-crane

Weiss, S. M., Indurkhya, N.,, Zhang, T. (2004). Text Mining. Predictive Methods for Analyzing Unstructured Information. Springer, Berlin. ISBN: 0387954333

Hotho, A., Maedche, A. & Staab, S. (2001). Text Clustering Based on Good Aggregations. ICDM '01: Proceedings of the 2001 IEEE International Conference on Data Mining (p./pp. 607--608), Washington, DC, USA: IEEE Computer Society. ISBN: 0-7695-1119-8