Sinha, A., Shen, Z., Song, Y., Ma, H., Eide, D., Hsu, B.-J. P. & Wang, K. (2015). An Overview of Microsoft Academic Service (MAS) and Applications.. In A. Gangemi, S. Leonardi & A. Panconesi (eds.), WWW (Companion Volume) (p./pp. 243-246), : ACM. ISBN: 978-1-4503-3473-0

Adomavicius, G. & Zhang, J. (2012). Impact of Data Characteristics on Recommender Systems Performance. ACM Trans. Manage. Inf. Syst., 3, 3:1--3:17. doi: 10.1145/2151163.2151166

Zubiaga, A., Fresno, V., Martinez, R. & Garcia-Plaza, A. P. (2012). Harnessing Folksonomies to Produce a Social Classification of Resources. IEEE Transactions on Knowledge and Data Engineering, 99. doi: http://doi.ieeecomputersociety.org/10.1109/TKDE.2012.115

La Rowe, G., Ambre, S., Burgoon, J., Ke, W. & Börner, K. (2009). The Scholarly Database and its utility for scientometrics research. Scientometrics, 79, 219--234. doi: 10.1007/s11192-009-0414-2

Capocci, A. & Caldarelli, G. (2008). Folksonomies and clustering in the collaborative system CiteULike. Journal of Physics A: Mathematical and Theoretical, 41, 224016 (7pp).

Caverlee, J. & Webb, S. (2008). A Large-Scale Study of MySpace:
servations and Implications for Online Social Networks. Proceedings from the 2nd International Conference on Weblogs and Social Media (AAAI), .

Narayanan, A. & Shmatikov, V. (2008). Robust De-anonymization of Large Sparse Datasets. Proc. of the 29th IEEE Symposium on Security and Privacy (p./pp. 111--125), May, : IEEE Computer Society.

Song, Y., Zhang, L. & Giles, C. L. (2008). A sparse gaussian processes classification framework for fast tag suggestions. CIKM '08: Proceeding of the 17th ACM conference on Information and knowledge mining (p./pp. 93--102), New York, NY, USA: ACM. ISBN: 978-1-59593-991-3

Hassan-Montero, Y. & Herrero-Solana, V. (2006). Improving Tag-Clouds as Visual Information Retrieval Interfaces. InScit2006: International Conference on Multidisciplinary Information Sciences and Technologies, .

Liu, V. & Curran, J. R. (2006). Web Text Corpus for Natural Language Processing.. EACL, : The Association for Computer Linguistics. ISBN: 1-932432-59-0

Narayanan, A. & Shmatikov, V. (2006). How To Break Anonymity of the Netflix Prize Dataset

McRae, K., Cree, G. S., Seidenberg, M. S. & McNorgan, C. (2005). Semantic feature production norms for a large set of living and nonliving things. Behav Res Methods, 37, 547-559.

Newman, C. B. D. & Merz, C. (1998). UCI Repository of machine learning databases (Technical report, University of California, Irvine, Dept. of Information and Computer Sciences)