This is the home page of the ParsCit project, which performs reference string parsing, sometimes also called citation parsing or citation extraction. It is implemented as a supervised machine learning procedure that uses Conditional Random Fields as its learning mechanism. You can download the code below, parse strings online, or send batch jobs to our web service (coming soon!). The code includes the training data, the feature generator, and shell scripts to connect the system to a web service (used here too).
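To illustrate what a feature generator for reference string parsing might produce, here is a minimal sketch in Python. It is not ParsCit's actual feature set or code: the function names (`tokenize`, `token_features`, `featurize`) and the specific features are assumptions chosen for illustration. The idea is that each token of a reference string is turned into a set of observations (capitalization, digits, punctuation, position, neighbors) that a Conditional Random Field can then learn to map to labels such as author, title, or date.

```python
import re


def tokenize(reference: str) -> list[str]:
    """Split a reference string on whitespace (a deliberate simplification)."""
    return reference.split()


def token_features(tokens: list[str], i: int) -> dict[str, object]:
    """Build a hypothetical feature dictionary for the token at position i."""
    tok = tokens[i]
    # Strip surrounding punctuation so "(2001)" and "2001" look alike.
    core = tok.strip(".,;:()[]\"'")
    return {
        "lower": core.lower(),
        "is_capitalized": core[:1].isupper(),
        "is_all_digits": core.isdigit(),
        "looks_like_year": bool(re.fullmatch(r"(19|20)\d{2}", core)),
        "looks_like_page_range": bool(re.fullmatch(r"\d+-{1,2}\d+", core)),
        "ends_with_comma": tok.endswith(","),
        "ends_with_period": tok.endswith("."),
        # Position in the string, from 0.0 (first token) to 1.0 (last token).
        "relative_position": round(i / max(len(tokens) - 1, 1), 2),
        # Neighboring tokens give the sequence model local context.
        "prev_lower": tokens[i - 1].lower() if i > 0 else "<BOS>",
        "next_lower": tokens[i + 1].lower() if i < len(tokens) - 1 else "<EOS>",
    }


def featurize(reference: str) -> list[dict[str, object]]:
    """Return one feature dictionary per token of the reference string."""
    tokens = tokenize(reference)
    return [token_features(tokens, i) for i in range(len(tokens))]


if __name__ == "__main__":
    example = (
        "J. Lafferty, A. McCallum, and F. Pereira. Proceedings of the "
        "Eighteenth International Conference on Machine Learning, "
        "page 282--289. (2001)"
    )
    for token, feats in zip(tokenize(example), featurize(example)):
        print(token, feats)
```

In a supervised setup, feature sequences like these would be paired with hand-labeled tokens from the training data and passed to a CRF toolkit for training. The sketch stops before that step because the choice of toolkit and label set is not specified here.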
Neil Ireson, Fabio Ciravegna, Marie Elaine Califf, Dayne Freitag, Nicholas Kushmerick, Alberto Lavelli: Evaluating Machine Learning for Information Extraction, 22nd International Conference on Machine Learning (ICML 2005), Bonn, Germany, 7-11 August, 2005
R. Mihalcea, and A. Csomai. Proceedings of the sixteenth ACM Conference on information and knowledge management, page 233--242. New York, NY, USA, ACM, (2007)
O. Gunes, C. Schallhart, T. Furche, J. Lehmann, and A. Ngomo. Proceedings of the 3rd Workshop on the People's Web Meets NLP: Collaboratively Constructed Semantic Resources and their Applications to NLP, page 29--33. Association for Computational Linguistics, (July 2012)
E. Garbin, and I. Mani. Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, page 363--370. Stroudsburg, PA, USA, Association for Computational Linguistics, (2005)
B. Martins, H. Manguinhas, and J. Borbinha. Proceedings of the International Conference on Semantic Computing, page 1--9. IEEE Computer Society, (August 2008)
T. Tezuka, R. Lee, Y. Kambayashi, and H. Takakura. Proceedings of the Second International Conference on Web Information Systems Engineering, 2, page 14--21. (December 2001)
J. Lafferty, A. McCallum, and F. Pereira. Proceedings of the Eighteenth International Conference on Machine Learning, page 282--289. San Francisco, CA, USA, Morgan Kaufmann Publishers Inc., (2001)
M. Granitzer, M. Hristakeva, R. Knight, K. Jack, and R. Kern. Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics, page 19:1--19:8. New York, NY, USA, ACM, (2012)
Y. Li, A. Wen, Q. Lin, R. Li, and Z. Lu. Web-Age Information Management, volume 6897 of Lecture Notes in Computer Science, Springer, Berlin/Heidelberg, (2011)
X. Chai, B. Vuong, A. Doan, and J. Naughton. Proceedings of the 35th SIGMOD international conference on Management of data, page 87--100. New York, NY, USA, ACM, (2009)
M. Atzmueller, and S. Beer. Proc. 55th IWK, International Workshop on Design, Evaluation and Refinement of Intelligent Systems (DERIS), University of Ilmenau, (2010)
T. Rattenbury, N. Good, and M. Naaman. SIGIR '07: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, page 103--110. New York, NY, USA, ACM Press, (2007)
J. Tang, M. Hong, J. Li, and B. Liang. International Semantic Web Conference, volume 4273 of Lecture Notes in Computer Science, page 640--653. Springer, (2006)
P. Kluegl, M. Atzmueller, and F. Puppe. Proc. 4th International Workshop on Knowledge Engineering and Software Engineering (KESE 2008), 31st German Conference on Artificial Intelligence (KI-2008), accepted, (2008)
Y. Jin, Y. Matsuo, and M. Ishizuka. Proceedings of the European Semantic Web Conference, ESWC2007, volume 4519 of Lecture Notes in Computer Science, Springer-Verlag, (July 2007)
M. Kayed, K. Shaalan, C.-H. Chang, and M. R. Girgis. IEEE Transactions on Knowledge and Data Engineering, 18 (10): 1411--1428. (2006)
A. Takasu. JCDL '03: Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries, page 49--60. Washington, DC, USA, IEEE Computer Society, (2003)
G. Gottlob, C. Koch, R. Baumgartner, M. Herzog, and S. Flesca. Proceedings of the Twenty-third ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, June 14-16, 2004, Paris, France, page 1--12. ACM, (2004)