%0 %0 Book Section %A Piskorski, Jakub & Yangarber, Roman %D 2013 %T Information Extraction: Past, Present and Future %E Poibeau, Thierry; Saggion, Horacio; Piskorski, Jakub & Yangarber, Roman %B Multi-source, Multilingual Information Extraction and Summarization %C %I Springer Berlin Heidelberg %V %6 %N %P 23-49 %& %Y %S Theory and Applications of Natural Language Processing %7 %8 %9 %? %! %Z %@ 978-3-642-28568-4 %( %) %* %L %M %1 %2 Information Extraction: Past, Present and Future - Springer %3 incollection %4 %# %$ %F piskorski2013information %K extraction, information, sota, survey %X In this chapter we present a brief overview of Information Extraction, which is an area of natural language processing that deals with finding factual information in free text. In formal terms, %Z %U http://dx.doi.org/10.1007/978-3-642-28569-1_2 %+ %^ %0 %0 Conference Proceedings %A %D 2013 %T Multi-source, multilingual information extraction and summarization %E Poibeau, Thierry; Saggion, Horacio; Piskorski, Jakub & Yangarber, Roman %B %C Berlin; New York %I Springer %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ 9783642285691 3642285694 3642285686 9783642285684 %( %) %* %L %M %1 %2 Multi-source, Multilingual Information Extraction and Summarization - Springer %3 proceedings %4 %# %$ %F thierrypoibeau2013multisource %K extraction, information, multi, multilingual, sota, summarization %X Information extraction (IE) and text summarization (TS) are powerful technologies for finding relevant pieces of information in text and presenting them to the user in condensed form. The ongoing information explosion makes IE and TS critical for successful functioning within the information society. These technologies face particular challenges due to the inherent multi-source nature of the information explosion. 
The technologies must now handle not isolated texts or individual narratives, but rather large-scale repositories and streams--in general, in multiple languages--containing a multiplicity of perspectives, opinions, or commentaries on particular topics, entities or events. There is thus a need to adapt existing techniques and develop new ones to deal with these challenges. This volume contains a selection of papers that present a variety of methodologies for content identification and extraction, as well as for content fusion and regeneration. The chapters cover various aspects of the challenges, depending on the nature of the information sought--names vs. events,-- and the nature of the sources--news streams vs. image captions vs. scientific research papers, etc. This volume aims to offer a broad and representative sample of studies from this very active research field. %Z %U http://link.springer.com/book/10.1007/978-3-642-28569-1 %+ %^ %0 %0 Generic %A Bakshy, Eytan; Rosenn, Itamar; Marlow, Cameron & Adamic, Lada %D 2012 %T The Role of Social Networks in Information Diffusion %E %B %C %I %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 The Role of Social Networks in Information Diffusion %3 misc %4 %# %$ %F bakshy2012social %K diffusion, facebook, information, toread %X Online social networking technologies enable individuals to simultaneously share information with any number of peers. Quantifying the causal effect of these technologies on the dissemination of information requires not only identification of who influences whom, but also of whether individuals would still propagate information in the absence of social signals about that information. We examine the role of social networks in online information diffusion with a large-scale field experiment that randomizes exposure to signals about friends' information sharing among 253 million subjects in situ. 
Those who are exposed are significantly more likely to spread information, and do so sooner than those who are not exposed. We further examine the relative role of strong and weak ties in information propagation. We show that, although stronger ties are individually more influential, it is the more abundant weak ties who are responsible for the propagation of novel information. This suggests that weak ties may play a more dominant role in the dissemination of information online than currently believed. %Z cite arxiv:1201.4145 Comment: 10 pages, 7 figures. In the Proceedings of ACM WWW 2012, April 16-20, 2012, Lyon, France %U http://arxiv.org/abs/1201.4145 %+ %^ %0 %0 Conference Proceedings %A Klügl, Peter; Toepfer, Martin; Lemmerich, Florian; Hotho, Andreas & Puppe, Frank %D 2012 %T Collective Information Extraction with Context-Specific Consistencies. %E Flach, Peter A.; Bie, Tijl De & Cristianini, Nello %B ECML/PKDD (1) %C %I Springer %V 7523 %6 %N %P 728-743 %& %Y %S Lecture Notes in Computer Science %7 %8 %9 %? %! %Z %@ 978-3-642-33459-7 %( %) %* %L %M %1 %2 %3 inproceedings %4 conf/pkdd/2012-1 %# %$ %F conf/pkdd/KluglTLHP12 %K 2012, context, extraction, ie, information, myown %X %Z %U http://dblp.uni-trier.de/db/conf/pkdd/pkdd2012-1.html#KluglTLHP12 %+ %^ %0 %0 Conference Proceedings %A Toepfer, Martin; Kluegl, Peter; Hotho, Andreas & Puppe, Frank %D 2010 %T Conditional Random Fields For Local Adaptive Reference Extraction %E Atzmüller, Martin; Benz, Dominik; Hotho, Andreas & Stumme, Gerd %B Proceedings of LWA2010 - Workshop-Woche: Lernen, Wissen & Adaptivitaet %C Kassel, Germany %I %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 inproceedings %4 lwa2010 %# %$ %F kdml21 %K 2010, crf, extraction, information, myown %X The accurate extraction of bibliographic information from scientific publications is an active field of research. 
Machine learning and sequence labeling approaches like Conditional Random Fields (CRF) are often applied for this reference extraction task, but still suffer from the ambiguity of reference notation. Reference sections apply a predefined style guide and contain only homogeneous references. Therefore, other references of the same paper or journal often provide evidence how the fields of a reference are correctly labeled. We propose a novel approach that exploits the similarities within a document. Our process model uses information of unlabeled documents directly during the extraction task in order to automatically adapt to the perceived style guide. This is implemented by changing the manifestation of the features for the applied CRF. The experimental results show considerable improvements compared to the common approach. We achieve an average F1 score of 96.7% and an instance accuracy of 85.4% on the test data set. %Z %U http://www.kde.cs.uni-kassel.de/conf/lwa10/papers/kdml21.pdf %+ %^ %0 %0 Book %A Manning, Christopher D.; Raghavan, Prabhakar & Schütze, Hinrich %D 2008 %T Introduction to Information Retrieval %E %B %C %I Cambridge University Press %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 Introduction to Information Retrieval %3 book %4 %# %$ %F manning2008 %K information, introduction, ir, retrieval, sota %X %Z %U %+ %^ %0 %0 Conference Proceedings %A Sorg, Philipp & Cimiano, Philipp %D 2008 %T Cross-lingual Information Retrieval with Explicit Semantic Analysis %E %B Working Notes for the CLEF 2008 Workshop %C %I %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 inproceedings %4 %# %$ %F sorg2008cirwesa %K cross, information, lingual, ol, ontology %X %Z %U http://www.aifb.kit.edu/images/7/7c/2008_1837_Sorg_Cross-lingual_I_1.pdf %+ %^ %0 %0 Conference Proceedings %A Hassan-Montero, Y. & Herrero-Solana, V. 
%D 2006 %T Improving Tag-Clouds as Visual Information Retrieval Interfaces %E %B InScit2006: International Conference on Multidisciplinary Information Sciences and Technologies %C %I %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 inproceedings %4 %# michael %$ %F HaHe06 %K clouds, dataset, del.icio.us, information, tag, tagging, taggingsurvey, toread, visual %X Tagging-based systems enable users to categorize web resources by means of tags (freely chosen keywords), in order to re-finding these resources later. Tagging is implicitly also a social indexing process, since users share their tags and resources, constructing a social tag index, so-called folksonomy. At the same time of tagging-based system, has been popularised an interface model for visual information retrieval known as Tag-Cloud. In this model, the most frequently used tags are displayed in alphabetical order. This paper presents a novel approach to Tag-Cloud's tags selection, and proposes the use of clustering algorithms for visual layout, with the aim of improve browsing experience. The results suggest that presented approach reduces the semantic density of tag set, and improves the visual consistency of Tag-Cloud layout. %Z %U http://nosolousabilidad.com/hassan/improving_tagclouds.pdf %+ %^ %0 %0 Journal Article %A Kayed, Mohammed & Shaalan, Khaled F. %D 2006 %T A Survey of Web Information Extraction Systems %E %B IEEE Transactions on Knowledge and Data Engineering %C %I IEEE Educational Activities Department %V 18 %6 %N 10 %P 1411--1428 %& %Y %S %7 %8 %9 %? %! %Z %@ 1041-4347 %( %) %* %L %M %1 %2 A Survey of Web Information Extraction Systems %3 article %4 %# %$ %F 1159300 %K extraction, information, survey, ie %X %Z Member-Chia-Hui Chang and Member-Moheb Ramzy Girgis %U %+ %^ %0 %0 Conference Proceedings %A Tang, Jie; Hong, MingCai; Li, Juan-Zi & Liang, Bangyong %D 2006 %T Tree-Structured Conditional Random Fields for Semantic Annotation. 
%E Cruz, Isabel F.; Decker, Stefan; Allemang, Dean; Preist, Chris; Schwabe, Daniel; Mika, Peter; Uschold, Michael & Aroyo, Lora %B International Semantic Web Conference %C %I Springer %V 4273 %6 %N %P 640-653 %& %Y %S Lecture Notes in Computer Science %7 %8 %9 %? %! %Z %@ 3-540-49029-9 %( %) %* %L %M %1 %2 dblp %3 inproceedings %4 conf/semweb/2006 %# %$ %F conf/semweb/TangHLL06 %K annotation, crf, extraction, information, ml, semantic %X %Z %U http://dblp.uni-trier.de/db/conf/semweb/iswc2006.html#TangHLL06 %+ %^ %0 %0 Conference Proceedings %A Culotta, A.; Bekkerman, R. & McCallum, A. %D 2004 %T Extracting social networks and contact information from email and the Web %E %B Proc. Conference on Email and Anti-Spam (CEAS) %C Mountain View, USA %I %V %6 %N %P %& %Y %S %7 %8 July %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 inproceedings %4 %# %$ %F culotta04 %K extraction, social, information, email, oe, networks %X %Z %U %+ %^ %0 %0 Conference Proceedings %A Peng, Fuchun & McCallum, Andrew %D 2004 %T Accurate Information Extraction from Research Papers using Conditional Random Fields %E %B HLT-NAACL %C %I %V %6 %N %P 329-336 %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 dblp %3 inproceedings %4 %# %$ %F pm04accurate %K extraction, bibtex, information, ie, bibliographic, references %X %Z %U http://www.cs.umass.edu/~mccallum/papers/hlt2004.pdf %+ %^ %0 %0 Book %A Ferber, Reginald %D 2003 %T Information Retrieval: Suchmodelle und Data-Mining-Verfahren für Textsammlungen und das Web %E %B %C Heidelberg %I dpunkt Verlag %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 book %4 %# %$ %F ferber2003information %K information, ir, lecture, mining, retrieval, standard, vorlesung %X %Z %U http://information-retrieval.de/ %+ %^ %0 %0 Generic %A Newman, M. E. J. %D 2003 %T The structure and function of complex networks %E %B %C %I %V %6 %N %P %& %Y %S %7 %8 March %9 %? %! 
%Z %@ %( %) %* %L %M %1 %2 %3 misc %4 %# %$ %F citeulike:155 %K algorithm, clustering, complex_systems, folksonomy, information, kdubiq, network, retrieval, scale_free_networks, small, socialnetwork, summerschool, theory, web, web_graph, world %X Inspired by empirical studies of networked systems such as the Internet, social networks, and biological networks, researchers have in recent years developed a variety of techniques and models to help us understand or predict the behavior of these systems. Here we review developments in this field, including such concepts as the small-world effect, degree distributions, clustering, network correlations, random graph models, models of network growth and preferential attachment, and dynamical processes taking place on networks. %Z %U http://arxiv.org/abs/cond-mat/0303516 %+ %^ %0 %0 Journal Article %A Maedche, A.; Staab, S.; Studer, R.; Sure, Y. & Volz, R. %D 2002 %T SEAL -- Tying up Information Integration and Web Site Management by Ontologies %E %B IEEE-CS Data Engineering Bulletin, Special Issue on Organizing and Discovering the Semantic Web %C %I %V %6 %N %P %& %Y %S %7 %8 March %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 article %4 %# %$ %F seal-ieeecs %K ontology, information, portal, integration %X %Z %U %+ %^ %0 %0 Book %A Baeza-Yates, Ricardo A. & Ribeiro-Neto, Berthier A. %D 1999 %T Modern Information Retrieval %E %B %C %I ACM Press / Addison-Wesley %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ 0-201-39829-X %( %) %* %L %M %1 %2 dblp %3 book %4 %# %$ %F books/aw/Baeza-YatesR99 %K information, ir, lecture, retrieval, standard, vorlesung %X %Z %U http://www.ischool.berkeley.edu/~hearst/irbook/glossary.html %+ %^ %0 %0 Journal Article %A Crestani, F. %D 1997 %T Application of Spreading Activation Techniques in Information Retrieval %E %B Artificial Intelligence Review %C %I %V 11 %6 %N 6 %P 453--482 %& %Y %S %7 %8 December %9 %? %! 
%Z %@ %( %) %* %L %M %1 %2 SpringerLink - Zeitschriftenbeitrag %3 article %4 %# %$ %F crestani1997spreading %K ***, activation, information, ir, msn, network, retrieval, search, semantic, spreading, survey %X This paper surveys the use of Spreading Activation techniques on Semantic Networks in Associative Information Retrieval. The major Spreading Activation models are presented and their applications to IR is surveyed. A number of works in this area are critically analyzed in order to study the relevance of Spreading Activation for associative IR. %Z %U http://dx.doi.org/10.1023/A:1006569829653 %+ %^ %0 %0 Book %A %D 1997 %T Readings in Information Retrieval %E Sparck-Jones, K. & Willett, P. %B %C %I Morgan Kaufmann %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 book %4 %# %$ sparck-jones+willett97 %F sparck-jones+willett97 %K retrieval, ir, information, readings %X %Z %U %+ %^ %0 %0 Book %A Sowa, J. F. %D 1984 %T Conceptual Structures: Information Processing in Mind and Machine %E %B %C Reading, MA %I Addison-Wesley Publishing Company %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 book %4 %# %$ %F sowa84 %K processing, information, structures, machine, mind, conceptual %X %Z %U %+ %^ %0 %0 Book %A van Rijsbergen, C. J. %D 1979 %T Information retrieval %E %B %C London %I Butterworths %V %6 %N %P %& %Y %S %7 2 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 book %4 %# %$ %F rijsbergen79information %K advanced, information, ir, lecture, vorlesung %X %Z %U http://www.dcs.gla.ac.uk/Keith/Preface.html %+ %^