%0 %0 Conference Proceedings %A Agirre, Eneko; Alfonseca, Enrique; Hall, Keith; Kravalova, Jana; Pa\cs,ca, Marius & Soroa, Aitor %D 2009 %T A study on similarity and relatedness using distributional and WordNet-based approaches %E %B Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics %C Stroudsburg, PA, USA %I Association for Computational Linguistics %V %6 %N %P 19--27 %& %Y %S NAACL '09 %7 %8 %9 %? %! %Z %@ 978-1-932432-41-1 %( %) %* %L %M %1 %2 A study on similarity and relatedness using distributional and WordNet-based approaches %3 inproceedings %4 %# %$ %F agirre2009study %K distributional, relatedness, semantic_similarity, similarity, wordnet %X This paper presents and compares WordNet-based and distributional similarity approaches. The strengths and weaknesses of each approach regarding similarity and relatedness tasks are discussed, and a combination is presented. Each of our methods independently provide the best results in their class on the RG and WordSim353 datasets, and a supervised combination of them yields the best published results on all datasets. Finally, we pioneer cross-lingual similarity, showing that our methods are easily adapted for a cross-lingual task with minor losses. %Z %U http://portal.acm.org/citation.cfm?id=1620754.1620758 %+ %^ %0 %0 Book Section %A Haridas, Mandar & Caragea, Doina %D 2009 %T Exploring Wikipedia and DMoz as Knowledge Bases for Engineering a User Interests Hierarchy for Social Network Applications %E Meersman, Robert; Dillon, Tharam & Herrero, Pilar %B On the Move to Meaningful Internet Systems: OTM 2009 %C Berlin / Heidelberg %I Springer %V 5871 %6 %N %P 1238-1245 %& %Y %S Lecture Notes in Computer Science %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 SpringerLink - Abstract %3 incollection %4 %# %$ %F haridas2009exploring %K dmoz, genta11, hierarchy, taxonomy, wordnet, ol_web2.0, data_wikis, methods_concepthierarchy %X The outgrowth of social networks in the recent years has resulted in opportunities for interesting data mining problems, such as interest or friendship recommendations. A global ontology over the interests specified by the users of a social network is essential for accurate recommendations. We propose, evaluate and compare three approaches to engineering a hierarchical ontology over user interests. The proposed approaches make use of two popular knowledge bases, Wikipedia and Directory Mozilla, to extract interest definitions and/or relationships between interests. More precisely, the first approach uses Wikipedia to find interest definitions, the latent semantic analysis technique to measure the similarity between interests based on their definitions, and an agglomerative clustering algorithm to group similar interests into higher level concepts. The second approach uses the Wikipedia Category Graph to extract relationships between interests, while the third approach uses Directory Mozilla to extract relationships between interests. Our results show that the third approach, although the simplest, is the most effective for building a hierarchy over user interests. %Z %U http://dx.doi.org/10.1007/978-3-642-05151-7_35 %+ %^ %0 %0 Conference Proceedings %A Laniado, David; Eynard, Davide & Colombetti, Marco %D 2007 %T Using WordNet to turn a folksonomy into a hierarchy of concepts %E %B Semantic Web Application and Perspectives - Fourth Italian Semantic Web Workshop %C %I %V %6 %N %P 192--201 %& %Y %S %7 %8 Dec %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 inproceedings %4 %# %$ %F laniado2007using %K folksonomy, ol_web2.0, ontology, wordnet, enrichment %X As the volume of information in the read-write Web increases rapidly, folksonomies are becoming a widely used tool to organize and categorize resources in a bottom up, flat and inclusive way. However, due to their very structure, they show some drawbacks; in particular the lack of hierarchy bears some limitations in the possibilities of searching and browsing. In this paper we investigate a new approach, based on the idea of integrating an ontology in the navigation interface of a folksonomy, and we describe an application that filters del.icio.us keywords through the WordNet hierarchy of concepts, to enrich the possibilities of navigation. %Z %U http://home.dei.polimi.it/eynard/papers/swap2007.pdf %+ %^ %0 %0 Journal Article %A Budanitsky, Alexander & Hirst, Graeme %D 2006 %T Evaluating WordNet-based Measures of Lexical Semantic Relatedness %E %B Computational Linguists %C %I MIT Press %V 32 %6 %N 1 %P 13--47 %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 article %4 %# %$ %F budanitsky2006evaluating %K wordnet, semantic_relatedness %X %Z %U http://ftp.cs.toronto.edu/pub/gh/Budanitsky+Hirst-2006.pdf %+ %^ %0 %0 Book Section %A Ruiz-Casado, Maria; Alfonseca, Enrique & Castells, Pablo %D 2005 %T Automatic Extraction of Semantic Relationships for WordNet by Means of Pattern Learning from Wikipedia %E Montoyo, Andrés; Muñoz, Rafael & Métais, Elisabeth %B Natural Language Processing and Information Systems %C Berlin / Heidelberg %I Springer %V 3513 %6 %N %P 233-242 %& %Y %S Lecture Notes in Computer Science %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 SpringerLink - Abstract %3 incollection %4 %# %$ %F ruizcasado2005automatic %K ol_web2.0, patterns, wikipedia, wordnet, data_wikis, methods_relations %X This paper describes an automatic approach to identify lexical patterns which represent semantic relationships between concepts, from an on-line encyclopedia. Next, these patterns can be applied to extend existing ontologies or semantic networks with new relations. The experiments have been performed with the Simple English Wikipedia and WordNet 1.7. A new algorithm has been devised for automatically generalising the lexical patterns found in the encyclopedia entries. We have found general patterns for the hyperonymy, hyponymy, holonymy and meronymy relations and, using them, we have extracted more than 1200 new relationships that did not appear in WordNet originally. The precision of these relationships ranges between 0.61 and 0.69, depending on the relation. %Z %U http://dx.doi.org/10.1007/11428817_7 %+ %^ %0 %0 Conference Proceedings %A McHale, Michael %D 1998 %T A Comparison of WordNet and Roget's Taxonomy for Measuring Semantic Similarity %E Harabagiu, Sanda & Chai, Joyce Yue %B Proceedings of the COLING/ACL Workshop on Usage of WordNet in Natural Language Processing Systems, August 16, 1998, Montreal, Canada %C %I Association for Computational Linguistics, Morristown, NJ, USA %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 inproceedings %4 %# %$ %F mchale1998comparison %K thesaurus, roget, wordnet, ontology %X This paper presents the results of using Roget's International Thesaurus as the taxonomy in a semantic similarity measurement task. Four similarity metrics were taken from the literature and applied to Roget's The experimental evaluation suggests that the traditional edge counting approach does surprisingly well (a correlation of r=0.88 with a benchmark set of human similarity judgements, with an upper bound of r=0.90 for human subjects performing the same task.) %Z %U http://www.citebase.org/abstract?id=oai:arXiv.org:cmp-lg/9809003 %+ %^