Artikel in Tagungsbänden
Toponym Resolution in Social Media..
In: P. F. Patel-Schneider, Y. Pan, P. Hitzler, P. Mika, L. Zhang, J. Z. Pan, I. Horrocks und B. Glimm
(Herausgeber):
#iswc2010#, Band 6496, Reihe Lecture Notes in Computer Science, Seiten 370-385.
Springer, 2010.
Neil Ireson und Fabio Ciravegna.
[doi]
[Kurzfassung]
[BibTeX]
Increasingly user-generated content is being utilised as a source of information, however each individual piece of content tends to contain low levels of information. In addition, such information tends to be informal and imperfect in nature; containing imprecise, subjective, ambiguous expressions. However the content does not have to be interpreted in isolation as it is linked, either explicitly or implicitly, to a network of interrelated content; it may be grouped or tagged with similar content, comments may be added by other users or it may be related to other content posted at the same time or by the same author or members of the author's social network. This paper generally examines how ambiguous concepts within user-generated content can be assigned a specific/formal meaning by considering the expanding context of the information, i.e. other information contained within directly or indirectly related content, and specifically considers the issue of toponym resolution of locations.
I tag, you tag: translating tags for advanced user models..
In: B. D. Davison, T. Suel, N. Craswell und B. Liu
(Herausgeber):
WSDM, Seiten 71-80.
ACM, 2010.
Robert Wetzker, Carsten Zimmermann, Christian Bauckhage und Sahin Albayrak.
[doi]
[Kurzfassung]
[BibTeX]
Collaborative tagging services (folksonomies) have been among the stars of theWeb 2.0 era. They allow their users to label diverse resources with freely chosen keywords (tags). Our studies of two real-world folksonomies unveil that individual users develop highly personalized vocabularies of tags. While these meet individual needs and preferences, the considerable differences between personal tag vocabularies (personomies) impede services such as social search or customized tag recommendation. In this paper, we introduce a novel user-centric tag model that allows us to derive mappings between personal tag vocabularies and the corresponding folksonomies. Using these mappings, we can infer the meaning of user-assigned tags and can predict choices of tags a user may want to assign to new items. Furthermore, our translational approach helps in reducing common problems related to tag ambiguity, synonymous tags, or multilingualism. We evaluate the applicability of our method in tag recommendation and tag-based social search. Extensive experiments show that our translational model improves the prediction accuracy in both scenarios.
Preliminary Results in Tag Disambiguation using DBpedia.
In:
Knowledge Capture (K-Cap'09) - First International Workshop on Collective Knowledge Capturing and Representation - CKCaR'09.
2009.
Andres Garcia, Martin Szomszor, Harith Alani und Oscar Corcho.
[doi]
[Kurzfassung]
[BibTeX]
The availability of tag-based user-generated content for a variety of Web resources (music, photos, videos, text, etc.) has largely increased in the last years. Users can assign tags freely and then use them to share and retrieve information. However, tag-based sharing and retrieval is not optimal due to the fact that tags are plain text labels without an explicit or formal meaning, and hence polysemy and synonymy should be dealt with appropriately. To ameliorate these problems, we propose a context-based tag disambiguation algorithm that selects the meaning of a tag among a set of candidate DBpedia entries, using a common information retrieval similarity measure. The most similar DBpedia en-try is selected as the one representing the meaning of the tag. We describe and analyze some preliminary results, and discuss about current challenges in this area.
Tag Sense Disambiguation for Clarifying the Vocabulary of Social Tags.
In:
CSE, Seiten 729-734.
IEEE Computer Society, 2009.
Kangpyo Lee, Hyunwoo Kim, Hyopil Shin und Hyoung-Joo Kim.
[doi]
[Kurzfassung]
[BibTeX]
Tagging is one of the most popular services in Web 2.0. As a special form of tagging, social tagging is done collaboratively by many users, which forms a so-called folksonomy. As tagging has become widespread on the Web, the tag vocabulary is now very informal, uncontrolled, and personalized. For this reason, many tags are unfamiliar and ambiguous to users so that they fail to understand the meaning of each tag. In this paper, we propose a tag sense disambiguating method, called Tag Sense Disambiguation (TSD), which works in the social tagging environment. TSD can be applied to the vocabulary of social tags, thereby enabling users to understand the meaning of each tag through Wikipedia. To find the correct mappings from del.icio.us tags to Wikipedia articles, we define the Local )eighbor tags, the Global )eighbor tags, and finally the )eighbor tags that would be the useful keywords for disambiguating the sense of each tag based on the tag co-occurrences. The automatically built mappings are reasonable in most cases. The experiment shows that TSD can find the correct mappings with high accuracy.
Disambiguating Tags in Blogs.
In: Vá. Matousek und P. Mautner
(Herausgeber):
TSD, Band 5729, Reihe Lecture Notes in Computer Science, Seiten 139-146.
Springer, 2009.
Xiance Si und Maosong Sun.
[doi]
[Kurzfassung]
[BibTeX]
Blog users enjoy tagging for better document organization, while ambiguity in tags leads to inaccuracy in tag-based applications, such as retrieval, visualization or trend discovery. The dynamic nature of tag meanings makes current word sense disambiguation(WSD) methods not applicable. In this paper, we propose an unsupervised method for disambiguating tags in blogs. We first cluster the tags by their context words using Spectral Clustering. Then we compare a tag with these clusters to find the most suitable meaning. We use Normalized Google Distance to measure word similarity, which can be computed by querying search engines, thus reflects the up-to-date meaning of words. No human labeling efforts or dictionary needed in our method. Evaluation using crawled blog data showed a promising micro average precision of 0.842.
Semantify del.icio.us: Automatically Turn your Tags into Senses.
In:
Proceedings of the Workshop Social Data on the Web (SDoW2008).
2008.
Maurizio Tesconi, Francesco Ronzano, Andrea Marchetti und Salvatore Minutoli.
[doi]
[Kurzfassung]
[BibTeX]
At present tagging is experimenting a great diffusion as the most adopted way to collaboratively classify resources over the Web. In this paper, after a detailed analysis of the attempts made to improve the organization and structure of tagging systems as well as the usefulness of this kind of social data, we propose and evaluate the Tag Disambiguation Algorithm, mining del.icio.us data. It allows to easily semantify the tags of the users of a tagging service: it automatically finds out for each tag the related concept of Wikipedia in order to describe Web resources through senses. On the basis of a set of evaluation tests, we analyze all the advantages of our sense-based way of tagging, proposing new methods to keep the set of users tags more consistent or to classify the tagged resources on the basis of Wikipedia categories, YAGO classes or Wordnet synsets. We discuss also how our semanitified social tagging data are strongly linked to DBPedia and the datasets of the Linked Data community. 1
Buchbeiträge
Disambiguating Hypernym Relations for <i>Roget’s</i> Thesaurus.
In:
V. Matoušek und P. Mautner (Herausgeber):
Text, Speech and Dialogue, Seiten 66-75.
Springer, Berlin / Heidelberg, 2007.
Alistair Kennedy und Stanistaw Szpakowicz.
[doi]
[Kurzfassung]
[BibTeX]
Roget’s Thesaurus is a lexical resource which groups terms by semantic relatedness. It is Roget’s shortcoming that the relations are ambiguous, in that it does not name them; it only shows that there is a relation between terms. Our work focuses on disambiguating hypernym relations within Roget’s Thesaurus. Several techniques of identifying hypernym relations are compared and contrasted in this paper, and a total of over 50,000 hypernym relations have been disambiguated within Roget’s. Human judges have evaluated the quality of our disambiguation techniques, and we have demonstrated on several applications the usefulness of the disambiguated relations.
Artikel in Tagungsbänden
Tag Meaning Disambiguation through Analysis of Tripartite Structure of Folksonomies.
In:
Web Intelligence/IAT Workshops, Seiten 3-6.
IEEE, 2007.
Ching man Au Yeung, Nicholas Gibbins und Nigel Shadbolt.
[doi]
[Kurzfassung]
[BibTeX]
Collaborative tagging systems are becoming very popular recently. Web users use freely-chosen tags to describe shared resources, resulting in a folksonomy. One problem of folksonomies is that tags which appear in the same form may carry multiple meanings and represent different concepts. As this kind of tags are ambiguous, the precisions in both description and retrieval of the shared resources are reduced. We attempt to develop effective methods to disambiguate tags by studying the tripartite structure of folksonomies. This paper describes the network analysis techniques that we employ to discover clusters of nodes in networks and the algorithm for tag disambiguation. Experiments show that the method is very effective in performing the task.
A Graph Model for Unsupervised Lexical Acquisition.
In:
COLING.
2002.
Dominic Widdows und Beate Dorow.
[BibTeX]