Lemmerich, F. & Atzmueller, M.: Describing Locations using Tags and Images: Explorative Pattern Mining in Social Media.
Modeling and Mining Ubiquitous Social Media. Heidelberg, Germany: Springer Verlag, 2012 (LNAI 7472)
[Volltext]
[BibTeX]
Antonellis, I.; Karim, J. & Garcia-Molina, H.:
Navigating the Web with Query Tags. , 2011
[Volltext] [Kurzfassung]
[BibTeX]
We propose to integrate various pieces of information about a web page (search queries, social annotations, terms extracted from the pagetext) into a navigational menu. This menu displays an auxiliary set of tags (navigational tags) selected with the goal of helping user navigation. We propose a novel framework (navigational utility) for comparing different tag selection methods. We also investigate which source of tags is more suitable for our scenario and we conclude that tags extracted from search queries (query tags) are more appropriate.
Gasevic, D.; Zouaq, A.; Torniai, C.; Jovanovic, J. & Hatala, M.: An Approach to Folksonomy-based Ontology Maintenance for Learning Environments. In:
IEEE Transactions on Learning Technologies 99 (2011), Nr. 1,
[Volltext]
[BibTeX]
Peters, I.; Haustein, S. & Terliesner, J.: Crowdsourcing in Article Evaluation.
ACM WebSci'11. 2011, S. 1-4
[Volltext] [Kurzfassung]
[BibTeX]
Qualitative journal evaluation makes use of cumulated content descriptions of single articles. These can either be represented by author-generated keywords, professionally indexed subject headings, automatically extracted terms or by reader-generated tags as used in social bookmarking systems. It is assumed that particularly the users? view on article content differs significantly from the authors? or indexers? perspectives. To verify this assumption, title and abstract terms, author keywords, Inspec subject headings, KeyWords PlusTM and tags are compared by calculating the overlap between the respective datasets. Our approach includes extensive term preprocessing (i.e. stemming, spelling unifications) to gain a homogeneous term collection. When term overlap is calculated for every single document of the dataset, similarity values are low. Thus, the presented study confirms the assumption, that the different types of keywords each reflect a different perspective of the articles? contents and that tags (cumulated across articles) can be used in journal evaluation to represent a reader-specific view on published content.
Venetis, P.; Koutrika, G. & Garcia-Molina, H.: On the selection of tags for tag clouds.
Proceedings of the fourth ACM international conference on Web search and data mining. New York, NY, USA: ACM, 2011WSDM '11 , S. 835-844
[Volltext] [Kurzfassung]
[BibTeX]
We examine the creation of a tag cloud for exploring and understanding a set of objects (e.g., web pages, documents). In the first part of our work, we present a formal system model for reasoning about tag clouds. We then present metrics that capture the structural properties of a tag cloud, and we briefly present a set of tag selection algorithms that are used in current sites (e.g., del.icio.us, Flickr, Technorati) or that have been described in recent work. In order to evaluate the results of these algorithms, we devise a novel synthetic user model. This user model is specifically tailored for tag cloud evaluation and assumes an "ideal" user. We evaluate the algorithms under this user model, as well as the model itself, using two datasets: CourseRank (a Stanford social tool containing information about courses) and del.icio.us (a social bookmarking site). The results yield insights as to when and why certain selection schemes work best.
Garcia-Silva, A.; Corcho, O. & Gracia, J.:
Associating Semantics to Multilingual Tags in Folksonomies. , 2010
[Volltext] [Kurzfassung]
[BibTeX]
Tagging systems are nowadays a common feature in web sites where user-generated content plays an important role. However, the lack of semantics and multilinguality hamper information retrieval process based on folksonomies. In this paper we propose an approach to bring semantics to multilingual folksonomies. This approach includes a sense disambiguation activity and takes advantage from knowledge generated by the masses in the form of articles, redirection and disambiguation links, and translations in Wikipedia. We use DBpedia[2] as semantic resource to define the tag meanings.
Gedikli, F. & Jannach, D.: Rating items by rating tags. In:
Systems and the Social Web at ACM (2010),
[BibTeX]
Carman, M. J.; Baillie, M.; Gwadera, R. & Crestani, F.: A statistical comparison of tag and query logs.
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval. New York, NY, USA: ACM, 2009SIGIR '09 , S. 123-130
[Volltext] [Kurzfassung]
[BibTeX]
We investigate tag and query logs to see if the terms people use to annotate websites are similar to the ones they use to query for them. Over a set of URLs, we compare the distribution of tags used to annotate each URL with the distribution of query terms for clicks on the same URL. Understanding the relationship between the distributions is important to determine how useful tag data may be for improving search results and conversely, query data for improving tag prediction. In our study, we compare both term frequency distributions using vocabulary overlap and relative entropy. We also test statistically whether the term counts come from the same underlying distribution. Our results indicate that the vocabulary used for tagging and searching for content are similar but not identical. We further investigate the content of the websites to see which of the two distributions (tag or query) is most similar to the content of the annotated/searched URL. Finally, we analyze the similarity for different categories of URLs in our sample to see if the similarity between distributions is dependent on the topic of the website or the popularity of the URL.
Giannakidou, E.; Koutsonikola, V. A.; Vakali, A. & Kompatsiaris, Y.: Co-Clustering Tags and Social Data Sources..
WAIM. IEEE, 2008, S. 317-324
[Volltext] [Kurzfassung]
[BibTeX]
Under social tagging systems, a typical Web 2.0 application, users label digital data sources by using freely chosen textual descriptions (tags). Poor retrieval in the aforementioned systems remains a major problem mostly due to questionable tag validity and tag ambiguity. Earlier clustering techniques have shown limited improvements, since they were based mostly on tag co-occurrences. In this paper, a co-clustering approach is employed, that exploits joint groups of related tags and social data sources, in which both social and semantic aspects of tags are considered simultaneously. Experimental results demonstrate the efficiency and the beneficial outcome of the proposed approach in correlating relevant tags and resources.
Grineva, M.; Grinev, M.; Turdakov, D. & Velikhov, P.: Harnessing Wikipedia for Smart Tags Clustering.
Proceedings of the International Workshop on Knowledge Acquisition from the Social Web (KASW2008). 2008
[Kurzfassung]
[BibTeX]
The quality of the current tagging services can be greatly improved if the service is able to cluster tags by their meaning. Tag clouds clustered by higher level topics enable the users to explore their tag space, which is especially needed when tag clouds become large. We demonstrate TagCluster - a tool for automated tag clustering that harnesses knowledge from Wikipedia about semantic relatedness between tags and names of categories to achieve smart clustering. Our approach shows much better quality of clusters compared to the existing techniques that rely on tag co-occurrence analysis in the tagging service.
Illig, J.:
Machine Learnability Analysis of Textclassifications in a Social Bookmarking Folksonomy. Kassel, University of Kassel, Bachelor Thesis, 2008
[BibTeX]
Jäschke, R.; Marinho, L.; Hotho, A.; Schmidt-Thieme, L. & Stumme, G.: Tag recommendations in social bookmarking systems. In:
AI Communications 21 (2008), Nr. 4, S. 231-247
[Volltext]
[Kurzfassung]
[BibTeX]
Collaborative tagging systems allow users to assign keywords – so called “tags” – to resources. Tags are used for navigation, finding resources and serendipitous browsing and thus provide an immediate benefit for users. These systems usually include tag recommendation mechanisms easing the process of finding good tags for a resource, but also consolidating the tag vocabulary across users. In practice, however, only very basic recommendation strategies are applied. In this paper we evaluate and compare several recommendation algorithms on large-scale real life datasets: an adaptation of user-based collaborative filtering, a graph-based recommender built on top of the FolkRank algorithm, and simple methods based on counting tag occurrences. We show that both FolkRank and collaborative filtering provide better results than non-personalized baseline methods. Moreover, since methods based on counting tag occurrences are computationally cheap, and thus usually preferable for real time scenarios, we discuss simple approaches for improving the performance of such methods. We show, how a simple recommender based on counting tags from users and resources can perform almost as good as the best recommender.
Jäschke, R.; Marinho, L.; Hotho, A.; Schmidt-Thieme, L. & Stumme, G.: Tag Recommendations in Social Bookmarking Systems. In:
AI Communications 21 (2008), Nr. 4, S. 231-247
[Volltext]
[Kurzfassung]
[BibTeX]
Collaborative tagging systems allow users to assign keywords - so called "tags" - to resources. Tags are used for navigation, finding resources and serendipitous browsing and thus provide an immediate benefit for users. These systems usually include tag recommendation mechanisms easing the process of finding good tags for a resource, but also consolidating the tag vocabulary across users. In practice, however, only very basic recommendation strategies are applied.
In this paper we evaluate and compare several recommendation algorithms on large-scale real life datasets: an adaptation of
user-based collaborative filtering, a graph-based recommender built on top of the FolkRank algorithm, and simple methods based on counting tag occurences. We show that both FolkRank and Collaborative Filtering provide better results than non-personalized baseline methods. Moreover, since methods based on counting tag occurrences are computationally cheap, and thus usually preferable for real time scenarios, we discuss simple approaches for improving the performance of such methods. We show, how a simple recommender based on counting tags from users and resources can perform almost as good as the best recommender.
Jäschke, R.; Marinho, L.; Hotho, A.; Schmidt-Thieme, L. & Stumme, G.: Tag Recommendations in Social Bookmarking Systems. In:
AI Communications 21 (2008), Nr. 4, S. 231-247
[Volltext]
[Kurzfassung]
[BibTeX]
Collaborative tagging systems allow users to assign keywords - so called "tags" - to resources. Tags are used for navigation, finding resources and serendipitous browsing and thus provide an immediate benefit for users. These systems usually include tag recommendation mechanisms easing the process of finding good tags for a resource, but also consolidating the tag vocabulary across users. In practice, however, only very basic recommendation strategies are applied. In this paper we evaluate and compare several recommendation algorithms on large-scale real life datasets: an adaptation of user-based collaborative filtering, a graph-based recommender built on top of the FolkRank algorithm, and simple methods based on counting tag occurences. We show that both FolkRank and Collaborative Filtering provide better results than non-personalized baseline methods. Moreover, since methods based on counting tag occurrences are computationally cheap, and thus usually preferable for real time scenarios, we discuss simple approaches for improving the performance of such methods. We show, how a simple recommender based on counting tags from users and resources can perform almost as good as the best recommender.
Maala, M. Z.; Delteil, A. & Azough, A.: A conversion process from Flickr tags to RDF descriptions. In:
IADIS INTERNATIONAL JOURNAL ON WWW/INTERNET 6 (2008), Nr. 1,
[Volltext]
[Kurzfassung]
[BibTeX]
The recent evolution of the Web, now designated by the term Web 2.0, has seen the appearance of a huge number of resources created and annotated by users. However the annotations consist only in simple tags that are gathered in unstructured sets called folksonomies. The use of more complex languages to annotate resources and to define semantics according to the vision of the Semantic Web, would improve the understanding by machines and programs, like search engines, of what is on the Web. Indeed tags expressivity is very low compared to the representation standards of the Semantic Web, like RDF and OWL. But users appear to be still reluctant to annotate resources with RDF, and it should be recognized that Semantic Web, contrary to Web 2.0, is still not a reality of today’s Web. One way to take advantage of Semantic Web capabilities right now, without waiting for a change of the annotation usages, would be to be able to generate RDF annotations from tags. As a first step toward this direction, this paper presents a tentative to automatically convert a set of tags into a RDF description in the context of photos on Flickr. Such a method exploits some specificity of tags used on Flickr, some basic natural language processing tools and some semantic resources, in order to relate semantically tags describing a given photo and build a pertinent RDF annotation for this photo.
Al-Khalifa, H. S. & Davis, H. C.: Towards better understanding of folksonomic patterns.
Proceedings of the eighteenth conference on Hypertext and hypermedia. New York, NY, USA: ACM, 2007HT '07 , S. 163-166
[Volltext]
[BibTeX]
Al-Khalifa, H. S. & Davis, H. C.: Folksonomies versus Automatic Keyword Extraction: An Empirical Study. In:
IADIS INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (IJCSIS) Vol. 1 (2006), Nr. Number, S. 132-143
[Volltext]
[Kurzfassung]
[BibTeX]
Semantic Metadata, which describes the meaning of documents, can be produced either manually or else semi-automatically using information extraction techniques. Manual techniques are expensive if they rely on skilled cataloguers, but a possible alternative is to make use of community produced annotations such as those collected in folksonomies. This paper reports on an experiment that we carried out to validate the assumption that folksonomies contain higher semantic value than keywords extracted by machines. The experiment has been carried-out in two ways: subjectively, by asking a human indexer to evaluate the quality of the generated keywords from both systems; and automatically, by measuring the percentage of overlap between the folksonomy set and machine generated keywords set. The result of the experiment can be considered as evidence for the rich semantics of folksonomies, demonstrating that folksonomies used in the del.icio.us bookmarking service can be used in the process of generating semantic metadata to annotate web resources.
Dubinko, M.; Kumar, R.; Magnani, J.; Novak, J.; Raghavan, P. & Tomkins, A.: Visualizing tags over time.
WWW '06: Proceedings of the 15th international conference on World Wide Web. New York, NY, USA: ACM Press, 2006, S. 193-202
[Volltext]
[BibTeX]
Golder, S. A. & Huberman, B. A.: Usage patterns of collaborative tagging systems. In:
Journal of Information Science 32 (2006), Nr. 2, S. 198-208
[Volltext]
[Kurzfassung]
[BibTeX]
Collaborative tagging describes the process by which many users add metadata in the form of keywords to shared content. Recently, collaborative tagging has grown in popularity on the web, on sites that allow users to tag bookmarks, photographs and other content. In this paper we analyze the structure of collaborative tagging systems as well as their dynamic aspects. Specifically, we discovered regularities in user activity, tag frequencies, kinds of tags used, bursts of popularity in bookmarking and a remarkable stability in the relative proportions of tags within a given URL. We also present a dynamic model of collaborative tagging that predicts these stable patterns and relates them to imitation and shared knowledge.
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G.: Das Entstehen von Semantik in BibSonomy.
Social Software in der Wertschöpfung. Baden-Baden: Nomos, 2006
[Volltext] [Kurzfassung]
[BibTeX]
Immer mehr Soziale-Lesezeichen-Systeme entstehen im heutigen Web. In solchen Systemen erstellen die Nutzer leichtgewichtige begriffliche Strukturen, so genannte Folksonomies. Ihren Erfolg verdanken sie der Tatsache, dass man keine speziellen Fähigkeiten benötigt, um an der Gestaltung mitzuwirken. In diesem Artikel beschreiben wir unser System BibSonomy. Es erlaubt das Speichern, Verwalten und Austauschen sowohl von Lesezeichen (Bookmarks) als auch von Literaturreferenzen in Form von BibTeX-Einträgen. Die Entwicklung des verwendeten Vokabulars und der damit einhergehenden Entstehung einer gemeinsamen Semantik wird detailliert diskutiert.