Illig,Jens
Hotho,Andreas
Jäschke,Robert
Stumme,Gerd
A Comparison of content-based Tag Recommendations in Folksonomy Systems
Springer
to appear
Hotho,Andreas
Jäschke,Robert
Benz,Dominik
Grahl,Miranda
Krause,Beate
Schmitz,Christoph
Stumme,Gerd
Social Bookmarking am Beispiel BibSonomy
Springer
363–391
2009
BibSonomy ist ein kooperatives Verschlagwortungssystem (Social Bookmarking System), betrieben vom Fachgebiet Wissensverarbeitungder Universität Kassel. Es erlaubt das Speichern und Organisieren von Web-Lesezeichen und Metadaten für wissenschaftlichePublikationen. In diesem Beitrag beschreiben wir die von BibSonomy bereitgestellte Funktionalität, die dahinter stehende Architektursowie das zugrunde liegende Datenmodell. Ferner erläutern wir Anwendungsbeispiele und gehen auf Methoden zur Analyse der in BibSonomy und ähnlichen Systemen enthaltenen Daten ein.
Markines,Benjamin
Cattuto,Ciro
Menczer,Filippo
Benz,Dominik
Hotho,Andreas
Stumme,Gerd
Evaluating Similarity Measures for Emergent Semantics of Social Tagging
641–641
2009
Social bookmarking systems and their emergent information structures, known as folksonomies, are increasingly important data sources for Semantic Web applications. A key question for harvesting semantics from these systems is how to extend and adapt traditional notions of similarity to folksonomies, and which measures are best suited for applications such as navigation support, semantic search, and ontology learning. Here we build an evaluation framework to compare various general folksonomy-based similarity measures derived from established information-theoretic, statistical, and practical measures. Our framework deals generally and symmetrically with users, tags, and resources. For evaluation purposes we focus on similarity among tags and resources, considering different ways to aggregate annotations across users. After comparing how tag similarity measures predict user-created tag relations, we provide an external grounding by user-validated semantic proxies based on WordNet and the Open Directory. We also investigate the issue of scalability. We ?nd that mutual information with distributional micro-aggregation across users yields the highest accuracy, but is not scalable; per-user projection with collaborative aggregation provides the best scalable approach via incremental computations. The results are consistent across resource and tag similarity.
Proceedings of the Dagstuhl Seminar on Social Web Communities
Schloss Dagstuhl
2008
Benz,Dominik
Grobelnik,Marko
Hotho,Andreas
Jäschke,Robert
Mladenic,Dunja
Servedio,Vito D.P.
Sizov,Sergej
Szomszor,Martin
Analyzing Tag Semantics Across Collaborative Tagging Systems
2008
The objective of our group was to exploit state-of-the-art Information Retrieval methods for finding associations and dependencies between tags, capturing and representing differences in tagging behavior and vocabulary of various folksonomies, with the overall aim to better understand the semantics of tags and the tagging process. Therefore we analyze the semantic content of tags in the Flickr and Delicious folksonomies. We find that: tag context similarity leads to meaningful results in Flickr, despite its narrow folksonomy character; the comparison of tags across Flickr and Delicious shows little semantic overlap, being tags in Flickr associated more to visual aspects rather than technological as it seems to be in Delicious; there are regions in the tag-tag space, provided with the cosine similarity metric, that are characterized by high density; the order of tags inside a post has a semantic relevance.
Cattuto,Ciro
Benz,Dominik
Hotho,Andreas
Stumme,Gerd
Semantic Analysis of Tag Similarity Measures in Collaborative Tagging Systems
39–43
2008
Social bookmarking systems allow users to organise collections of resources on the Web in a collaborative fashion. The increasing popularity of these systems as well as first insights into their emergent semantics have made them relevant to disciplines like knowledge extraction and ontology learning. The problem of devising methods to measure the semantic relatedness between tags and characterizing it semantically is still largely open. Here we analyze three measures of tag relatedness: tag co-occurrence, cosine similarity of co-occurrence distributions, and FolkRank, an adaptation of the PageRank algorithm to folksonomies. Each measure is computed on tags from a large-scale dataset crawled from the social bookmarking system del.icio.us. To provide a semantic grounding of our findings, a connection to WordNet (a semantic lexicon for the English language) is established by mapping tags into synonym sets of WordNet, and applying there well-known metrics of semantic similarity. Our results clearly expose different characteristics of the selected measures of relatedness, making them applicable to different subtasks of knowledge extraction such as synonym detection or discovery of concept hierarchies.
Cattuto,Ciro
Benz,Dominik
Hotho,Andreas
Stumme,Gerd
Semantic Grounding of Tag Relatedness in Social Bookmarking Systems
Springer
5318
615–631
2008
Collaborative tagging systems have nowadays become important data sources for populating semantic web applications. For taskslike synonym detection and discovery of concept hierarchies, many researchers introduced measures of tag similarity. Eventhough most of these measures appear very natural, their design often seems to be rather ad hoc, and the underlying assumptionson the notion of similarity are not made explicit. A more systematic characterization and validation of tag similarity interms of formal representations of knowledge is still lacking. Here we address this issue and analyze several measures oftag similarity: Each measure is computed on data from the social bookmarking system del.icio.us and a semantic grounding isprovided by mapping pairs of similar tags in the folksonomy to pairs of synsets in Wordnet, where we use validated measuresof semantic distance to characterize the semantic relation between the mapped tags. This exposes important features of theinvestigated similarity measures and indicates which ones are better suited in the context of a given semantic application.
Jäschke,Robert
Krause,Beate
Hotho,Andreas
Stumme,Gerd
Logsonomy – A Search Engine Folksonomy
AAAI Press
2008
In social bookmarking systems users describe bookmarksby keywords called tags. The structure behindthese social systems, called folksonomies, can beviewed as a tripartite hypergraph of user, tag and resourcenodes. This underlying network shows specificstructural properties that explain its growth and the possibilityof serendipitous exploration.Search engines filter the vast information of the web.Queries describe a user’s information need. In responseto the displayed results of the search engine, users clickon the links of the result page as they expect the answerto be of relevance. The clickdata can be represented as afolksonomy in which queries are descriptions of clickedURLs. This poster analyzes the topological characteristicsof the resulting tripartite hypergraph of queries,users and bookmarks of two query logs and compares ittwo a snapshot of the folksonomy del.icio.us.
Krause,Beate
Hotho,Andreas
Stumme,Gerd
A Comparison of Social Bookmarking with Traditional Search
Springer
4956
101-113
2008
Krause,Beate
Jäschke,Robert
Hotho,Andreas
Stumme,Gerd
Logsonomy - Social Information Retrieval with Logdata
ACM
157–166
2008
Social bookmarking systems constitute an establishedpart of the Web 2.0. In such systemsusers describe bookmarks by keywordscalled tags. The structure behind these socialsystems, called folksonomies, can be viewedas a tripartite hypergraph of user, tag and resourcenodes. This underlying network showsspecific structural properties that explain itsgrowth and the possibility of serendipitousexploration.Today’s search engines represent the gatewayto retrieve information from the World WideWeb. Short queries typically consisting oftwo to three words describe a user’s informationneed. In response to the displayedresults of the search engine, users click onthe links of the result page as they expectthe answer to be of relevance.This clickdata can be represented as a folksonomyin which queries are descriptions ofclicked URLs. The resulting network structure,which we will term logsonomy is verysimilar to the one of folksonomies. In orderto find out about its properties, we analyzethe topological characteristics of the tripartitehypergraph of queries, users and bookmarkson a large snapshot of del.icio.us andon query logs of two large search engines.All of the three datasets show small worldproperties. The tagging behavior of users,which is explained by preferential attachmentof the tags in social bookmark systems, isreflected in the distribution of single querywords in search engines. We can concludethat the clicking behaviour of search engineusers based on the displayed search resultsand the tagging behaviour of social bookmarkingusers is driven by similar dynamics.
Krause,Beate
Schmitz,Christoph
Hotho,Andreas
Stumme,Gerd
The Anti-Social Tagger - Detecting Spam in Social Bookmarking Systems
2008
Benz,Dominik
Hotho,Andreas
Position Paper: Ontology Learning from Folksonomies
Martin-Luther-Universität Halle-Wittenberg
109–112
2007
The emergence of collaborative tagging systems with their underlying flat and uncontrolled resource organization paradigm has led to a large number of research activities focussing on a formal description and analysis of the resulting “folksonomies�?. An interesting outcome is that the characteristic qualities of these systems seem to be inverse to more traditional knowledge structuring approaches like taxonomies or ontologies: The latter provide rich and precise semantics, but suffer - amongst others - from a knowledge acquisition bottleneck. An important step towards exploiting the possible synergies by bridging the gap between both paradigms is the automatic extraction of relations between tags in a folksonomy. This position paper presents preliminary results of ongoing work to induce hierarchical relationships among tags by analyzing the aggregated data of collaborative tagging systems as a basis for an ontology learning procedure.
Cattuto,Ciro
Schmitz,Christoph
Baldassarri,Andrea
Servedio,Vito D.P.
Loreto,Vittorio
Hotho,Andreas
Grahl,Miranda
Stumme,Gerd
Network Properties of Folksonomies
AI Communications Journal, Special Issue on ``Network Analysis in Natural Sciences and Engineering''
IOS Press
20
245-262
2007
Grahl,Miranda
Hotho,Andreas
Stumme,Gerd
Conceptual Clustering of Social Bookmarking Sites
Know-Center
356-364
2007
Currently, social bookmarking systems provide intuitive support for browsing locally their content. A global view is usually presented by the tag cloud of thesystem, but it does not allow a conceptual drill-down, e. g., along a conceptual hierarchy. In this paper, we present a clustering approach for computing such a conceptual hierarchy for a given folksonomy. The hierarchy is complemented with ranked lists of users and resources most related to each cluster. The rankings are computed using our FolkRank algorithm. We have evaluated our approach on large scale data from the del.icio.us bookmarking system.
Jäschke,Robert
Grahl,Miranda
Hotho,Andreas
Krause,Beate
Schmitz,Christoph
Stumme,Gerd
Organizing Publications and Bookmarks in BibSonomy
2007
Hotho,Andreas
Jäschke,Robert
Schmitz,Christoph
Stumme,Gerd
BibSonomy: A Social Bookmark and Publication Sharing System
Aalborg Universitetsforlag
87-102
2006
Social bookmark tools are rapidly emerging on the Web. In suchsystems users are setting up lightweight conceptual structurescalled folksonomies. The reason for their immediate success is thefact that no specific skills are needed for participating. In thispaper we specify a formal model for folksonomies and briefly describe our own system BibSonomy, which allows for sharing both bookmarksand publication references in a kind of personal library.
Hotho,Andreas
Jäschke,Robert
Schmitz,Christoph
Stumme,Gerd
Das Entstehen von Semantik in BibSonomy
Nomos
2006
Immer mehr Soziale-Lesezeichen-Systeme entstehen im heutigen Web. In solchen Systemen erstellen die Nutzer leichtgewichtige begriffliche Strukturen, so genannte Folksonomies. Ihren Erfolg verdanken sie der Tatsache, dass man keine speziellen Fähigkeiten benötigt, um an der Gestaltung mitzuwirken. In diesem Artikel beschreiben wir unser System BibSonomy. Es erlaubt das Speichern, Verwalten und Austauschen sowohl von Lesezeichen (Bookmarks) als auch von Literaturreferenzen in Form von BibTeX-Einträgen. Die Entwicklung des verwendeten Vokabulars und der damit einhergehenden Entstehung einer gemeinsamen Semantik wird detailliert diskutiert.
Hotho,Andreas
Jäschke,Robert
Schmitz,Christoph
Stumme,Gerd
Emergent Semantics in BibSonomy
Gesellschaft für Informatik
P-94
2006
Social bookmark tools are rapidly emerging on the Web. In suchsystems users are setting up lightweight conceptual structurescalled folksonomies. The reason for their immediate success is thefact that no specific skills are needed for participating. In thispaper we specify a formal model for folksonomies, briefly describeour own system BibSonomy, which allows for sharing both bookmarks andpublication references, and discuss first steps towards emergent semantics.
Hotho,Andreas
Jäschke,Robert
Schmitz,Christoph
Stumme,Gerd
Trend Detection in Folksonomies
Springer
4306
56-70
2006
As the number of resources on the web exceeds by far the number ofdocuments one can track, it becomes increasingly difficult to remainup to date on ones own areas of interest. The problem becomes moresevere with the increasing fraction of multimedia data, from whichit is difficult to extract some conceptual description of theircontents.One way to overcome this problem are social bookmark tools, whichare rapidly emerging on the web. In such systems, users are settingup lightweight conceptual structures called folksonomies, andovercome thus the knowledge acquisition bottleneck. As more and morepeople participate in the effort, the use of a common vocabularybecomes more and more stable. We present an approach for discoveringtopic-specific trends within folksonomies. It is based on adifferential adaptation of the PageRank algorithm to the triadichypergraph structure of a folksonomy. The approach allows for anykind of data, as it does not rely on the internal structure of thedocuments. In particular, this allows to consider different datatypes in the same analysis step. We run experiments on a large-scalereal-world snapshot of a social bookmarking system.