Conference articles
An analysis of tag-recommender evaluation procedures.
In:
Proceedings of the 7th ACM conference on Recommender systems, series RecSys '13, pages 343-346.
ACM, New York, NY, USA, 2013.
Stephan Doerfel and Robert Jäschke.
[doi]
[abstract]
[BibTeX]
Since the rise of collaborative tagging systems on the web, the tag recommendation task -- suggesting suitable tags to users of such systems while they add resources to their collection -- has been tackled. However, the (offline) evaluation of tag recommendation algorithms usually suffers from difficulties like the sparseness of the data or the cold start problem for new resources or users. Previous studies therefore often used so-called post-cores (specific subsets of the original datasets) for their experiments. In this paper, we conduct a large-scale experiment in which we analyze different tag recommendation algorithms on different cores of three real-world datasets. We show, that a recommender's performance depends on the particular core and explore correlations between performances on different cores.
Journal articles
Deeper Into the Folksonomy Graph: FolkRank Adaptations and Extensions for Improved Tag Recommendations.
cs.IR, 1310.1498, 2013.
Nikolas Landia, Stephan Doerfel, Robert Jäschke, Sarabjot Singh Anand, Andreas Hotho and Nathan Griffiths.
[doi]
[abstract]
[BibTeX]
The information contained in social tagging systems is often modelled as a graph of connections between users, items and tags. Recommendation algorithms such as FolkRank, have the potential to leverage complex relationships in the data, corresponding to multiple hops in the graph. We present an in-depth analysis and evaluation of graph models for social tagging data and propose novel adaptations and extensions of FolkRank to improve tag recommendations. We highlight implicit assumptions made by the widely used folksonomy model, and propose an alternative and more accurate graph-representation of the data. Our extensions of FolkRank address the new item problem by incorporating content data into the algorithm, and significantly improve prediction results on unpruned datasets. Our adaptations address issues in the iterative weight spreading calculation that potentially hinder FolkRank's ability to leverage the deep graph as an information source. Moreover, we evaluate the benefit of considering each deeper level of the graph, and present important insights regarding the characteristics of social tagging data in general. Our results suggest that the base assumption made by conventional weight propagation methods, that closeness in the graph always implies a positive relationship, does not hold for the social tagging domain.
Conference articles
Tag Recommendations for SensorFolkSonomies.
In:
Recommender Systems and the Social Web Workshop at 7th ACM Conference on Recommender Systems, RecSys 2013, Hong Kong, China - October 12-16, 2013. Proceedings, pages New York, NY, USA.
ACM, 2013.
accepted for publication
Juergen Mueller, Stephan Doerfel, Martin Becker, Andreas Hotho and Gerd Stumme.
[abstract]
[BibTeX]
With the rising popularity of smart mobile devices, sensor data-based applications have become more and more popular. Their users record data during their daily routine or specifically for certain events. The application WideNoise Plus allows users to record sound samples and to annotate them with perceptions and tags. The app is being used to document and map the soundscape all over the world. The procedure of recording, including the assignment of tags, has to be as easy-to-use as possible. We therefore discuss the application of tag recommender algorithms in this particular scenario. We show, that this task is fundamentally different from the well-known tag recommendation problem in folksonomies as users do no longer tag fix resources but rather sensory data and impressions. The scenario requires efficient recommender algorithms that are able to run on the mobile device, since Internet connectivity cannot be assumed to be available. Therefore, we evaluate the performance of several tag recommendation algorithms and discuss their applicability in the mobile sensing use-case.
Miscellaneous
Recommender Systems for Social Tagging Systems.
2012.
L. Balby Marinho, A. Hotho, R. Jäschke, A. Nanopoulos, S. Rendle, L. Schmidt-Thieme, G. Stumme and P. Symeonidis.
[doi]
[abstract]
[BibTeX]
Social Tagging Systems are web applications in which users upload resources (e.g., bookmarks, videos, photos, etc.) and annotate it with a list of freely chosen keywords called tags. This is a grassroots approach to organize a site and help users to find the resources they are interested in. Social tagging systems are open and inherently social; features that have been proven to encourage participation. However, with the large popularity of these systems and the increasing amount of user-contributed content, information overload rapidly becomes an issue. Recommender Systems are well known applications for increasing the level of relevant content over the “noise” that continuously grows as more and more content becomes available online. In social tagging systems, however, we face new challenges. While in classic recommender systems the mode of recommendation is basically the resource, in social tagging systems there are three possible modes of recommendation: users, resources, or tags. Therefore suitable methods that properly exploit the different dimensions of social tagging systems data are needed. In this book, we survey the most recent and state-of-the-art work about a whole new generation of recommender systems built to serve social tagging systems. The book is divided into self-contained chapters covering the background material on social tagging systems and recommender systems to the more advanced techniques like the ones based on tensor factorization and graph-based models.
Book chapters
Challenges in Tag Recommendations for Collaborative Tagging Systems.
In:
J. J. Pazos Arias, A. Fernández Vilas and R. P. Díaz Redondo, editors,
Recommender Systems for the Social Web, pages 65-87.
Springer, Berlin/Heidelberg, 2012.
Robert Jäschke, Andreas Hotho, Folke Mitzlaff and Gerd Stumme.
[doi]
[abstract]
[BibTeX]
Originally introduced by social bookmarking systems, collaborative tagging, or social tagging, has been widely adopted by many web-based systems like wikis, e-commerce platforms, or social networks. Collaborative tagging systems allow users to annotate resources using freely chosen keywords, so called tags . Those tags help users in finding/retrieving resources, discovering new resources, and navigating through the system. The process of tagging resources is laborious. Therefore, most systems support their users by tag recommender components that recommend tags in a personalized way. The Discovery Challenges 2008 and 2009 of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) tackled the problem of tag recommendations in collaborative tagging systems. Researchers were invited to test their methods in a competition on datasets from the social bookmark and publication sharing system BibSonomy. Moreover, the 2009 challenge included an online task where the recommender systems were integrated into BibSonomy and provided recommendations in real time. In this chapter we review, evaluate and summarize the submissions to the two Discovery Challenges and thus lay the groundwork for continuing research in this area.
Conference articles
A Comparison of Content-Based Tag Recommendations in Folksonomy Systems.
In: K. E. Wolff, D. E. Palchunov, N. G. Zagoruiko and U. Andelfinger, editors,
Knowledge Processing and Data Analysis, volume 6581, series Lecture Notes in Computer Science, pages 136-149.
Springer, Berlin/Heidelberg, 2011.
Jens Illig, Andreas Hotho, Robert Jäschke and Gerd Stumme.
[doi]
[abstract]
[BibTeX]
Recommendation algorithms and multi-class classifiers can support users of social bookmarking systems in assigning tags to their bookmarks. Content based recommenders are the usual approach for facing the cold start problem, i.e., when a bookmark is uploaded for the first time and no information from other users can be exploited. In this paper, we evaluate several recommendation algorithms in a cold-start scenario on a large real-world dataset.
Journal articles
Query Logs as Folksonomies.
Datenbank-Spektrum, 10(1):15-24, 2010.
Dominik Benz, Andreas Hotho, Robert Jäschke, Beate Krause and Gerd Stumme.
[doi]
[abstract]
[BibTeX]
Query logs provide a valuable resource for preference information in search. A user clicking on a specific resource after submitting a query indicates that the resource has some relevance with respect to the query. To leverage the information ofquery logs, one can relate submitted queries from specific users to their clicked resources and build a tripartite graph ofusers, resources and queries. This graph resembles the folksonomy structure of social bookmarking systems, where users addtags to resources. In this article, we summarize our work on building folksonomies from query log files. The focus is on threecomparative studies of the system’s content, structure and semantics. Our results show that query logs incorporate typicalfolksonomy properties and that approaches to leverage the inherent semantics of folksonomies can be applied to query logsas well.
Conference articles
Semantic Analysis of Tag Similarity Measures in Collaborative Tagging Systems.
In:
Proceedings of the 3rd Workshop on Ontology Learning and Population (OLP3).
Patras, Greece, 2008.
Ciro Cattuto, Dominik Benz, Andreas Hotho and Gerd Stumme.
[doi]
[abstract]
[BibTeX]
Social bookmarking systems allow users to organise collections of resources on the Web in a collaborative fashion. The increasing popularity of these systems as well as first insights into their emergent semantics have made them relevant to disciplines like knowledge extraction and ontology learning. The problem of devising methods to measure the semantic relatedness between tags and characterizing it semantically is still largely open. Here we analyze three measures of tag relatedness: tag co-occurrence, cosine similarity of co-occurrence distributions, and FolkRank, an adaptation of the PageRank algorithm to folksonomies. Each measure is computed on tags from a large-scale dataset crawled from the social bookmarking system del.icio.us. To provide a semantic grounding of our findings, a connection to WordNet (a semantic lexicon for the English language) is established by mapping tags into synonym sets of WordNet, and applying there well-known metrics of semantic similarity. Our results clearly expose different characteristics of the selected measures of relatedness, making them applicable to different subtasks of knowledge extraction such as synonym detection or discovery of concept hierarchies.
A Comparison of Social Bookmarking with Traditional Search.
In: C. Macdonald, I. Ounis, V. Plachouras, I. Ruthven and R. W. White, editors,
Advances in Information Retrieval, 30th European Conference on IR Research, ECIR 2008, volume 4956, series LNAI, pages 101-113.
Springer, Heidelberg, 2008.
Beate Krause, Andreas Hotho and Gerd Stumme.
[abstract]
[BibTeX]
Social bookmarking systems allow users to store links to internet resources on a web page. As social bookmarking systems are growing in popularity, search algorithms have been developed that transfer the idea of link-based rankings in the Web to a social bookmarking system’s data structure. These rankings differ from traditional search engine rankings in that they incorporate the rating of users. In this study, we compare search in social bookmarking systems with traditionalWeb search. In the first part, we compare the user activity and behaviour in both kinds of systems, as well as the overlap of the underlying sets of URLs. In the second part,we compare graph-based and vector space rankings for social bookmarking systems with commercial search engine rankings. Our experiments are performed on data of the social bookmarking system Del.icio.us and on rankings and log data from Google, MSN, and AOL. We will show that part of the difference between the systems is due to different behaviour (e. g., the concatenation of multi-word lexems to single terms in Del.icio.us), and that real-world events may trigger similar behaviour in both kinds of systems. We will also show that a graph-based ranking approach on folksonomies yields results that are closer to the rankings of the commercial search engines than vector space retrieval, and that the correlation is high in particular for the domains that are well covered by the social bookmarking system.
Logsonomy - Social Information Retrieval with Logdata.
In:
HT '08: Proceedings of the Nineteenth ACM Conference on Hypertext and Hypermedia, pages 157-166.
ACM, New York, NY, USA, 2008.
Beate Krause, Robert Jäschke, Andreas Hotho and Gerd Stumme.
[doi]
[abstract]
[BibTeX]
Social bookmarking systems constitute an established part of the Web 2.0. In such systems users describe bookmarks by keywords called tags. The structure behind these social systems, called folksonomies, can be viewed as a tripartite hypergraph of user, tag and resource nodes. This underlying network shows specific structural properties that explain its growth and the possibility of serendipitous exploration. Today’s search engines represent the gateway to retrieve information from the World Wide Web. Short queries typically consisting of two to three words describe a user’s information need. In response to the displayed results of the search engine, users click on the links of the result page as they expect the answer to be of relevance. This clickdata can be represented as a folksonomy in which queries are descriptions of clicked URLs. The resulting network structure, which we will term logsonomy is very similar to the one of folksonomies. In order to find out about its properties, we analyze the topological characteristics of the tripartite hypergraph of queries, users and bookmarks on a large snapshot of del.icio.us and on query logs of two large search engines. All of the three datasets show small world properties. The tagging behavior of users, which is explained by preferential attachment of the tags in social bookmark systems, is reflected in the distribution of single query words in search engines. We can conclude that the clicking behaviour of search engine users based on the displayed search results and the tagging behaviour of social bookmarking users is driven by similar dynamics.
The Anti-Social Tagger - Detecting Spam in Social Bookmarking Systems.
In:
Proc. of the Fourth International Workshop on Adversarial Information Retrieval on the Web.
2008.
Beate Krause, Christoph Schmitz, Andreas Hotho and Gerd Stumme.
[doi]
[BibTeX]
Conceptual Clustering of Social Bookmarking Sites.
In:
7th International Conference on Knowledge Management (I-KNOW '07), pages 356-364.
Know-Center, Graz, Austria, 2007.
Miranda Grahl, Andreas Hotho and Gerd Stumme.
[abstract]
[BibTeX]
Currently, social bookmarking systems provide intuitive support for browsing locally their content. A global view is usually presented by the tag cloud of the
system, but it does not allow a conceptual drill-down, e. g., along a conceptual hierarchy. In this paper, we present a clustering approach for computing such a conceptual hierarchy for a given folksonomy. The hierarchy is complemented with ranked lists of users and resources most related to each cluster. The rankings are computed using our FolkRank algorithm. We have evaluated our approach on large scale data from the del.icio.us bookmarking system.
Tag Recommendations in Folksonomies.
In: A. Hinneburg, editor,
Workshop Proceedings of Lernen - Wissensentdeckung - Adaptivität (LWA 2007), pages 13-20.
Martin-Luther-Universität Halle-Wittenberg, 2007.
Robert Jaeschke, Leandro Marinho, Andreas Hotho, Lars Schmidt-Thieme and Gerd Stumme.
[doi]
[BibTeX]
Network Properties of Folksonomies.
In:
Proc. WWW2007 Workshop ``Tagging and Metadata for Social Information Organization''.
Banff, 2007.
Christoph Schmitz, Miranda Grahl, Andreas Hotho, Gerd Stumme, Ciro Catutto, Andrea Baldassarri, Vittorio Loreto and Vito D. P. Servedio.
[doi]
[BibTeX]
BibSonomy: A Social Bookmark and Publication Sharing System.
In:
Proc. of the ICCS 2006 Conceptual Structures Tool Interoperability
Workshop.
2006.
(to appear)
Andreas Hotho, Robert Jäschke, Christoph Schmitz and Gerd Stumme.
[BibTeX]
BibSonomy: A Social Bookmark and Publication Sharing System.
In: A. de Moor, S. Polovina and H. Delugach, editors,
Proceedings of the First Conceptual Structures Tool Interoperability Workshop at the 14th International Conference on Conceptual Structures, pages 87-102.
Aalborg Universitetsforlag, Aalborg, 2006.
Andreas Hotho, Robert Jäschke, Christoph Schmitz and Gerd Stumme.
[doi]
[abstract]
[BibTeX]
Social bookmark tools are rapidly emerging on the Web. In such
systems users are setting up lightweight conceptual structures
called folksonomies. The reason for their immediate success is the
fact that no specific skills are needed for participating. In this
paper we specify a formal model for folksonomies and briefly describe
our own system BibSonomy, which allows for sharing both bookmarks
and publication references in a kind of personal library.
Information Retrieval in Folksonomies: Search and Ranking.
In:
Proceedings of the 3rd European Semantic Web Conference, series Lecture Notes in Computer Science, pages 411-426.
Springer, 2006.
Andreas Hotho, Robert Jäschke, Christoph Schmitz and Gerd Stumme.
[BibTeX]
Mining Association Rules in Folksonomies.
In: V. Batagelj, H.-H. Bock, A. Ferligoj and A. Žiberna, editors,
Data Science and Classification. Proceedings of the 10th IFCS Conf., series Studies in Classification, Data Analysis, and Knowledge Organization, pages 261-270.
Springer, Heidelberg, 2006.
Christoph Schmitz, Andreas Hotho, Robert Jäschke and Gerd Stumme.
[doi]
[abstract]
[BibTeX]
Social bookmark tools are rapidly emerging on the Web. In such
systems users are setting up lightweight conceptual structures
called folksonomies. These systems provide currently relatively few
structure. We discuss in this paper, how association rule mining
can be adopted to analyze and structure folksonomies, and how the results can be used
for ontology learning and supporting emergent semantics. We
demonstrate our approach on a large scale dataset stemming from an
online system.
Technical reports
The Structure of Collaborative Tagging Systems.
Information Dynamics Lab, HP Labs , 2005.
Scott Golder and Bernardo A. Huberman.
[doi]
[BibTeX]
Miscellaneous
Collaborative tagging as a tripartite network.
2005. ttarXiv:cs.DS/0512090.
R. Lambiotte and M. Ausloos.
[doi]
[abstract]
[BibTeX]
We describe online collaborative communities by tripartite networks,
the nodes being persons, items and tags. We introduce projection
methods in order to uncover the structures of the networks, i.e.
communities of users, genre families... <br />To do so, we focus
on the correlations between the nodes, depending on their profiles,
and use percolation techniques that consist in removing less correlated
links and observing the shaping of disconnected islands. The structuring
of the network is visualised by using a tree representation. The
notion of diversity in the system is also discussed.