Community Assessment using Evidence Networks.
In:
Analysis of Social Media and Ubiquitous Data, Band 6904, Reihe LNAI.
2011.
Folke Mitzlaff, Martin Atzmueller, Dominik Benz, Andreas Hotho und Gerd Stumme.
[BibTeX]
Query Logs as Folksonomies.
Datenbank-Spektrum, 10(1):15-24, 2010.
Dominik Benz, Andreas Hotho, Robert Jäschke, Beate Krause und Gerd Stumme.
[doi]
[Kurzfassung]
[BibTeX]
Query logs provide a valuable resource for preference information in search. A user clicking on a specific resource after submitting a query indicates that the resource has some relevance with respect to the query. To leverage the information ofquery logs, one can relate submitted queries from specific users to their clicked resources and build a tripartite graph ofusers, resources and queries. This graph resembles the folksonomy structure of social bookmarking systems, where users addtags to resources. In this article, we summarize our work on building folksonomies from query log files. The focus is on threecomparative studies of the system’s content, structure and semantics. Our results show that query logs incorporate typicalfolksonomy properties and that approaches to leverage the inherent semantics of folksonomies can be applied to query logsas well.
Semantics made by you and me: Self-emerging ontologies can capture the diversity of shared knowledge.
In:
Proceedings of the 2nd Web Science Conference (WebSci10).
Raleigh, NC, USA, 2010.
Dominik Benz, Andreas Hotho, Stefan Stützer und Gerd Stumme.
[doi]
[BibTeX]
Stop Thinking, start Tagging - Tag Semantics emerge from Collaborative Verbosity.
In:
Proceedings of the 19th International World Wide Web Conference (WWW 2010).
ACM, Raleigh, NC, USA, 2010.
Christian Körner, Dominik Benz, Markus Strohmaier, Andreas Hotho und Gerd Stumme.
[doi]
[Kurzfassung]
[BibTeX]
Recent research provides evidence for the presence of emergent semantics in collaborative tagging systems. While several methods have been proposed, little is known about the factors that influence the evolution of semantic structures in these systems. A natural hypothesis is that the quality of the emergent semantics depends on the pragmatics of tagging: Users with certain usage patterns might contribute more to the resulting semantics than others. In this work, we propose several measures which enable a pragmatic differentiation of taggers by their degree of contribution to emerging semantic structures. We distinguish between categorizers, who typically use a small set of tags as a replacement for hierarchical classification schemes, and describers, who are annotating resources with a wealth of freely associated, descriptive keywords. To study our hypothesis, we apply semantic similarity measures to 64 different partitions of a real-world and large-scale folksonomy containing different ratios of categorizers and describers. Our results not only show that ‘verbose’ taggers are most useful for the emergence of tag semantics, but also that a subset containing only 40% of the most ‘verbose’ taggers can produce results that match and even outperform the semantic precision obtained from the whole dataset. Moreover, the results suggest that there exists a causal link between the pragmatics of tagging and resulting emergent semantics. This work is relevant for designers and analysts of tagging systems interested (i) in fostering the semantic development of their platforms, (ii) in identifying users introducing “semantic noise�?, and (iii) in learning ontologies.
Characterizing Semantic Relatedness of Search Query Terms.
In:
Proceedings of the 1st Workshop on Explorative Analytics of Information Networks (EIN2009).
Bled, Slovenia, 2009.
Dominik Benz, Beate Krause, G. Praveen Kumar, Andreas Hotho und Gerd Stumme.
[doi]
[BibTeX]
Managing publications and bookmarks with BibSonomy.
In: C. Cattuto, G. Ruffo und F. Menczer
(Herausgeber):
HT '09: Proceedings of the 20th ACM Conference on Hypertext and Hypermedia, Seiten 323-324.
ACM, New York, NY, USA, 2009.
Dominik Benz, Folke Eisterlehner, Andreas Hotho, Robert Jäschke, Beate Krause und Gerd Stumme.
[doi]
[Kurzfassung]
[BibTeX]
In this demo we present BibSonomy, a social bookmark and publication sharing system.
Proceedings of the Dagstuhl Seminar on Social Web Communities.
2008.
[doi]
[BibTeX]
Evaluation Strategies for Learning Algorithms of Hierarchical Structures.
In:
Proceedings of the 32nd Annual Conference of the German Classification Society - Advances in Data Analysis, Data Handling and Business Intelligence (GfKl 2008), Reihe Studies in Classification, Data Analysis, and Knowledge Organization.
Springer, Berlin-Heidelberg, 2008.
in press
Korinna Bade und Dominik Benz.
[doi]
[Kurzfassung]
[BibTeX]
Several learning tasks comprise hierarchies. Comparison with a "goldstandard" is often performed to evaluate the quality of a learned hierarchy. We assembled various similarity metrics that have been proposed in different disciplines and compared them in a unified interdisciplinary framework for hierarchical evaluation which is based on the distinction of three fundamental dimensions. Identifying deficiencies for measuring structural similarity, we suggest three new measures for this purpose, either extending existing ones or based on new ideas. Experiments with an artificial dataset were performed to compare the different measures. As shown by our results, the measures vary greatly in their properties.
Analyzing Tag Semantics Across Collaborative Tagging Systems.
In: H. Alani, S. Staab und G. Stumme
(Herausgeber):
Proceedings of the Dagstuhl Seminar on Social Web Communities.
2008.
Dominik Benz, Marko Grobelnik, Andreas Hotho, Robert Jäschke, Dunja Mladenic, Vito D. P. Servedio, Sergej Sizov und Martin Szomszor.
[doi]
[Kurzfassung]
[BibTeX]
The objective of our group was to exploit state-of-the-art Information Retrieval methods for finding associations and dependencies between tags, capturing and representing differences in tagging behavior and vocabulary of various folksonomies, with the overall aim to better understand the semantics of tags and the tagging process. Therefore we analyze the semantic content of tags in the Flickr and Delicious folksonomies. We find that: tag context similarity leads to meaningful results in Flickr, despite its narrow folksonomy character; the comparison of tags across Flickr and Delicious shows little semantic overlap, being tags in Flickr associated more to visual aspects rather than technological as it seems to be in Delicious; there are regions in the tag-tag space, provided with the cosine similarity metric, that are characterized by high density; the order of tags inside a post has a semantic relevance.
Semantic Grounding of Tag Relatedness in Social Bookmarking Systems.
In: A. P. Sheth, S. Staab, M. Dean, M. Paolucci, D. Maynard, T. W. Finin und K. Thirunarayan
(Herausgeber):
The Semantic Web - ISWC 2008, Proc.Intl. Semantic Web Conference 2008, Band 5318, Reihe LNAI, Seiten 615-631.
Springer, Heidelberg, 2008.
Ciro Cattuto, Dominik Benz, Andreas Hotho und Gerd Stumme.
[doi]
[Kurzfassung]
[BibTeX]
Collaborative tagging systems have nowadays become important data sources for populating semantic web applications. For taskslike synonym detection and discovery of concept hierarchies, many researchers introduced measures of tag similarity. Eventhough most of these measures appear very natural, their design often seems to be rather ad hoc, and the underlying assumptionson the notion of similarity are not made explicit. A more systematic characterization and validation of tag similarity interms of formal representations of knowledge is still lacking. Here we address this issue and analyze several measures oftag similarity: Each measure is computed on data from the social bookmarking system del.icio.us and a semantic grounding isprovided by mapping pairs of similar tags in the folksonomy to pairs of synsets in Wordnet, where we use validated measuresof semantic distance to characterize the semantic relation between the mapped tags. This exposes important features of theinvestigated similarity measures and indicates which ones are better suited in the context of a given semantic application.
Logsonomy - A Search Engine Folksonomy.
In:
Proceedings of the Second International Conference on Weblogs and Social Media(ICWSM 2008).
AAAI Press, 2008.
Robert Jäschke, Beate Krause, Andreas Hotho und Gerd Stumme.
[doi]
[Kurzfassung]
[BibTeX]
In social bookmarking systems users describe bookmarksby keywords called tags. The structure behindthese social systems, called folksonomies, can beviewed as a tripartite hypergraph of user, tag and resourcenodes. This underlying network shows specificstructural properties that explain its growth and the possibilityof serendipitous exploration.Search engines filter the vast information of the web.Queries describe a user’s information need. In responseto the displayed results of the search engine, users clickon the links of the result page as they expect the answerto be of relevance. The clickdata can be represented as afolksonomy in which queries are descriptions of clickedURLs. This poster analyzes the topological characteristicsof the resulting tripartite hypergraph of queries,users and bookmarks of two query logs and compares ittwo a snapshot of the folksonomy del.icio.us.
A Comparison of Social Bookmarking with Traditional Search.
In: C. Macdonald, I. Ounis, V. Plachouras, I. Ruthven und R. W. White
(Herausgeber):
30th European Conference on IR Research, ECIR 2008, Band 4956, Reihe Lecture Notes in Computer Science, Seiten 101-113.
Springer, Glasgow, UK, 2008.
Beate Krause, Andreas Hotho und Gerd Stumme.
[BibTeX]
The Anti-Social Tagger - Detecting Spam in Social Bookmarking Systems.
In:
Proc. of the Fourth International Workshop on Adversarial Information Retrieval on the Web.
2008.
Beate Krause, Christoph Schmitz, Andreas Hotho und Gerd Stumme.
[doi]
[BibTeX]
Supporting Collaborative Hierarchical Classification: Bookmarks as an Example.
Special Issue of the Computer Networks journal on Innovations in Web Communications Infrastructure, 51(16):4574-4585, 2007.
Dominik Benz, Karen H. L. Tso und Lars Schmidt-Thieme.
[doi]
[Kurzfassung]
[BibTeX]
Bookmarks (or favorites, hotlists) are popular strategies to relocate interesting websites on the WWW by creating a personalized URL repository. Most current browsers offer a facility to locally store and manage bookmarks in a hierarchy of folders; though, with growing size, users reportedly have trouble to create and maintain a stable organization structure. This paper presents a novel collaborative approach to ease bookmark management, especially the “classification�? of new bookmarks into a folder. We propose a methodology to realize the collaborative classification idea of considering how similar users have classified a bookmark. A combination of nearest-neighbor-classifiers is used to derive a recommendation from similar users on where to store a new bookmark. A prototype system called CariBo has been implemented as a plugin for the central bookmark server software SiteBar. All findings have been evaluated on a reasonably large scale, real user dataset with promising results, and possible implications for shared and social bookmarking systems are discussed.
Network Properties of Folksonomies.
AI Communications Journal, Special Issue on ``Network Analysis in Natural Sciences and Engineering'', 20(4):245-262, 2007.
Ciro Cattuto, Christoph Schmitz, Andrea Baldassarri, Vito D. P. Servedio, Vittorio Loreto, Andreas Hotho, Miranda Grahl und Gerd Stumme.
[doi]
[BibTeX]
Conceptual Clustering of Social Bookmarking Sites.
In:
7th International Conference on Knowledge Management (I-KNOW '07), Seiten 356-364.
Know-Center, Graz, Austria, 2007.
Miranda Grahl, Andreas Hotho und Gerd Stumme.
[doi]
[Kurzfassung]
[BibTeX]
Currently, social bookmarking systems provide intuitive support for browsing locally their content. A global view is usually presented by the tag cloud of thesystem, but it does not allow a conceptual drill-down, e. g., along a conceptual hierarchy. In this paper, we present a clustering approach for computing such a conceptual hierarchy for a given folksonomy. The hierarchy is complemented with ranked lists of users and resources most related to each cluster. The rankings are computed using our FolkRank algorithm. We have evaluated our approach on large scale data from the del.icio.us bookmarking system.
Organizing Publications and Bookmarks in BibSonomy.
In: H. Alani, N. Noy, G. Stumme, P. Mika, Y. Sure und D. Vrandecic
(Herausgeber):
Workshop on Social and Collaborative Construction of Structured Knowledge (CKC 2007) at WWW 2007.
Banff, Canada, 2007.
Robert Jäschke, Miranda Grahl, Andreas Hotho, Beate Krause, Christoph Schmitz und Gerd Stumme.
[doi]
[BibTeX]
Automatic Bookmark Classification - A Collaborative Approach.
In:
Proceedings of the 2nd Workshop in Innovations in Web Infrastructure (IWI2) at WWW2006.
Edinburgh, Scotland, 2006.
isbn = 085432853X
Dominik Benz, Karen H. L. Tso und Lars Schmidt-Thieme.
[doi]
[Kurzfassung]
[BibTeX]
Bookmarks (or Favorites, Hotlists) are a popular strategy to relocate interesting websites on the WWW by creating a personalized local URL repository. Most current browsers offer a facility to store and manage bookmarks in a hierarchy of folders; though, with growing size, users reportedly have trouble to create and maintain a stable taxonomy. This paper presents a novel collaborative approach to ease bookmark management, especially the “classification�? of new bookmarks into a folder. We propose a methodology to realize the collaborative classification idea of considering how similar users have classified a bookmark. A combination of nearest-neighbour-classifiers is used to derive a recommendation from similar users on where to store a new bookmark. Additionally, a procedure to generate keyword recommendations is proposed to ease the annotation of new bookmarks. A prototype system called CariBo has been implemented as a plugin of the central bookmark server software SiteBar. A case study conducted with real user data supports the validity of the approach.
Emergent Semantics in BibSonomy.
In: C. Hochberger und R. Liskowsky
(Herausgeber):
Informatik 2006 - Informatik für Menschen. Band 2, Band P-94, Reihe Lecture Notes in Informatics.
Gesellschaft für Informatik, Bonn, 2006.
Proc. Workshop on Applications of Semantic Technologies, Informatik 2006
Andreas Hotho, Robert Jäschke, Christoph Schmitz und Gerd Stumme.
[doi]
[Kurzfassung]
[BibTeX]
Social bookmark tools are rapidly emerging on the Web. In suchsystems users are setting up lightweight conceptual structurescalled folksonomies. The reason for their immediate success is thefact that no specific skills are needed for participating. In thispaper we specify a formal model for folksonomies, briefly describeour own system BibSonomy, which allows for sharing both bookmarks andpublication references, and discuss first steps towards emergent semantics.
Trend Detection in Folksonomies.
In: Y. S. Avrithis, Y. Kompatsiaris, S. Staab und N. E. O'Connor
(Herausgeber):
Proc. First International Conference on Semantics And Digital Media Technology (SAMT) , Band 4306, Reihe LNCS, Seiten 56-70.
Springer, Heidelberg, 2006.
Andreas Hotho, Robert Jäschke, Christoph Schmitz und Gerd Stumme.
[doi]
[Kurzfassung]
[BibTeX]
As the number of resources on the web exceeds by far the number ofdocuments one can track, it becomes increasingly difficult to remainup to date on ones own areas of interest. The problem becomes moresevere with the increasing fraction of multimedia data, from whichit is difficult to extract some conceptual description of theircontents.One way to overcome this problem are social bookmark tools, whichare rapidly emerging on the web. In such systems, users are settingup lightweight conceptual structures called folksonomies, andovercome thus the knowledge acquisition bottleneck. As more and morepeople participate in the effort, the use of a common vocabularybecomes more and more stable. We present an approach for discoveringtopic-specific trends within folksonomies. It is based on adifferential adaptation of the PageRank algorithm to the triadichypergraph structure of a folksonomy. The approach allows for anykind of data, as it does not rely on the internal structure of thedocuments. In particular, this allows to consider different datatypes in the same analysis step. We run experiments on a large-scalereal-world snapshot of a social bookmarking system.