Jäschke, R.; Hotho, A.; Schmitz, C. & Stumme, G. (2006),
Wege zur Entdeckung von Communities in Folksonomies, in
Stefan Braß & Alexander Hinneburg, ed.,
'Proc. 18. Workshop Grundlagen von Datenbanken'
, Martin-Luther-Universität , Halle-Wittenberg
, pp. 80-84
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Ein wichtiger Baustein des neu entdeckten World Wide Web -- des "`Web 2.0"' -- stellen
Folksonomies dar. In diesen Systemen können Benutzer gemeinsam Ressourcen verwalten und
mit Schlagwörtern versehen. Die dadurch entstehenden begrifflichen Strukturen stellen
ein interessantes Forschungsfeld dar. Dieser Artikel untersucht Ansätze und Wege zur
Entdeckung und Strukturierung von Nutzergruppen ("Communities") in Folksonomies.
Jäschke, R.; Hotho, A.; Schmitz, C.; Ganter, B. & Stumme, G. (2006),
TRIAS - An Algorithm for Mining Iceberg Tri-Lattices, in
'Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 06)'
, IEEE Computer Society, Hong Kong
, pp. 907-911
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G. (2006),
Trend detection in folksonomies, in
'Proceedings of the First international conference on Semantic and Digital Media Technologies'
, Springer-Verlag, Berlin, Heidelberg
, pp. 56--70
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
As the number of resources on the web exceeds by far the number of documents one can track, it becomes increasingly difficult to remain up to date on ones own areas of interest. The problem becomes more severe with the increasing fraction of multimedia data, from which it is difficult to extract some conceptual description of their contents.</p> <p>One way to overcome this problem are social bookmark tools, which are rapidly emerging on the web. In such systems, users are setting up lightweight conceptual structures called folksonomies, and overcome thus the knowledge acquisition bottleneck. As more and more people participate in the effort, the use of a common vocabulary becomes more and more stable. We present an approach for discovering topic-specific trends within folksonomies. It is based on a differential adaptation of the PageRank algorithm to the triadic hypergraph structure of a folksonomy. The approach allows for any kind of data, as it does not rely on the internal structure of the documents. In particular, this allows to consider different data types in the same analysis step. We run experiments on a large-scale real-world snapshot of a social bookmarking system.
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G. (2006),
Trend Detection in Folksonomies, in
Yannis S. Avrithis; Yiannis Kompatsiaris; Steffen Staab & Noel E. O'Connor, ed.,
'Proc. First International Conference on Semantics And Digital Media Technology (SAMT) '
, Springer, Heidelberg
, pp. 56-70
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
As the number of resources on the web exceeds by far the number ofdocuments one can track, it becomes increasingly difficult to remainup to date on ones own areas of interest. The problem becomes moresevere with the increasing fraction of multimedia data, from whichit is difficult to extract some conceptual description of theircontents.One way to overcome this problem are social bookmark tools, whichare rapidly emerging on the web. In such systems, users are settingup lightweight conceptual structures called folksonomies, andovercome thus the knowledge acquisition bottleneck. As more and morepeople participate in the effort, the use of a common vocabularybecomes more and more stable. We present an approach for discoveringtopic-specific trends within folksonomies. It is based on adifferential adaptation of the PageRank algorithm to the triadichypergraph structure of a folksonomy. The approach allows for anykind of data, as it does not rely on the internal structure of thedocuments. In particular, this allows to consider different datatypes in the same analysis step. We run experiments on a large-scalereal-world snapshot of a social bookmarking system.
Benz, D.; Hotho, A.; Jäschke, R.; Krause, B.; Mitzlaff, F.; Schmitz, C. & Stumme, G. (2010),
'The Social Bookmark and Publication Management System Bibsonomy', The VLDB Journal
19
(6)
, 849--875
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Social resource sharing systems are central elements of the Web 2.0 and use the same kind of lightweight knowledge representation, called folksonomy. Their large user communities and ever-growing networks of user-generated content have made them an attractive object of investigation for researchers from different disciplines like Social Network Analysis, Data Mining, Information Retrieval or Knowledge Discovery. In this paper, we summarize and extend our work on different aspects of this branch of Web 2.0 research, demonstrated and evaluated within our own social bookmark and publication sharing system BibSonomy, which is currently among the three most popular systems of its kind. We structure this presentation along the different interaction phases of a user with our system, coupling the relevant research questions of each phase with the corresponding implementation issues. This approach reveals in a systematic fashion important aspects and results of the broad bandwidth of folksonomy research like capturing of emergent semantics, spam detection, ranking algorithms, analogies to search engine log data, personalized tag recommendations and information extraction techniques. We conclude that when integrating a real-life application like BibSonomy into research, certain constraints have to be considered; but in general, the tight interplay between our scientific work and the running system has made BibSonomy a valuable platform for demonstrating and evaluating Web 2.0 research.
Benz, D.; Hotho, A.; Jäschke, R.; Krause, B.; Mitzlaff, F.; Schmitz, C. & Stumme, G. (2010),
'The Social Bookmark and Publication Management System Bibsonomy', The VLDB Journal
19
(6)
, 849--875
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Social resource sharing systems are central elements of the Web 2.0 and use the same kind of lightweight knowledge representation, called folksonomy. Their large user communities and ever-growing networks of user-generated content have made them an attractive object of investigation for researchers from different disciplines like Social Network Analysis, Data Mining, Information Retrieval or Knowledge Discovery. In this paper, we summarize and extend our work on different aspects of this branch of Web 2.0 research, demonstrated and evaluated within our own social bookmark and publication sharing system BibSonomy, which is currently among the three most popular systems of its kind. We structure this presentation along the different interaction phases of a user with our system, coupling the relevant research questions of each phase with the corresponding implementation issues. This approach reveals in a systematic fashion important aspects and results of the broad bandwidth of folksonomy research like capturing of emergent semantics, spam detection, ranking algorithms, analogies to search engine log data, personalized tag recommendations and information extraction techniques. We conclude that when integrating a real-life application like BibSonomy into research, certain constraints have to be considered; but in general, the tight interplay between our scientific work and the running system has made BibSonomy a valuable platform for demonstrating and evaluating Web 2.0 research.
Benz, D.; Hotho, A.; Jäschke, R.; Krause, B.; Mitzlaff, F.; Schmitz, C. & Stumme, G. (2010),
'The Social Bookmark and Publication Management System Bibsonomy', The VLDB Journal
19
(6)
, 849--875
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Social resource sharing systems are central elements of the Web 2.0 and use the same kind of lightweight knowledge representation, called folksonomy. Their large user communities and ever-growing networks of user-generated content have made them an attractive object of investigation for researchers from different disciplines like Social Network Analysis, Data Mining, Information Retrieval or Knowledge Discovery. In this paper, we summarize and extend our work on different aspects of this branch of Web 2.0 research, demonstrated and evaluated within our own social bookmark and publication sharing system BibSonomy, which is currently among the three most popular systems of its kind. We structure this presentation along the different interaction phases of a user with our system, coupling the relevant research questions of each phase with the corresponding implementation issues. This approach reveals in a systematic fashion important aspects and results of the broad bandwidth of folksonomy research like capturing of emergent semantics, spam detection, ranking algorithms, analogies to search engine log data, personalized tag recommendations and information extraction techniques. We conclude that when integrating a real-life application like BibSonomy into research, certain constraints have to be considered; but in general, the tight interplay between our scientific work and the running system has made BibSonomy a valuable platform for demonstrating and evaluating Web 2.0 research.
Benz, D.; Hotho, A.; Jäschke, R.; Krause, B.; Mitzlaff, F.; Schmitz, C. & Stumme, G. (2010),
'The Social Bookmark and Publication Management System BibSonomy', The VLDB Journal
19
(6)
, 849--875
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Social resource sharing systems are central elements of the Web 2.0 and use the same kind of lightweight knowledge representation, called folksonomy. Their large user communities and ever-growing networks of user-generated content have made them an attractive object of investigation for researchers from different disciplines like Social Network Analysis, Data Mining, Information Retrieval or Knowledge Discovery. In this paper, we summarize and extend our work on different aspects of this branch of Web 2.0 research, demonstrated and evaluated within our own social bookmark and publication sharing system BibSonomy, which is currently among the three most popular systems of its kind. We structure this presentation along the different interaction phases of a user with our system, coupling the relevant research questions of each phase with the corresponding implementation issues. This approach reveals in a systematic fashion important aspects and results of the broad bandwidth of folksonomy research like capturing of emergent semantics, spam detection, ranking algorithms, analogies to search engine log data, personalized tag recommendations and information extraction techniques. We conclude that when integrating a real-life application like BibSonomy into research, certain constraints have to be considered; but in general, the tight interplay between our scientific work and the running system has made BibSonomy a valuable platform for demonstrating and evaluating Web 2.0 research.
Krause, B.; Schmitz, C.; Hotho, A. & Stumme, G. (2008),
The Anti-social Tagger: Detecting Spam in Social Bookmarking Systems, in
'Proceedings of the 4th International Workshop on Adversarial Information Retrieval on the Web'
, ACM, New York, NY, USA
, pp. 61--68
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
The annotation of web sites in social bookmarking systems has become a popular way to manage and find information on the web. The community structure of such systems attracts spammers: recent post pages, popular pages or specific tag pages can be manipulated easily. As a result, searching or tracking recent posts does not deliver quality results annotated in the community, but rather unsolicited, often commercial, web sites. To retain the benefits of sharing one's web content, spam-fighting mechanisms that can face the flexible strategies of spammers need to be developed. A classical approach in machine learning is to determine relevant features that describe the system's users, train different classifiers with the selected features and choose the one with the most promising evaluation results. In this paper we will transfer this approach to a social bookmarking setting to identify spammers. We will present features considering the topological, semantic and profile-based information which people make public when using the system. The dataset used is a snapshot of the social bookmarking system BibSonomy and was built over the course of several months when cleaning the system from spam. Based on our features, we will learn a large set of different classification models and compare their performance. Our results represent the groundwork for a first application in BibSonomy and for the building of more elaborate spam detection mechanisms.
Krause, B.; Schmitz, C.; Hotho, A. & Stumme, G. (2008),
The Anti-Social Tagger - Detecting Spam in Social Bookmarking Systems, in
'AIRWeb '08: Proceedings of the 4th International Workshop on Adversarial Information Retrieval on the Web'
, ACM, New York, NY, USA
, pp. 61--68
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
The annotation of web sites in social bookmarking systemshas become a popular way to manage and find informationon the web. The community structure of such systems attractsspammers: recent post pages, popular pages or specifictag pages can be manipulated easily. As a result, searchingor tracking recent posts does not deliver quality resultsannotated in the community, but rather unsolicited, oftencommercial, web sites. To retain the benefits of sharingone’s web content, spam-fighting mechanisms that can facethe flexible strategies of spammers need to be developed.
Krause, B.; Schmitz, C.; Hotho, A. & Stumme, G. (2008),
The Anti-Social Tagger - Detecting Spam in Social Bookmarking Systems, in
'Proc. of the Fourth International Workshop on Adversarial Information Retrieval on the Web'
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Tane, J.; Schmitz, C. & Stumme, G. (2004),
Semantic resource management for the web: an e-learning application, in
'Proc. 13th International World Wide Web Conference (WWW 2004)'
, pp. 1-10
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Hotho, A.; Benz, D.; Eisterlehner, F.; Jäschke, R.; Krause, B.; Schmitz, C. & Stumme, G. (2010),
'Publikationsmanagement mit BibSonomy - ein Social-Bookmarking-System für Wissenschaftler', HMD - Praxis der Wirtschaftsinformatik
271
, 47--58
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Kooperative Verschlagwortungs- bzw. Social-Bookmarking-Systeme wie Delicious, Mister Wong oder auch unser eigenes System BibSonomy erfreuen sich immer größerer Beliebtheit und bilden einen zentralen Bestandteil des heutigen Web 2.0. In solchen Systemen erstellen Nutzer leichtgewichtige Begriffssysteme, sogenannte Folksonomies, die die Nutzerdaten strukturieren. Die einfache Bedienbarkeit, die Allgegenwärtigkeit, die ständige Verfügbarkeit, aber auch die Möglichkeit, Gleichgesinnte spontan in solchen Systemen zu entdecken oder sie schlicht als Informationsquelle zu nutzen, sind Gründe für ihren gegenwärtigen Erfolg. Der Artikel führt den Begriff Social Bookmarking ein und diskutiert zentrale Elemente wie Browsing und Suche am Beispiel von BibSonomy anhand typischer Arbeitsabläufe eines Wissenschaftlers. Wir beschreiben die Architektur von BibSonomy sowie Wege der Integration und Vernetzung von BibSonomy mit Content-Management-Systemen und Webauftritten. Der Artikel schließt mit Querbezügen zu aktuellen Forschungsfragen im Bereich Social Bookmarking.
Jäschke, R.; Grahl, M.; Hotho, A.; Krause, B.; Schmitz, C. & Stumme, G. (2007),
Organizing Publications and Bookmarks in BibSonomy, in
Harith Alani; Natasha Noy; Gerd Stumme; Peter Mika; York Sure & Denny Vrandecic, ed.,
'Workshop on Social and Collaborative Construction of Structured Knowledge (CKC 2007) at WWW 2007'
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Schmitz, C.; Hotho, A.; Jäschke, R. & Stumme, G. (2006),
Mining Association Rules in Folksonomies, in
V. Batagelj; H.-H. Bock; A. Ferligoj & A. vZiberna, ed.,
'Data Science and Classification: Proc. of the 10th IFCS Conf.'
, Springer, Berlin, Heidelberg
, pp. 261--270
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G. (2006),
Information Retrieval in Folksonomies: Search and Ranking, in
York Sure & John Domingue, ed.,
'The Semantic Web: Research and Applications'
, Springer, Heidelberg
, pp. 411-426
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Hotho, A.; J?schke, R.; Schmitz, C. & Stumme, G. (2006),
Information Retrieval in Folksonomies: Search and Ranking, in
York Sure & John Domingue, ed.,
'The Semantic Web: Research and Applications'
, Springer, Heidelberg
, pp. 411-426
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G. (2006),
FolkRank: A Ranking Algorithm for Folksonomies, in
'Proc. FGIR 2006'
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
In social bookmark tools users are setting up
lightweight conceptual structures called folksonomies. Currently,
the information retrieval support is limited. We present a formal
model and a new search algorithm for folksonomies, called
FolkRank, that exploits the structure of the folksonomy. The
proposed algorithm is also applied to find communities within the
folksonomy and is used to structure search results. All findings are
demonstrated on a large scale dataset. A long version of this paper
has been published at the European Semantic Web Conference
2006.
Jäschke, R.; Hotho, A.; Schmitz, C.; Ganter, B. & Stumme, G. (2008),
'Discovering Shared Conceptualizations in Folksonomies', Journal of Web Semantics
6
(1)
, 38-53
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Schmitz, C.; Hotho, A.; Jäschke, R. & Stumme, G. (2006),
Content Aggregation on Knowledge Bases using Graph Clustering, in
York Sure & John Domingue, ed.,
'The Semantic Web: Research and Applications'
, Springer, Heidelberg
, pp. 530-544
.
[Volltext]
[Kurzfassung]
[BibTeX]
[Endnote]
Recently, research projects such as PADLR and SWAP
have developed tools like Edutella or Bibster, which are targeted at
establishing peer-to-peer knowledge management (P2PKM) systems. In
such a system, it is necessary to obtain provide brief semantic
descriptions of peers, so that routing algorithms or matchmaking
processes can make decisions about which communities peers should
belong to, or to which peers a given query should be forwarded.
This paper provides a graph clustering technique on
knowledge bases for that purpose. Using this clustering, we can show
that our strategy requires up to 58% fewer queries than the
baselines to yield full recall in a bibliographic P2PKM scenario.