Publications
Mining Social Media: Key Players, Sentiments, and Communities
Atzmueller, M.
WIREs: Data Mining and Knowledge Discovery, In Press() (2012)
Privacy-aware spam detection in social bookmarking systems
Bullock, B. N.; Lerch, H.; Ro A.; Hotho, A. & Stumme, G.
, 'Proceedings of the 11th International Conference on Knowledge Management and Knowledge Technologies', i-KNOW '11, ACM, New York, NY, USA, [10.1145/2024288.2024306], 15:1-15:8 (2011) [pdf]
With the increased popularity of Web 2.0 services in the last years data privacy has become a major concern for users. The more personal data users reveal, the more difficult it becomes to control its disclosure in the web. However, for Web 2.0 service providers, the data provided by users is a valuable source for offering effective, personalised data mining services. One major application is the detection of spam in social bookmarking systems: in order to prevent a decrease of content quality, providers need to distinguish spammers and exclude them from the system. They thereby experience a conflict of interests: on the one hand, they need to identify spammers based on the information they collect about users, on the other hand, they need to respect privacy concerns and process as few personal data as possible. It would therefore be of tremendous help for system developers and users to know which personal data are needed for spam detection and which can be ignored. In this paper we address these questions by presenting a data privacy aware feature engineering approach. It consists of the design of features for spam classification which are evaluated according to both, performance and privacy conditions. Experiments using data from the social bookmarking system BibSonomy show that both conditions must not exclude each other.
The Anti-Social Tagger - Detecting Spam in Social Bookmarking Systems
Krause, B.; Schmitz, C.; Hotho, A. & Stumme, G.
, 'AIRWeb '08: Proceedings of the 4th International Workshop on Adversarial Information Retrieval on the Web', ACM, New York, NY, USA, [10.1145/1451983.1451998], 61-68 (2008) [pdf]
The annotation of web sites in social bookmarking systemshas become a popular way to manage and find informationon the web. The community structure of such systems attractsspammers: recent post pages, popular pages or specifictag pages can be manipulated easily. As a result, searchingor tracking recent posts does not deliver quality resultsannotated in the community, but rather unsolicited, oftencommercial, web sites. To retain the benefits of sharingone’s web content, spam-fighting mechanisms that can facethe flexible strategies of spammers need to be developed.
DENGRAPH: A Density-based Community Detection Algorithm
Falkowski, T.; Barth, A. & Spiliopoulou, M.
, 'In Proc. of the 2007 IEEE / WIC / ACM International Conference on Web Intelligence,', 112-115 (2007) [pdf]
Trend Detection in Folksonomies
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G.
Avrithis, Y. S.; Kompatsiaris, Y.; Staab, S. & O'Connor, N. E., ed., 'Proc. First International Conference on Semantics And Digital Media Technology (SAMT) ', 4306(), LNCS, Springer, Heidelberg, 56-70 (2006) [pdf]
As the number of resources on the web exceeds by far the number of
cuments one can track, it becomes increasingly difficult to remain
to date on ones own areas of interest. The problem becomes more
vere with the increasing fraction of multimedia data, from which
is difficult to extract some conceptual description of their
ntents.

ne way to overcome this problem are social bookmark tools, which
e rapidly emerging on the web. In such systems, users are setting
lightweight conceptual structures called folksonomies, and
ercome thus the knowledge acquisition bottleneck. As more and more
ople participate in the effort, the use of a common vocabulary
comes more and more stable. We present an approach for discovering
pic-specific trends within folksonomies. It is based on a
fferential adaptation of the PageRank algorithm to the triadic
pergraph structure of a folksonomy. The approach allows for any
nd of data, as it does not rely on the internal structure of the
cuments. In particular, this allows to consider different data
pes in the same analysis step. We run experiments on a large-scale
al-world snapshot of a social bookmarking system.

Wege zur Entdeckung von Communities in Folksonomies
Jäschke, R.; Hotho, A.; Schmitz, C. & Stumme, G.
Braß, S. & Hinneburg, A., ed., 'Proc. 18. Workshop Grundlagen von Datenbanken', Martin-Luther-Universität , Halle-Wittenberg, 80-84 (2006) [pdf]
Ein wichtiger Baustein des neu entdeckten World Wide Web -- des "`Web 2.0"' -- stellen
lksonomies dar. In diesen Systemen können Benutzer gemeinsam Ressourcen verwalten und
t Schlagwörtern versehen. Die dadurch entstehenden begrifflichen Strukturen stellen
n interessantes Forschungsfeld dar. Dieser Artikel untersucht Ansätze und Wege zur
tdeckung und Strukturierung von Nutzergruppen ("Communities") in Folksonomies.
Trawling the Web for emerging cyber-communities
Kumar, R.; Raghavan, P.; Rajagopalan, S. & Tomkins, A.
Computer Networks , 31(11--16) 1481-1493 (1999) [pdf]