Publications
Sentiment Analysis of Twitter Data
Go, A.; Huang, L. & Bhayani, R.
Entropy, 2009(June) 17 (2009) [pdf]
Of course we share! Testing Assumptions about Social Tagging Systems
Doerfel, S.; Zoller, D.; Singer, P.; Niebler, T.; Hotho, A. & Strohmaier, M.
2014 [pdf]
Social tagging systems have established themselves as an important part in
day's web and have attracted the interest from our research community in a
riety of investigations. The overall vision of our community is that simply
rough interactions with the system, i.e., through tagging and sharing of
sources, users would contribute to building useful semantic structures as
ll as resource indexes using uncontrolled vocabulary not only due to the
sy-to-use mechanics. Henceforth, a variety of assumptions about social
gging systems have emerged, yet testing them has been difficult due to the
sence of suitable data. In this work we thoroughly investigate three
ailable assumptions - e.g., is a tagging system really social? - by examining
ve log data gathered from the real-world public social tagging system
bSonomy. Our empirical results indicate that while some of these assumptions
ld to a certain extent, other assumptions need to be reflected and viewed in
very critical light. Our observations have implications for the design of
ture search and other algorithms to better reflect the actual user behavior.
Linked Data Games: Simulating Human Association with Linked Data
Hees, J.; Roth-Berghofer, T. & Dengel, A.
Atzmüller, M.; Benz, D.; Hotho, A. & Stumme, G., ed., 'Proceedings of LWA2010 - Workshop-Woche: Lernen, Wissen & Adaptivitaet', Kassel, Germany (2010)
Teaching machines to understand human communication is one of the central goals of artificial intelligence. Psychological research indicates that human associations are an essential requirement to understand human communication. In this paper the hypothesis is presented, that simulating human associations with the help of Linked Data could greatly improve text understanding capabilities of machines. To more thoroughly investigate whether human associations can be simulated with Linked Data, two preliminary problems are identified: (i) There does not seem to be a reasonable ground truth for human associations and (ii) while human associations have different strengths, Linked Data treats all triples equally and does not provide edge weights. Two ideas for games in accordance with Luis von Ahn's Games with a Purpose are proposed, turning the tedious process of entering associations or ratings into fun games.
Leveraging Publication Metadata and Social Data into FolkRank for Scientific Publication Recommendation
Doerfel, S.; Jäschke, R.; Hotho, A. & Stumme, G.
, 'Proceedings of the 4th ACM RecSys workshop on Recommender systems and the social web', RSWeb '12, ACM, New York, NY, USA, [10.1145/2365934.2365937], 9-16 (2012) [pdf]
The ever-growing flood of new scientific articles requires novel retrieval mechanisms. One means for mitigating this instance of the information overload phenomenon are collaborative tagging systems, that allow users to select, share and annotate references to publications. These systems employ recommendation algorithms to present to their users personalized lists of interesting and relevant publications. In this paper we analyze different ways to incorporate social data and metadata from collaborative tagging systems into the graph-based ranking algorithm FolkRank to utilize it for recommending scientific articles to users of the social bookmarking system BibSonomy. We compare the results to those of Collaborative Filtering, which has previously been applied for resource recommendation.
Clustering by pattern similarity in large data sets.
Wang, H.; 0010, W. W.; Yang, J. & Yu, P. S.
Franklin, M. J.; Moon, B. & Ailamaki, A., ed., 'SIGMOD Conference', ACM, 394-405 (2002) [pdf]
Characterizing User Behavior in Online Social Networks
Benevenuto, F.; Rodrigues, T.; Cha, M. & Almeida, V.
, 'Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement Conference', IMC '09, ACM, New York, NY, USA, [10.1145/1644893.1644900], 49-62 (2009) [pdf]
Understanding how users behave when they connect to social networking sites creates opportunities for better interface design, richer studies of social interactions, and improved design of content distribution systems. In this paper, we present a first of a kind analysis of user workloads in online social networks. Our study is based on detailed clickstream data, collected over a 12-day period, summarizing HTTP sessions of 37,024 users who accessed four popular social networks: Orkut, MySpace, Hi5, and LinkedIn. The data were collected from a social network aggregator website in Brazil, which enables users to connect to multiple social networks with a single authentication. Our analysis of the clickstream data reveals key features of the social network workloads, such as how frequently people connect to social networks and for how long, as well as the types and sequences of activities that users conduct on these sites. Additionally, we crawled the social network topology of Orkut, so that we could analyze user interaction data in light of the social graph. Our data analysis suggests insights into how users interact with friends in Orkut, such as how frequently users visit their friends' or non-immediate friends' pages. In summary, our analysis demonstrates the power of using clickstream data in identifying patterns in social network workloads and social interactions. Our analysis shows that browsing, which cannot be inferred from crawling publicly available data, accounts for 92% of all user activities. Consequently, compared to using only crawled data, considering silent interactions like browsing friends' pages increases the measured level of interaction among users.