Publications
An analysis of tag-recommender evaluation procedures
Doerfel, S. & Jäschke, R.
, 'Proceedings of the 7th ACM conference on Recommender systems', RecSys '13, ACM, New York, NY, USA, [10.1145/2507157.2507222], 343-346 (2013) [pdf]
Since the rise of collaborative tagging systems on the web, the tag recommendation task -- suggesting suitable tags to users of such systems while they add resources to their collection -- has been tackled. However, the (offline) evaluation of tag recommendation algorithms usually suffers from difficulties like the sparseness of the data or the cold start problem for new resources or users. Previous studies therefore often used so-called post-cores (specific subsets of the original datasets) for their experiments. In this paper, we conduct a large-scale experiment in which we analyze different tag recommendation algorithms on different cores of three real-world datasets. We show, that a recommender's performance depends on the particular core and explore correlations between performances on different cores.
Internet-Graphen
Heidtmann, K.
Informatik-Spektrum, 36(5) 440-448 (2013) [pdf]
Bildeten die Keimzellen des Internet noch kleine und einfach strukturierte Netze, so vergrößerten sich sowohl seine physikalischen als auch seine logischen Topologien später rasant. Wuchs einerseits das Netz aus Rechnern als Knoten und Verbindungsleitungen als Kanten immer weiter, so bedienten sich andererseits gleichzeitig immer mehr Anwendungen dieser Infrastruktur, um darüber ihrerseits immer größere und komplexere virtuelle Netze zu weben, z. B. das WWW oder soziale Online-Netze. Auf jeder Ebene dieser Hierarchie lassen sich die jeweiligen Netztopologien mithilfe von Graphen beschreiben und so mathematisch untersuchen. So ergeben sich interessante Einblicke in die Struktureigenschaften unterschiedlicher Graphentypen, die großen Einfluss auf die Leistungsfähigkeit des Internet haben. Hierzu werden charakteristische Eigenschaften und entsprechende Kenngrößen verschiedener Graphentypen betrachtet wie der Knotengrad, die Durchschnittsdistanz, die Variation der Kantendichte in unterschiedlichen Netzteilen und die topologische Robustheit als Widerstandsfähigkeit gegenüber Ausfällen und Angriffen. Es wird dabei Bezug genommen auf analytische, simulative und zahlreiche empirische Untersuchungen des Internets und hingewiesen auf Simulationsprogramme sowie Abbildungen von Internetgraphen im Internet.
Deeper Into the Folksonomy Graph: FolkRank Adaptations and Extensions for Improved Tag Recommendations
Landia, N.; Doerfel, S.; Jäschke, R.; Anand, S. S.; Hotho, A. & Griffiths, N.
cs.IR, 1310.1498() (2013) [pdf]
The information contained in social tagging systems is often modelled as a graph of connections between users, items and tags. Recommendation algorithms such as FolkRank, have the potential to leverage complex relationships in the data, corresponding to multiple hops in the graph. We present an in-depth analysis and evaluation of graph models for social tagging data and propose novel adaptations and extensions of FolkRank to improve tag recommendations. We highlight implicit assumptions made by the widely used folksonomy model, and propose an alternative and more accurate graph-representation of the data. Our extensions of FolkRank address the new item problem by incorporating content data into the algorithm, and significantly improve prediction results on unpruned datasets. Our adaptations address issues in the iterative weight spreading calculation that potentially hinder FolkRank's ability to leverage the deep graph as an information source. Moreover, we evaluate the benefit of considering each deeper level of the graph, and present important insights regarding the characteristics of social tagging data in general. Our results suggest that the base assumption made by conventional weight propagation methods, that closeness in the graph always implies a positive relationship, does not hold for the social tagging domain.
Structure of Heterogeneous Networks
Ghosh, R. & Lerman, K.
(2009) [pdf]
Heterogeneous networks play a key role in the evolution of communities and
e decisions individuals make. These networks link different types of
tities, for example, people and the events they attend. Network analysis
gorithms usually project such networks unto simple graphs composed of
tities of a single type. In the process, they conflate relations between
tities of different types and loose important structural information. We
velop a mathematical framework that can be used to compactly represent and
alyze heterogeneous networks that combine multiple entity and link types. We
neralize Bonacich centrality, which measures connectivity between nodes by
e number of paths between them, to heterogeneous networks and use this
asure to study network structure. Specifically, we extend the popular
dularity-maximization method for community detection to use this centrality
tric. We also rank nodes based on their connectivity to other nodes. One
vantage of this centrality metric is that it has a tunable parameter we can
e to set the length scale of interactions. By studying how rankings change
th this parameter allows us to identify important nodes in the network. We
ply the proposed method to analyze the structure of several heterogeneous
tworks. We show that exploiting additional sources of evidence corresponding
links between, as well as among, different entity types yields new insights
to network structure.
Modularity clustering is force-directed layout
Noack, A.
(2008) [pdf]
Two natural and widely used representations for the community structure of networks are clusterings, which partition the vertex set into disjoint subsets, and layouts, which assign the vertices to positions in a metric space. This paper unifies prominent characterizations of layout quality and clustering quality, by showing that energy models of pairwise attraction and repulsion subsume Newman and Girvan's modularity measure. Layouts with optimal energy are relaxations of, and are thus consistent with, clusterings with optimal modularity, which is of practical relevance because both representations are complementary and often used together.
Graph OLAP: Towards Online Analytical Processing on Graphs
Zhu, F.; Chen, C.; Yan, X.; Han, J. & Yu, P. S.
, 'Proc. 2008 Int. Conf. on Data Mining (ICDM'08), Pisa, Italy, Dec. 2008.' (2008)
Information Retrieval in Folksonomies: Search and Ranking
Hotho, A.; Jäschke, R.; Schmitz, C. & Stumme, G.
, 'Proceedings of the 3rd European Semantic Web Conference', Lecture Notes in Computer Science, Springer, 411-426 (2006)
Content Aggregation on Knowledge Bases using Graph Clustering
Schmitz, C.; Hotho, A.; Jäschke, R. & Stumme, G.
Sure, Y. & Domingue, J., ed., 'The Semantic Web: Research and Applications', 4011(), LNAI, Springer, Heidelberg, 530-544 (2006) [pdf]
Recently, research projects such as PADLR and SWAP
have developed tools like Edutella or Bibster, which are targeted at
establishing peer-to-peer knowledge management (P2PKM) systems. In
such a system, it is necessary to obtain provide brief semantic
descriptions of peers, so that routing algorithms or matchmaking
processes can make decisions about which communities peers should
belong to, or to which peers a given query should be forwarded.

This paper provides a graph clustering technique on
knowledge bases for that purpose. Using this clustering, we can show
that our strategy requires up to 58% fewer queries than the
baselines to yield full recall in a bibliographic P2PKM scenario.

Visualization of bibliographic networks with a reshaped landscape metaphor
Brandes, U. & Willhalm, T.
, 'Proceedings of the symposium on Data Visualisation 2002', VISSYM '02, Eurographics Association, Aire-la-Ville, Switzerland, Switzerland, 159-ff (2002) [pdf]
We describe a novel approach to visualize bibliographic networks that facilitates the simultaneous identification of clusters (e.g., topic areas) and prominent entities (e.g., surveys or landmark papers). While employing the landscape metaphor proposed in several earlier works, we introduce new means to determine relevant parameters of the landscape. Moreover, we are able to compute prominent entities, clustering of entities, and the landscape's surface in a surprisingly simple and uniform way. The effectiveness of our network visualizations is illustrated on data from the graph drawing literature.
Contextual-Logic Extension of TOSCANA.
Eklund, P.; Groh, B.; Stumme, G. & Wille, R.
Ganter, B. & Mineau, G. W., ed., 'Conceptual Structures: Logical, Linguistic, and Computational', 1867(), LNAI, Springer, Heidelberg, 453-467 (2000) [pdf]
The Lattice of Concept Graphs of a Relationally Scaled Context
Prediger, S. & Wille, R.
Tepfenhart, W. M. & Cyre, W. R., ed., 'ICCS', 1640(), Lecture Notes in Computer Science, Springer, 401-414 (1999) [pdf]
Conceptual Graphs and Formal Concept Analysis
Wille, R.
Lukose, D.; Delugach, H.; Keeler, M.; Searle, L. & Sowa, J. F., ed., 'Conceptual Structures: Fulfilling Peirce's Dream', 1257(), Lecture Notes in Artificial Intelligence, Springer, Heidelberg, 290-303 (1997)
A Geometrical Heuristic for Drawing Concept Lattices
Stumme, G. & Wille, R.
Tamassia, R. & Tollis, I., ed., 'Graph Drawing', 894(), LNCS, Springer, Heidelberg, 452-459 (1995) [pdf]