On Finding Graph Clusterings with Maximum Modularity

Modularity is a recently introduced quality measure for graph clusterings. It has immediately received considerable attention in several disciplines, and in particular in the complex systems literature, although its properties are not well understood. We study the problem of finding clusterings with maximum modularity, thus providing theoretical foundations for past and present work based on this measure. More precisely, we prove the conjectured hardness of maximizing modularity both in the general case and with the restriction to cuts, and give an Integer Linear Programming formulation. This is complemented by first insights into the behavior and performance of the commonly applied greedy agglomaration approach.

Content Aggregation on Knowledge Bases using Graph Clustering

Recently, research projects such as PADLR and SWAP

have developed tools like Edutella or Bibster, which are targeted at

establishing peer-to-peer knowledge management (P2PKM) systems. In

such a system, it is necessary to obtain provide brief semantic

descriptions of peers, so that routing algorithms or matchmaking

processes can make decisions about which communities peers should

belong to, or to which peers a given query should be forwarded.

This paper provides a graph clustering technique on

knowledge bases for that purpose. Using this clustering, we can show

that our strategy requires up to 58% fewer queries than the

baselines to yield full recall in a bibliographic P2PKM scenario.

have developed tools like Edutella or Bibster, which are targeted at

establishing peer-to-peer knowledge management (P2PKM) systems. In

such a system, it is necessary to obtain provide brief semantic

descriptions of peers, so that routing algorithms or matchmaking

processes can make decisions about which communities peers should

belong to, or to which peers a given query should be forwarded.

This paper provides a graph clustering technique on

knowledge bases for that purpose. Using this clustering, we can show

that our strategy requires up to 58% fewer queries than the

baselines to yield full recall in a bibliographic P2PKM scenario.

Content Aggregation on Knowledge Bases using Graph Clustering

Recently, research projects such as PADLR and SWAP

have developed tools like Edutella or Bibster, which are targeted at

establishing peer-to-peer knowledge management (P2PKM) systems. In

such a system, it is necessary to obtain provide brief semantic

descriptions of peers, so that routing algorithms or matchmaking

processes can make decisions about which communities peers should

belong to, or to which peers a given query should be forwarded.

have developed tools like Edutella or Bibster, which are targeted at

establishing peer-to-peer knowledge management (P2PKM) systems. In

such a system, it is necessary to obtain provide brief semantic

descriptions of peers, so that routing algorithms or matchmaking

processes can make decisions about which communities peers should

belong to, or to which peers a given query should be forwarded.

This paper provides a graph clustering technique on

knowledge bases for that purpose. Using this clustering, we can show

that our strategy requires up to 58% fewer queries than the

baselines to yield full recall in a bibliographic P2PKM scenario.

The structure and function of complex networks

Inspired by empirical studies of networked systems such as the Internet,

cial networks, and biological networks, researchers have in recent years

veloped a variety of techniques and models to help us understand or predict

e behavior of these systems. Here we review developments in this field,

cluding such concepts as the small-world effect, degree distributions,

ustering, network correlations, random graph models, models of network growth

d preferential attachment, and dynamical processes taking place on networks.

cial networks, and biological networks, researchers have in recent years

veloped a variety of techniques and models to help us understand or predict

e behavior of these systems. Here we review developments in this field,

cluding such concepts as the small-world effect, degree distributions,

ustering, network correlations, random graph models, models of network growth

d preferential attachment, and dynamical processes taking place on networks.

On spectral clustering: Analysis and an algorithm

Despite many empirical successes of spectral clustering methods| algorithms that cluster points using eigenvectors of matrices derived from the data|there are several unresolved issues. First, there are a wide variety of algorithms that use the eigenvectors in slightly dierent ways. Second, many of these algorithms have no proof that they will actually compute a reasonable clustering. In this paper, we present a simple spectral clustering algorithm that can be implemented using a few lines of Matlab. Using tools from matrix perturbation theory, we analyze the algorithm, and give conditions under which it can be expected to do well. We also show surprisingly good experimental results on a number of challenging clustering problems. 1