TY - CONF AU - Li, Xin AU - Guo, Lei AU - Zhao, Yihong E. A2 - T1 - Tag-based Social Interest Discovery T2 - Proceedings of the 17th International World Wide Web Conference PB - ACM CY - PY - 2008/ M2 - VL - IS - SP - 675 EP - 684 UR - http://www2008.org/papers/pdf/p675-liA.pdf M3 - KW - *** KW - association KW - clustering KW - community KW - del.icio.us KW - detection KW - folksonomy KW - rules L1 - SN - N1 - N1 - AB - The success and popularity of social network systems, such as del.icio.us, Facebook, MySpace, and YouTube, have generated many interesting and challenging problems to the research community. Among others, discovering social interests shared by groups of users is very important because it helps to connect people with common interests and encourages people to contribute and share more contents. The main challenge to solving this problem comes from the diffi- culty of detecting and representing the interest of the users. The existing approaches are all based on the online connections of users and so unable to identify the common interest of users who have no online connections. In this paper, we propose a novel social interest discovery approach based on user-generated tags. Our approach is motivated by the key observation that in a social network, human users tend to use descriptive tags to annotate the contents that they are interested in. Our analysis on a large amount of real-world traces reveals that in general, user-generated tags are consistent with the web content they are attached to, while more concise and closer to the understanding and judgments of human users about the content. Thus, patterns of frequent co-occurrences of user tags can be used to characterize and capture topics of user interests. We have developed an Internet Social Interest Discovery system, ISID, to discover the common user interests and cluster users and their saved URLs by different interest topics. Our evaluation shows that ISID can effectively cluster similar documents by interest topics and discover user communities with common interests no matter if they have any online connections. ER - TY - JOUR AU - Cattuto, C. AU - Schmitz, C. AU - Baldassarri, A. AU - Servedio, V. D. P. AU - Loreto, V. AU - Hotho, A. AU - Grahl, M. AU - Stumme, G. T1 - Network Properties of Folksonomies JO - AI Communications PY - 2007/ VL - 20 IS - 4 SP - 245 EP - 262 UR - http://www.kde.cs.uni-kassel.de/hotho/pub/2007/aicomm_2007_folksonomy_clustering.pdf M3 - KW - clustering KW - community KW - folksonomy L1 - SN - N1 - N1 - AB - ER - TY - GEN AU - Radicchi, Filippo AU - Castellano, Claudio AU - Cecconi, Federico AU - Loreto, Vittorio AU - Parisi, Domenico A2 - T1 - Defining and identifying communities in networks JO - PB - AD - PY - 2004/02 VL - IS - SP - EP - UR - http://arxiv.org/abs/cond-mat/0309488 M3 - KW - clustering KW - folksonomy KW - toread KW - graph KW - community L1 - N1 - "In general algorithms define communities operationally as what the they finds. A dendrogram, i. e. a community

structure, is always produced by the algorithms down to the level of single nodes, independently from the type

of graph analyzed. This is due to the lack of explicit prescriptions to discriminate between networks that

are actually endowed with a community structure and those that are not. As a consequence, in practical

applications one needs additional, non topological, information on the nature of the network to understand

which of the branches of the tree have a real significance. Without such information it is not clear at all whether

the identification of a community is reliable or not."

---

Domain: scientific collaborations

Task: calculate a dendrogram (the community graph)

Method: effucuebt GN (Girvan & Newman( algorithm based on edge betweenness. Their algorithm allows to be fine-tuned beween acting local or global. To be more efficient they replace the "edge betweenness" by "edge clustering coefficient" which is based on the number of triangles the edge is contained in VS the degree of the incident nodes.

Motto: "Algorithm must include the quantitative community definition" N1 - AB - The investigation of community structures in networks is an important issue

in many domains and disciplines. This problem is relevant for social tasks

(objective analysis of relationships on the web), biological inquiries

(functional studies in metabolic, cellular or protein networks) or

technological problems (optimization of large infrastructures). Several types

of algorithm exist for revealing the community structure in networks, but a

general and quantitative definition of community is still lacking, leading to

an intrinsic difficulty in the interpretation of the results of the algorithms

without any additional non-topological information. In this paper we face this

problem by introducing two quantitative definitions of community and by showing

how they are implemented in practice in the existing algorithms. In this way

the algorithms for the identification of the community structure become fully

self-contained. Furthermore, we propose a new local algorithm to detect

communities which outperforms the existing algorithms with respect to the

computational cost, keeping the same level of reliability. The new algorithm is

tested on artificial and real-world graphs. In particular we show the

application of the new algorithm to a network of scientific collaborations,

which, for its size, can not be attacked with the usual methods. This new class

of local algorithms could open the way to applications to large-scale

technological and biological applications. ER -