TY  - CONF
AU  - Li, Xin
AU  - Guo, Lei
AU  - Zhao, Yihong E.
A2  - 
T1  - Tag-based Social Interest Discovery
T2  - Proceedings of the 17th International World Wide Web Conference
PB  - ACM
CY  - 
PY  - 2008/
M2  - 
VL  - 
IS  - 
SP  - 675
EP  - 684
UR  - http://www2008.org/papers/pdf/p675-liA.pdf
M3  - 
KW  - ***
KW  - association
KW  - clustering
KW  - community
KW  - del.icio.us
KW  - detection
KW  - folksonomy
KW  - rules
L1  - 
SN  - 
N1  - 
N1  - 
AB  - The success and popularity of social network systems, such as del.icio.us, Facebook, MySpace, and YouTube, have generated many interesting and challenging problems to the research community. Among others, discovering social interests shared by groups of users is very important because it helps to connect people with common interests and encourages people to contribute and share more contents. The main challenge to solving this problem comes from the diffi- culty of detecting and representing the interest of the users. The existing approaches are all based on the online connections of users and so unable to identify the common interest of users who have no online connections. In this paper, we propose a novel social interest discovery approach based on user-generated tags. Our approach is motivated by the key observation that in a social network, human users tend to use descriptive tags to annotate the contents that they are interested in. Our analysis on a large amount of real-world traces reveals that in general, user-generated tags are consistent with the web content they are attached to, while more concise and closer to the understanding and judgments of human users about the content. Thus, patterns of frequent co-occurrences of user tags can be used to characterize and capture topics of user interests. We have developed an Internet Social Interest Discovery system, ISID, to discover the common user interests and cluster users and their saved URLs by different interest topics. Our evaluation shows that ISID can effectively cluster similar documents by interest topics and discover user communities with common interests no matter if they have any online connections.
ER  -

TY  - JOUR
AU  - Cattuto, C.
AU  - Schmitz, C.
AU  - Baldassarri, A.
AU  - Servedio, V. D. P.
AU  - Loreto, V.
AU  - Hotho, A.
AU  - Grahl, M.
AU  - Stumme, G.
T1  - Network Properties of Folksonomies
JO  - AI Communications
PY  - 2007/
VL  - 20
IS  - 4
SP  - 245 
EP  -  262
UR  - http://www.kde.cs.uni-kassel.de/hotho/pub/2007/aicomm_2007_folksonomy_clustering.pdf
M3  - 
KW  - clustering
KW  - community
KW  - folksonomy
L1  - 
SN  - 
N1  - 
N1  - 
AB  - 
ER  -

TY  - GEN
AU  - Radicchi, Filippo
AU  - Castellano, Claudio
AU  - Cecconi, Federico
AU  - Loreto, Vittorio
AU  - Parisi, Domenico
A2  - 
T1  - Defining and identifying communities in networks
JO  - 
PB  - 
AD  - 
PY  - 2004/02
VL  - 
IS  - 
SP  - 
EP  - 
UR  - http://arxiv.org/abs/cond-mat/0309488
M3  - 
KW  - clustering
KW  - folksonomy
KW  - toread
KW  - graph
KW  - community
L1  - 
N1  - "In general algorithms define communities operationally as what the they finds. A dendrogram, i. e. a community<p>structure, is always produced by the algorithms down to the level of single nodes, independently from the type<p>of graph analyzed. This is due to the lack of explicit prescriptions to discriminate between networks that<p>are actually endowed with a community structure and those that are not. As a consequence, in practical<p>applications one needs additional, non topological, information on the nature of the network to understand<p>which of the branches of the tree have a real significance. Without such information it is not clear at all whether<p>the identification of a community is reliable or not."<p><p>---<p><p>Domain: scientific collaborations<p><p>Task: calculate a dendrogram (the community graph)<p><p>Method: effucuebt GN (Girvan &amp; Newman( algorithm based on edge betweenness. Their algorithm allows to be fine-tuned beween acting local or global. To be more efficient they replace the "edge betweenness" by "edge clustering coefficient" which is based on the number of triangles the edge is contained in VS the degree of the incident nodes.<p><p>Motto: "Algorithm must include the quantitative community definition"
N1  - 
AB  - The investigation of community structures in networks is an important issue<p>in many domains and disciplines. This problem is relevant for social tasks<p>(objective analysis of relationships on the web), biological inquiries<p>(functional studies in metabolic, cellular or protein networks) or<p>technological problems (optimization of large infrastructures). Several types<p>of algorithm exist for revealing the community structure in networks, but a<p>general and quantitative definition of community is still lacking, leading to<p>an intrinsic difficulty in the interpretation of the results of the algorithms<p>without any additional non-topological information. In this paper we face this<p>problem by introducing two quantitative definitions of community and by showing<p>how they are implemented in practice in the existing algorithms. In this way<p>the algorithms for the identification of the community structure become fully<p>self-contained. Furthermore, we propose a new local algorithm to detect<p>communities which outperforms the existing algorithms with respect to the<p>computational cost, keeping the same level of reliability. The new algorithm is<p>tested on artificial and real-world graphs. In particular we show the<p>application of the new algorithm to a network of scientific collaborations,<p>which, for its size, can not be attacked with the usual methods. This new class<p>of local algorithms could open the way to applications to large-scale<p>technological and biological applications.
ER  -