Cerf,Loïc
Besson,Jérémy
Robardet,Céline
Boulicaut,Jean-François
Closed patterns meet n-ary relations
ACM Transactions on Knowledge Discovery from Data
ACM
3
1–36
2009
Set pattern discovery from binary relations has been extensively studied during the last decade. In particular, many complete and efficient algorithms for frequent closed set mining are now available. Generalizing such a task to n-ary relations (n ≥ 2) appears as a timely challenge. It may be important for many applications, for example, when adding the time dimension to the popular objects × features binary case. The generality of the task (no assumption being made on the relation arity or on the size of its attribute domains) makes it computationally challenging. We introduce an algorithm called Data-Peeler. From an n-ary relation, it extracts all closed n-sets satisfying given piecewise (anti) monotonic constraints. This new class of constraints generalizes both monotonic and antimonotonic constraints. Considering the special case of ternary relations, Data-Peeler outperforms the state-of-the-art algorithms CubeMiner and Trias by orders of magnitude. These good performances must be granted to a new clever enumeration strategy allowing to efficiently enforce the closeness property. The relevance of the extracted closed n-sets is assessed on real-life 3-and 4-ary relations. Beyond natural 3-or 4-ary relations, expanding a relation with an additional attribute can help in enforcing rather abstract constraints such as the robustness with respect to binarization. Furthermore, a collection of closed n-sets is shown to be an excellent starting point to compute a tiling of the dataset.
Cerf,Loïc
Besson,Jérémy
Robardet,Céline
Boulicaut,Jean-Francois
Data-Peeler: Constraint-based Closed Pattern Mining in n-ary Relations
37–48
2008
Set pattern discovery from binary relations has been extensively studied during the last decade. In particular, many complete and efficient algorithms which extract frequent closed sets are now available. Generalizing such a task to n-ary relations (n ≥ 2) appears as a timely challenge. It may be important for many applications, e.g., when adding the time dimension to the popular objects × features binary case. The generality of the task — no assumption being made on the relation arity or on the size of its attribute domains — makes it computationally challenging. We introduce an algorithm called Data-Peeler. From a n-ary relation, it extracts all closed n-sets satisfying given piecewise (anti)-monotonic constraints. This new class of constraints generalizes both monotonic and anti-monotonic constraints. Considering the special case of ternary relations, Data-Peeler outperforms the state-of-the-art algorithms CubeMiner and Trias by orders of magnitude. These good performances must be granted to a new clever enumeration strategy allowing an efficient closeness checking. An original application on a real-life 4-ary relation is used to assess the relevancy of closed n-sets constraint-based mining.
Cattuto,Ciro
Schmitz,Christoph
Baldassarri,Andrea
Servedio,Vito D.P.
Loreto,Vittorio
Hotho,Andreas
Grahl,Miranda
Stumme,Gerd
Network Properties of Folksonomies
AI Communications Journal, Special Issue on ``Network Analysis in Natural Sciences and Engineering''
IOS Press
20
245-262
2007
Jäschke,Robert
Hotho,Andreas
Schmitz,Christoph
Stumme,Gerd
Analysis of the Publication Sharing Behaviour in BibSonomy
Springer-Verlag
4604
283–295
2007
BibSonomy is a web-based social resource sharing system which allows users to organise and share bookmarks and publications in a collaborative manner. In this paper we present the system, followed by a description of the insights in the structure of its bibliographic data that we gained by applying techniques we developed in the area of Formal Concept Analysis.
Hotho,Andreas
Jäschke,Robert
Schmitz,Christoph
Stumme,Gerd
BibSonomy: A Social Bookmark and Publication Sharing System
Aalborg Universitetsforlag
87-102
2006
Social bookmark tools are rapidly emerging on the Web. In suchsystems users are setting up lightweight conceptual structurescalled folksonomies. The reason for their immediate success is thefact that no specific skills are needed for participating. In thispaper we specify a formal model for folksonomies and briefly describe our own system BibSonomy, which allows for sharing both bookmarksand publication references in a kind of personal library.
Jäschke,Robert
Hotho,Andreas
Schmitz,Christoph
Ganter,Bernhard
Stumme,Gerd
TRIAS - An Algorithm for Mining Iceberg Tri-Lattices
IEEE Computer Society
907-911
2006
In this paper, we present the foundations for mining frequent tri-concepts, which extend the notion of closed itemsets to three-dimensional data to allow for mining folksonomies. We provide a formal definition of the problem, and present an efficient algorithm for its solution as well as experimental results on a large real-world example.
Cimiano,P.
Hotho,A.
Staab,S.
Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis
Journal of Artificial Intelligence Research
24
305-339
2005