Performance-friendly rule extraction in large water data-sets with AOC posets and relational concept analysis - Université de Strasbourg Accéder directement au contenu
Article Dans Une Revue International Journal of General Systems Année : 2016

Performance-friendly rule extraction in large water data-sets with AOC posets and relational concept analysis

Résumé

In this paper, we consider data analysis methods for knowledge extraction from large water data-sets. More specifically, we try to connect physico-chemical parameters and the characteristics of taxons living in sample sites. Among these data analysis methods, we consider formal concept analysis (FCA), which is a recognized tool for classification and rule discovery on object–attribute data. Relational concept analysis (RCA) relies on FCA and deals with sets of object–attribute data provided with relations. RCA produces more informative results but at the expense of an increase in complexity. Besides, in numerous applications of FCA, the partially ordered set of concepts introducing attributes or objects (AOC poset, for Attribute–Object–Concept poset) is used rather than the concept lattice in order to reduce combinatorial problems. AOC posets are much smaller and easier to compute than concept lattices and still contain the information needed to rebuild the initial data. This paper introduces a variant of the RCA process based on AOC posets rather than concept lattices. This approach is compared with RCA based on iceberg lattices. Experiments are performed with various scaling operators, and a specific operator is introduced to deal with noisy data. We show that using AOC poset on water data-sets provides a reasonable concept number and allows us to extract meaningful implication rules (association rules whose confidence is 1), whose semantics depends on the chosen scaling operator.
Fichier principal
Vignette du fichier
IJGS_update_sept2023.pdf (781.69 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01265521 , version 1 (24-09-2023)

Identifiants

Citer

Xavier Dolques, Florence Le Ber, Marianne Huchard, Corinne Grac. Performance-friendly rule extraction in large water data-sets with AOC posets and relational concept analysis. International Journal of General Systems, 2016, SI, 45 (2), pp.187-210. ⟨10.1080/03081079.2015.1072927⟩. ⟨hal-01265521⟩
285 Consultations
13 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More