Akademska digitalna zbirka SLovenije - logo
E-viri
Celotno besedilo
Recenzirano
  • Closed frequent similar pat...
    Rodríguez-González, Ansel Y.; Lezama, Fernando; Iglesias-Alvarez, Carlos A.; Martínez-Trinidad, José Fco; Carrasco-Ochoa, Jesús A.; de Cote, Enrique Munoz

    Expert systems with applications, 04/2018, Letnik: 96
    Journal Article

    •The concept of closed frequent similar pattern mining is introduced.•Several lemmas to prune the search space are introduced and proved.•A novel closed frequent similar pattern mining algorithm (CFSP-Miner), is proposed.•CFSP-Miner is more efficient than the frequent pattern mining algorithms.•CFSP-Miner has excellent scalability properties. Frequent pattern mining is considered a key task to discover useful information. Despite the quality of solutions given by frequent pattern mining algorithms, most of them face the challenge of how to reduce the number of frequent patterns without information loss. Frequent itemset mining addresses this problem by discovering a reduced set of frequent itemsets, named closed frequent itemsets, from which the entire frequent pattern set can be recovered. However, for frequent similar pattern mining, where the number of patterns is even larger than for Frequent itemset mining, this problem has not been addressed yet. In this paper, we introduce the concept of closed frequent similar pattern mining to discover a reduced set of frequent similar patterns without information loss. Additionally, a novel closed frequent similar pattern mining algorithm, named CFSP-Miner, is proposed. The algorithm discovers frequent patterns by traversing a tree that contains all the closed frequent similar patterns. To do this efficiently, several lemmas to prune the search space are introduced and proven. The results show that CFSP-Miner is more efficient than the state-of-the-art frequent similar pattern mining algorithms, except in cases where the number of frequent similar patterns and closed frequent similar patterns are almost equal. However, CFSP-Miner is able to find the closed similar patterns, yielding a reduced size of the discovered frequent similar pattern set without information loss. Also, CFSP-Miner shows good scalability while maintaining an acceptable runtime performance.