Skip to main content
Log in

Association Rules Mining

Exact, Approximate and Parallel Methods: A Survey

  • Survey Article
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Association rules mining (ARM) is an unsupervised learning task. It is used to generate significant and relevant association rules among items in a database. APRIORI and FP-GROWTH are the most popular and used algorithms nowadays for extracting such rules. They are exact methods that consist of two phases. First, frequent itemsets are generated. Then, the latter are used to generate rules. The main drawback of both algorithms is their high execution time. To overcome this drawback, metaheuristics have been proposed. Moreover, to optimize the execution time, since the amount of data is in continuous growth, some parallel architectures can be found in the literature. In this paper, we present an overview of existing literature that investigate the ARM problem using both metaheuristics and parallelism. We will focus on the recent algorithms that tackle these problems using approximate methods and GPU. We will present a non-exhaustive classification of different algorithms according to the type of execution (sequential or parallel) and type of method (exact or approximate).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19

Similar content being viewed by others

References

  1. Adil SH, Qamar S. Implementation of association rule mining using CUDA. Emerging Technologies. ICET 2009, IEEE. 2009.

  2. Agarwal RC, Aggarwal CC, Prasad VVV. Depth First Generation of long patterns. KDD’00: Proceedings of the sixth ACM SIGKDD international conference on knowledge discovery and data mining, 2000;108–18.

  3. Aggarwal CC. Neural Networks and Deep Learning: A Textbook. New York: Springer; 2018.

    Book  Google Scholar 

  4. Agrawal R, Srikant R. Fast algorithms for mining association rules. Proceedings of 20th international conference on very large data bases, VLDB, (Vol. 1215, pp. 487–99). 1994.

  5. Agrawal R, Shafer JC. Parallel mining of association rules. IEEE Trans Knowl Data Eng. 1996;8(6):962–9.

    Article  Google Scholar 

  6. Blanchard J, Guillet F et al. Exploratory visualization for association rule rummaging. Workshop on Multimedia Data Mining, MDM-03. 2003.

  7. Burdick D, Calimlim M, Gehrke J. MAFIA: a maximal frequent itemset algorithm for transactional databases. In: Proceedings 17th International Conference on Data Engineering, IEEE 2001.

  8. Cano A, Luna J, Ventura S. High performance evaluation of evolutionary-mined association rules on GPUs. J Supercomput. 2013;66(3):1438–61.

    Article  Google Scholar 

  9. Cui Q, Guo X. Research on parallel association rules mining on GPU. Proceedings of the 2nd International Conference on Green Communications and Networks 2012 (GCN 2012): Volume 2, Springer. 2013.

  10. Dasgupta S, Saha B. Study of various parallel implementations of association rule mining algorithm. Am J Adv Comput 2020;1(3), 1–7.

  11. Deshmukh RA, Bharathi HN, Tripathy AK. Processing parallel, of frequent itemset based on MapReduce programming model. In 5th International Conference On Computing, Communication. Control And Automation (ICCUBEA). 2019;2019:1–6. https://doi.org/10.1109/ICCUBEA47591.2019.9128369.

  12. Djenouri Y, Drias H et Chemchem A. A hybrid Bees Swarm Optimization and Tabu Search algorithm for Association rule mining. 2013 World Congress on Nature and Biologically Inspired Computing, 2013.

  13. Djenouri Y, Bendjoudi A, Mehdi M et al. Parallel association rules mining using GPUS and bees behaviors. In 2014 6th International Conference on Soft Computing and Pattern Recognition (SoCPaR), pp. 401–5, IEEE. 2014.

  14. Djenouri Y, Bendjoudi A, Mahdi M, et al. GPU-based bees swarm optimization for association rules mining. J Supercomput. 2015;71(4):1318–44.

    Article  Google Scholar 

  15. Djenouri Y, Belhadi A, Fournier-Vigerd F, Fujitae H. Mining diversified association rules in big datasets: a cluster/GPU/genetic approach. Inf Sci. 2018a;459:117–34.

    Article  MathSciNet  Google Scholar 

  16. Djenouri Y, Djenouri D, Habbas Z. Intelligent mapping between GPU and cluster computing for discovering big association rules. Appl Soft Comput. 2018b;65:387–99.

    Article  Google Scholar 

  17. Djenouri Y, Djenouri D, Belhadi A, Fournier-Vigerd, F. GBSO-RSS: GPU-based BSO for rules space summarization, ICBDL 2018: Big Data Analysis and Deep Learning Applications, pp. 123–9. 2018c.

  18. Djenouri Y, Bendjoudi A, Djenouri D, Comuzzi M. GPU-based Bio-inspired Model for Solving Association Rules Mining Problem, 2017 25th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP), St. Petersburg, 2017;pp. 262–9.

  19. Djenouri Y, Bendjoudi A, Habbas Z, Mahdi M, Djenouri D. Reducing thread divergence in GPU-based bees swarm optimization applied to association rule mining. Concurr Comput Pract Exp. 2017a;29(9).

  20. Fang W, Lu M et al. Frequent itemset mining on graphics processors. Proceedings of the fifth international workshop on data management on new hardware, ACM. 2009.

  21. Fu C, Wang X, Zhang L, Qiao L. Mining algorithm for association rules in big data based on Hadoop. AIP Conf Proc. 2018;1955:040035.

    Article  Google Scholar 

  22. Fund I. Comparing Association Rules and Deep Neural Networks on Medical Data. Master of Science in Computer Science. University of Houston, 2019.

  23. Garg R, Mishra PK. Exploiting parallelism in association rule mining algorithms. Int J Adv Technol. 2011;2:222–32.

    Google Scholar 

  24. Ghafari SM, Tjortjis C. A survey on association rules mining using heuristics. 2019. https://doi.org/10.1002/widm.1307, Wiley online library.

  25. Gouda K, Zaki MJ. Genmax: An efficient algorithm for mining maximal frequent itemsets. Data Mining and Knowledge Discovery. New York: Springer; 2005.

    Google Scholar 

  26. Guo H, Zhou Y. An algorithm for mining association rules based on improved genetic algorithm and its application. In: 2009 Third International Conference on Genetic and Evolutionary Computing, 2009, pp 117–20. https://doi.org/10.1109/WGEC.2009.15.

  27. Gunopulos D, Khardon R, Mannila H, Saluja S et al. Discovering all most specific sentences, ACM Transactions on Database Systems. 2003;Vol. 28, No. 2.

  28. Goethals B. Frequent Itemset Mining Implementations Repository. Avalaible in http://fimi.ua.ac.be/data/. 2004.

  29. Han J, Pei J. Yin Y. Mining frequent patterns without candidate generation. ACM sigmod record. Vol. 29. No. 2. ACM, 2000.

  30. Hamdad L, Benatchba K, Bendjoudi A, Ournani Z. Impact of Genetic Algorithms Operators on Association Rules Extraction. In: Rojas I, Joya G, Catala A (eds) Advances in Computational Intelligence. IWANN 2019.

  31. Hamdad L, Ournani Z, Benatchba K, Bendjoudi . Two-level parallel CPU/GPU-based genetic algorithm for association rule mining. Int J Comput Sci Eng. 2020;22(2/3):335–45.

  32. Indira K, Kanmani S. Performance analysis of genetic algorithm for mining association rules. Int. J. Comput. Sci. Issue. 2012;9(2).

  33. Jenifer Nancy J, Jansi Rani M, Devaraj D. Association Rule Mining in Big Data using Map Reduce Approach in Hadoop. International Conference on Innovations in Engineering and Technology(ICIET) - 2016.

  34. Jogin M, Madhulika MS, Divya GD. Feature extraction using convolution neural networks (CNN) and deep learning. 3rd IEEE International Conference on Recent Trends in Electronics, Information and Communication Technology (RTEICT), 2018.

  35. Jourdan L. Métaheuristiques pour l’extraction de connaissances: Application à la génomique, Université des Sciences et Technologie de LilleI. PhD dissertation, 2003.

  36. Kabir MMJ, Xu S, Kang BH, Zhao Z. A new evolutionary algorithm for extracting a reduced set of interesting association rules. International Conference on Neural Information Processing, p. 133–142. Springer International Publishing, 2015.

  37. Khabzaoui M, Dhaenens C, Talbi E-G. Parallel genetic algorithms for multi-objective rule mining. In: The 6th MIC 2005.

  38. Khademolghorani F, Baraani A, Kamran Z. Efficient mining of association rules based on gravitational search algorithm. Int J Comput Sci. 2014;8.

  39. Kuo RJ, Chao CM, Chiu YT. Application of particle swarm optimization to association rule mining. Appl Soft Comput. 2011;11(1):326–36.

    Article  Google Scholar 

  40. Lenca P, Meyer P, Vaillant B, Lallich S. On selecting interestingness measures for association rules: user oriented description and multiple criteria decision aid. Eur J Oper Res. 2008;184(2):610–26.

    Article  Google Scholar 

  41. Li X, Li D, Deng Y, Xing J. Intelligent mining algorithm for complex medical data based on deeplearning. J Ambient Intell Hum Comput. 2021;12:1667–78.

    Article  Google Scholar 

  42. Lin DI, Kedem ZM. Pincer-search: A new algorithm for discovering the maximum frequent set. In Advances in Database Technology, EDBT 1998, pp. 105–119. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 1377 LNCS). Springer Verlag. 1998. https://doi.org/10.1007/bfb0100980.

  43. Mata J, Alvarez JL, Riquelme JC. Mining numeric association rules with genetic algorithms. Artificial Neural Nets and Genetic Algorithms. Vienna: Springer; 2001.

    Book  Google Scholar 

  44. Mata J, Alvarez J-L, Riquelme J-C. Discovering numeric association rules via evolutionary algorithm. Advances in knowledge discovery and data mining. 2002;40–51.

  45. Moslehi P, Bidgoli BM, Nasiri M, Salajegheh A. Multi-objective numeric association rules mining via ant colony optimization for continuous domains without specifying minimum support and minimum confidence. International J Comput Sci Issue (IJCSI)8. 2011.

  46. Padillo F, Luna JM, Herrera F, Ventura S. Mining association rules on Big Data through MapReduce genetic programming. Integr Comput-Aid Eng. 2018;25(1):31–48.

    Article  Google Scholar 

  47. Pasquier N. Extraction de Bases pour les Règles d’Association á partir des Itemsets Fermés Fréquents. In Actes de la conférence INFORSID sur les Systèmes d’Information, 2000;56–77.

  48. Parthasarathy S, Zaki M, Ogihara M, et al. Parallel data mining for association rules on shared-memory systems. Knowl Inf Syst. 2001;3:1–29. https://doi.org/10.1007/PL00011656.

    Article  MATH  Google Scholar 

  49. Roberto J, Bayardo Jr. Efficiently Mining Long Patterns from Databases. in Proc. of the 1998 ACM-SIGMOD Int’l Conf. on Management of Data, 85–93.

  50. Savasere A, Omiecinski E, Navathe SB. An efficient algorithm for mining association rules in large databases. In Proceedings of the 21th International Conference on Very Large Data Bases, 1995, 432–44.

  51. Skansi C. Introduction to deep learning: from logical calculus to artificial intelligence. février: Springer; 2018.

    Book  Google Scholar 

  52. Singh S, Garg R, Mishra PK. Review of Apriori Based Algorithms on MapReduce Framework. arXiv:1702.06284. 2017.

  53. Telikania A, Gandomi AH, Shahbahrami A. A survey of evolutionary computation for association rule mining. Inf Sci. 2020;524:318–52.

    Article  MathSciNet  Google Scholar 

  54. Thilina A, Attanayake A et al. Intruder detection using deep learning and association rule mining. In 2016 IEEE International Conference on Computer and Information Technology (CIT). 2016.

  55. Toivonen H, Klemettinen M et al. Pruning and grouping discovered association rules. Available on : http://citeseerx.ist.psu.edu/viewdoc/. 1995.

  56. Tufféry S. Data Mining and Statistics for Decision Making, Chichester, GB: John Wiley and Sons, translated from the French Data Mining et statistique décisionnelle (Éditions Technip, 2008), 2011.

  57. Wang M, Zou Q, Liu C. Multi-dimension association rule mining based on adaptive genetic algorithm. In Uncertainty Reasoning and Knowledge Engineering (URKE), International Conference on, IEEE, 2011.

  58. Yang X, Ye Y, Li X, Lau RYK, Zhang X, Huang X. Hyperspectral image classification with deep learning models. IEEE Trans Geosci Remote Sens 2018;56(9).

  59. Yan X, Zhang C, Zhang S. Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support. Expert Syst Appl. 2009;36(2):3066–76.

    Article  Google Scholar 

  60. Zaki MJ. Parallel and distributed association mining: a survey. IEEE Concurr. 1999;4:14–25.

    Article  Google Scholar 

  61. Zaki MJ, Hsiao C. Efficient algorithms for mining closed itemsets and their lattice structure. In: IEEE Transactions on Knowledge and Data Engineering, vol 17, no 4, pp 462–78, April 2005. https://doi.org/10.1109/TKDE.2005.60.

  62. Zhang F, Zhang Y et al. Gpapriori: Gpu-accelerated frequent itemset mining. Cluster Computing (CLUSTER), 2011 IEEE International Conference on, IEEE. 2011.

  63. Zhang Z, He Q, Gao J, Ni M. A deep learning approach for detecting traffic accidents from social media data. Transportation Research Part C: Emerging Technologies Volume 86, January 2018, Pages 580–96, Elsevier.

  64. Zhou J, Yu KM et al. Parallel frequent patterns mining algorithm on GPU. Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on IEEE.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Leila HAMDAD.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

HAMDAD, L., BENATCHBA, K. Association Rules Mining. SN COMPUT. SCI. 2, 449 (2021). https://doi.org/10.1007/s42979-021-00819-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-021-00819-x

Keywords

Navigation