Analysis of A Priori Algorithm in Medical Data for Heart Disease Identification with Association Rule Mining

Authors

  • Davip Sutejo Teknik Informatika, Universitas Dr. Soetomo
  • Villa Indra Yudha Adhi Jaya Teknik Informatika, Universitas Dr. Soetomo
  • Slamet Kacung Teknik Informatika, Universitas Dr. Soetomo

DOI:

https://doi.org/10.30983/knowbase.v4i2.8909

Keywords:

Association Rule Mining, A priori Algorithm, Heart Disease

Abstract

Heart disease is one of the leading causes of death worldwide, so it is important to identify risk factors that can contribute to the development of this disease in order to carry out early prevention. This study aims to identify patterns of association between risk variables and the incidence of heart disease using the Association Rule Mining (ARM) method combined with the A priori algorithm. The data used in this study includes lifestyle information, medical history, and other health parameters, obtained from the UCI Machine Learning repository. The analysis results showed that with a support value between 30% and 70%, the strongest association rule was found between sex (sex = 1) and angina (exang = 1), with a lift value of 1.67, indicating a strong positive relationship towards a positive diagnosis (target = 1). In addition, other moderate association rules were found, such as the combination of cp_1 = 1 and ca_0 = 1, with a lift value of about 0.73, indicating a weaker association. These findings suggest that some attribute combinations have higher predictive power, which can be used to improve prediction models in the medical diagnosis of heart disease. This research also highlights the main challenges faced by the A priori algorithm, such as computational complexity and selecting the right threshold to obtain significant rules

References

M. M. A. H. Zarei, S. M. M. Fakhraei, “A novel association rule mining-based algorithm for healthcare data analysis,” J. Healthc. Eng., pp. 1–10, 2021.

P. S. S. Singh, M. M. S. Soni, “Data mining techniques and tools for healthcare data,” Int. J. Comput. Appl., vol. 174, no. 3, pp. 10–17, 2021.

A. Yazdani, K. D. Varathan, Y. K. Chiam, A. W. Malik, and W. A. Wan Ahmad, “A novel approach for heart disease prediction using strength scores with significant predictors,” BMC Med. Inform. Decis. Mak., vol. 21, no. 1, pp. 1–16, 2021.

E. Yunianto, “Prediksi penyakit kardiovaskular berbasis asosiation rule,” 2017.

and H. Z. M. Aljarah, S. M. M. Fakhraei, “Predicting cardiovascular disease using machine learning: A systematic review,” Healthcare, vol. 8, no. 3, pp. 1–16, 2022.

and F. T. V. C. A. Rojas, J. C. Peña, “Association rule mining and its application in medical diagnosis: A review,” J. Appl. Res. Technol., vol. 19, no. 2, pp. 113–124, 2021.

and D. N. R. A. R. Singh, K. R. Singh, “Analysis of cardiovascular diseases using association rule mining,” Int. J. Data Sci. Anal., vol. 3, no. 4, pp. 200–210, 2021.

K. Rameshkumar, M. Sambath, and S. Ravi, “Relevant association rule mining from medical dataset using new irrelevant rule elimination technique,” 2013 Int. Conf. Inf. Commun. Embed. Syst. ICICES 2013, pp. 300–304, 2013.

R. B. Tiwari and A. G. Sharma, “Predictive modeling of heart disease using association rule mining and neural network,” Int. J. Eng. Technol., vol. 13, no. 3, pp. 1124–1131, 2023.

S. M. M. F. H. Zarei, M. Aljarah, “Predicting cardiovascular disease using machine learning: A systematic review,” Healthcare, vol. 8, no. 3, pp. 1–16, 2022.

S. Darrab, D. Broneske, and G. Saake, “Exploring the predictive factors of heart disease using rare association rule mining,” Sci. Rep., vol. 14, no. 1, pp. 1–26, 2024.

M. Kaushik, R. Sharma, I. Fister, and D. Draheim, “Numerical Association Rule Mining: A Systematic Literature Review,” vol. 1, no. 1, 2023.

H. Haryanto, H. Winarto, and C. Juliane, “Application of Data Mining Techniques in Healthcare: Identifying Inter-Disease Relationships through Association Rule Mining,” J. World Sci., vol. 3, no. 5, pp. 520–529, 2024.

X. Zhang and J. Zhang, “Analysis and research on library user behavior based on apriori algorithm,” Meas. Sensors, vol. 27, no. April, p. 100802, 2023.

UCI Machine Learning Repository, “Heart Disease.” [Online]. Available: https://archive.ics.uci.edu/dataset/45/heart+disease. [Accessed: 30-Dec-2024].

Kusrini, Luthfi T, E., Algoritma Data Mining. Buku Algoritma Data Mining. Andi Offset, Yogyakarta, 2009.

M. H. Santoso, “Application of Association Rule Method Using Apriori Algorithm to Find Sales Patterns Case Study of Indomaret Tanjung Anom,” vol. 1, no. 2, pp. 54–66, 2021.

C. Mihaescu, “Mining Frequent Itemsets – Apriori Algorithm,” Lab. Modul. 8, 2012.

E. Hikmawati, N. U. Maulidevi, and K. Surendro, “Minimum threshold determination method based on dataset characteristics in association rule mining,” J. Big Data, vol. 8, no. 1, 2021.

B. Mudumba and M. F. Kabir, “Mine-first association rule mining: An integration of independent frequent patterns in distributed environments,” Decis. Anal. J., vol. 10, no. February, p. 100434, 2024.

J. Tempaiboolkul, “Mining rare association rules in a distributed environment using multiple minimum supports,” 2013 IEEE/ACIS 12th Int. Conf. Comput. Inf. Sci. ICIS 2013 - Proc., pp. 295–299, 2013.

M. I. Habibi, A. Nazir, E. Haerani, and E. Budianita, “Application of Data Mining for Ceramic Sales Data Association Using Apriori Algorithm,” vol. 05, no. 02, pp. 1–11, 2024.

F. A. Rohmah, S. Kacung, and E. Prihartono, “Association Rule Mining To Enhance Sata Bottle Sales,” vol. 04, no. 01, pp. 28–37, 2024.

Downloads

Submitted

2024-12-05

Accepted

2024-12-24

Published

2024-12-24