Breast Cancer Likelihood Prediction Using Bayesian Networks

Nanda, G. and Sundararajan, R. (2022) Breast Cancer Likelihood Prediction Using Bayesian Networks. In: Current Innovations in Medicine and Medical Science Vol. 4. B P International, pp. 68-86. ISBN 978-93-5547-870-2

Full text not available from this repository.

Abstract

Breast cancer is the most common cancer among women, worldwide. Early detection of breast cancer using machine learning algorithms, with validation by doctors can lead to early diagnosis, treatment, prolonged life of the patient, reduction in the cost of treatment, and decrease in the number of deaths. To achieve this goal, we examined Bayesian Network machine learning models in this study to analyze the big medical data and tested various approaches to increase the accuracy for predicting the likelihood and early detection of the breast cancer. The machine learning library, WEKA (Waikato Environment for Knowledge Analysis) and the Breast Cancer Surveillance Consortium (BCSC) dataset with 53,370 screening records were used. This dataset included thirteen variables and the variable “breast_cancer_history” was considered as the main class variable to be predicted with “class 0” meaning negative breast cancer diagnosis and “class 1” meaning positive breast cancer diagnosis. We used three different Bayesian Networks generated by different structure-determination algorithms- K2 (Hill Climbing Algorithm), TAN (Tree Augmented Naive Bayes), and Simulated Annealing. We compared their performances and associated network structure of the Bayes Networks using performance measures Recall and Precision. Our results indicate that the Bayesian Network generated by Simulated Annealing (SA) resulted in overall best prediction performance, followed by TAN and K2. For Class 1, the Recall was highest for K2 followed by SA and TAN, and Precision was highest for SA followed by TAN and K2. Comparison of network structures of the three Bayesian Networks indicated that the SA had the most complex network structures with maximum connections between the variable nodes, followed by TAN, and K2.

Item Type: Book Section
Subjects: STM Open Academic > Medical Science
Depositing User: Unnamed user with email admin@eprint.stmopenacademic.com
Date Deposited: 05 Oct 2023 12:56
Last Modified: 05 Oct 2023 12:56
URI: http://publish.sub7journal.com/id/eprint/1189

Actions (login required)

View Item
View Item