Breast cancer prediction using different machine learning methods applying multi factors

  • Elham Nazari
  • , Hamid Naderi
  • , Mahla Tabadkani
  • , Reza ArefNezhad
  • , Amir Hossein Farzin
  • , Mohammad Dashtiahangar
  • , Majid Khazaei
  • , Gordon A. Ferns
  • , Amin Mehrabian
  • , Hamed Tabesh*
  • , Amir Avan*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Objective: Breast cancer (BC) is a multifactorial disease and is one of the most common cancers globally. This study aimed to compare different machine learning (ML) techniques to develop a comprehensive breast cancer risk prediction model based on features of various factors.

Methods: The population sample contained 810 records (115 cancer patients and 695 healthy individuals). 45 attributes out of 85 were selected based on the opinion of experts. These selected attributes are in genetic, biochemical, biomarker, gender, demographic and pathological factors. 13 Machine learning models were trained with proposed attributes and coefficient of attributes and internal relationships were calculated.

Result: Compared to other methods random forest (RF) has higher performance (accuracy 99.26%, precision 99%, and area under the curve (AUC) 99%). The results of assessing the impact and correlation of variables using the RF method based on PCA indicated that pathology, biomarker, biochemistry, gene, and demographic factors with a coefficient of 0.35, 0.23, 0.15, 0.14, and 0.13 respectively, affected the risk of BC (r 2 = 0.54).

Conclusion: Breast cancer has several risk factors. Medical experts use these risk factors for early diagnosis. Therefore, identifying related risk factors and their effect can increase the accuracy of diagnosis. Considering the broad features for predicting breast cancer leads to the development of a comprehensive prediction model. In this study, using RF technique a breast cancer prediction model with 99.3% accuracy was developed based on multifactorial features.

Original languageEnglish
Pages (from-to)17133-17146
Number of pages14
JournalJournal of Cancer Research and Clinical Oncology
Volume149
Issue number19
Early online date29 Sept 2023
DOIs
Publication statusPublished - Dec 2023

Bibliographical note

Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Keywords

  • Breast cancer
  • Cancer prediction
  • Factor affecting
  • Machine learning

ASJC Scopus subject areas

  • Oncology
  • Cancer Research

Fingerprint

Dive into the research topics of 'Breast cancer prediction using different machine learning methods applying multi factors'. Together they form a unique fingerprint.

Cite this