International Journal of Advanced Computer Research (IJACR) ISSN (Print): 2249-7277 ISSN (Online): 2277-7970 Volume - 14 Issue - 66 March - 2024
  1. 1
    Google Scholar
A hybrid random forest and k-nearest neighbors approach for breast cancer detection

Om Prakash Kumar and Animesh Kumar Dubey

Abstract

In this paper, a novel hybrid approach was presented combining random forest (RF) and k-nearest neighbors (kNN) for the classification of breast cancer data. RF is selected for its robustness against overfitting and its ability to handle high-dimensional data effectively, providing a measure of feature importance and generalizing well due to its ensemble nature. kNN is chosen for its simplicity and effectiveness in capturing local data patterns. Our hybrid RF-kNN approach involves feature importance weighting in kNN, dynamic k selection, polynomial feature expansion, and ensemble output combination. The Wisconsin breast cancer database (WBCD) is used for experimentation, evaluated using 10-fold cross-validation. Performance metrics include accuracy, precision, recall, and F1-score. The results demonstrate that the hybrid RF-kNN model outperforms individual models, achieving superior performance across all metrics and data splits. This highlights the robustness and effectiveness of the hybrid model in reducing false positives and correctly identifying patients with breast cancer, making it a reliable model for breast cancer detection.

Keyword

Breast cancer detection, Random forest, k-nearest neighbors, Hybrid model, Machine learning.

Cite this article

Kumar OP, Dubey AK.A hybrid random forest and k-nearest neighbors approach for breast cancer detection . International Journal of Advanced Computer Research. 2024;14(66):42-49. DOI:10.19101/IJACR.2024.1466003

Refference

[1]Ng AY, Oberije CJ, Ambrózay É, Szabó E, Serfőző O, Karpati E, et al. Prospective implementation of AI-assisted screen reading to improve early detection of breast cancer. Nature Medicine. 2023; 29(12):3044-9.

[2]Stergiopoulou D, Markou A, Strati A, Zavridou M, Tzanikou E, Mastoraki S, et al. Comprehensive liquid biopsy analysis as a tool for the early detection of minimal residual disease in breast cancer. Scientific Reports. 2023; 13(1):1258.

[3]Singh L, Alam A. An efficient hybrid methodology for an early detection of breast cancer in digital mammograms. Journal of Ambient Intelligence and Humanized Computing. 2024; 15(1):337-60.

[4]Abhisheka B, Biswas SK, Purkayastha B. A comprehensive review on breast cancer detection, classification and segmentation using deep learning. Archives of Computational Methods in Engineering. 2023; 30(8):5023-52.

[5]Gago A, Aguirre JM, Wong L. Machine learning system for the effective diagnosis and survival prediction of breast cancer patients. International Journal of Online & Biomedical Engineering. 2024; 20(2): 95-113.

[6]Hekal AA, Moustafa HE, Elnakib A. Ensemble deep learning system for early breast cancer detection. Evolutionary Intelligence. 2023; 6(3):1045-54.

[7]Khalid A, Mehmood A, Alabrah A, Alkhamees BF, Amin F, Alsalman H, et al. Breast cancer detection and prevention using machine learning. Diagnostics. 2023; 13(19):1-21.

[8]Dubey AK, Gupta U, Jain S. Analysis of k-means clustering approach on the breast cancer wisconsin dataset. International journal of computer assisted radiology and surgery. 2016; 11:2033-47.

[9]Abunasser BS, Al-hiealy MR, Zaqout IS, Abu-naser SS. Convolution neural network for breast cancer detection and classification using deep learning. Asian Pacific Journal of Cancer Prevention. 2023; 24(2):531-44.

[10]Prodan M, Paraschiv E, Stanciu A. Applying deep learning methods for mammography analysis and breast cancer detection. Applied Sciences. 2023; 13(7):1-18.

[11]Rachna, Choudhary C, Thakur J. A robust machine learning model for breast cancer prediction. Optimized Predictive Models in Healthcare Using Machine Learning. 2024:117-34.

[12]Kawina I, Amarendra K, Marapelli B. Deep learning and machine learning approach to breast cancer classification with random search hyperparameter tuning. International Journal of Intelligent Systems and Applications in Engineering. 2024; 12(16s):264-75.

[13]Saroğlu HE, Shayea I, Saoud B, Azmi MH, El-saleh AA, Saad SA, et al. Machine learning, IoT and 5g technologies for breast cancer studies: a review. Alexandria Engineering Journal. 2024; 89:210-23.

[14]Dianati-nasab M, Salimifard K, Mohammadi R, Saadatmand S, Fararouei M, Hosseini KS, et al. Machine learning algorithms to uncover risk factors of breast cancer: insights from a large case-control study. Frontiers in Oncology. 2024; 13:1276232.

[15]Lomboy KE, Hernandez RM. A comparative performance of breast cancer classification using hyper-parameterized machine learning models. International Journal of Advanced Technology and Engineering Exploration. 2021; 8(82):1080-101.

[16]Neelima G, Kanchanamala P, Misra A, Nugraha RA. Detection of breast cancer based on fuzzy logic. In international conference on advancement in data science, e-learning and information system 2023 (pp. 1-6). IEEE.

[17]Jain R, Kukreja V, Chattopadhyay S, Verma A, Sharma R. Radial basis function integrated with support vector machine model for breast cancer detection. In 2nd international conference on artificial intelligence and machine learning applications theme: healthcare and internet of things 2024 (pp. 1-5). IEEE.

[18]Khan RH, Miah J, Rahman MM, Tayaba M. A comparative study of machine learning algorithms for detecting breast cancer. In 13th annual computing and communication workshop and conference 2023 (pp. 647-52). IEEE.

[19]Sun F, Yang X. Advancing breast cancer diagnosis: a comprehensive study of machine learning algorithms on histological tumor characteristics. In international conference on data science & informatics 2023 (pp. 302-6). IEEE.

[20]Sawant A, Patil D, Khuman D, Pingle Y, Shinde V. Enhancing breast cancer detection: a machine learning approach for early diagnosis and classification. In 11th international conference on computing for sustainable global development 2024 (pp. 235-9). IEEE.

[21]Vasista VL, Sona K, Pedarla J, Sahithi B, Rao TK, Prakash KB. Predicting breast cancer using classical machine learning and deep learning algorithms. In international conference on intelligent and innovative technologies in computing, electrical and electronics 2023 (pp. 988-91). IEEE.

[22]Tinao MM, Rodriguez RB, Calibara ER. Breast cancer detection in the Philippines using machine learning approaches. In international conference on electronics, information, and communication 2024 (pp. 1-4). IEEE.

[23]Kumar R, Chaudhry M, Patel HK, Prakash N, Dogra A, Kumar S. An analysis of ensemble machine learning algorithms for breast cancer detection: performance and generalization. In 11th international conference on computing for sustainable global development 2024 (pp. 366-70). IEEE.

[24]Tripathi RP, Khatri SK, Van GD, Ather D. Unleashing the power of machine learning: a precision paradigm for breast cancer subtype classification using open-source data, with caution on dataset size and interpretability. In 6th international conference on contemporary computing and informatics 2023 (pp. 1004-8). IEEE.

[25]Latha DU, Mahesh TR. Analysis of deep learning and machine learning methods for breast cancer detection. In international conference on computer science and emerging technologies 2023 (pp. 1-6). IEEE.

[26]Manjunathan N, Gomathi N, Muthulingam S. Early detection of breast cancer using machine learning. In international conference on sustainable computing and smart systems 2023 (pp. 165-9). IEEE.

[27]https://archive.ics.uci.edu/dataset/17/breast+cancer+wisconsin+diagnostic. Accessed 27 October 2023.