International Journal of Advanced Computer Research (IJACR) ISSN (Print): 2249-7277 ISSN (Online): 2277-7970 Volume - 11 Issue - 57 November - 2021
KMK based hybrid approach for the performance estimation in case of diabetes data

Yasir Minhaj Khan and Animesh Kumar Dubey

Abstract

This paper combines the k-means clustering algorithm with k-points selection (KMK) and applies it to the PIMA Indian diabetes dataset. The approach is evaluated in terms of distance estimation, centroid selection, the effect of varying the data size, and analysis of the complete record. K-points selection is used to assign the initial centroids, and cluster selection was found to improve as a result. The results indicate that KMK improves both centroid selection and the distance measures used when assigning data points. The improvement is attributed to the k-points selection mechanism, which chooses centroids based on weight measures computed from the selected dataset. Consequently, the clusters obtained are better than those produced by standard k-means.
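
The idea of seeding k-means with weight-based k points can be illustrated with a minimal Python sketch. The abstract does not define the weight measure or the selection rule, so both are assumptions here: `weight` is a hypothetical score (the mean of the already scaled features), and `select_k_points` is a hypothetical rule that ranks rows by that score and takes the middle row of each of k equal slices. The toy data stand in for the PIMA records, and the sketch should not be read as the authors' implementation.

```python
import math


def weight(row):
    # Hypothetical weight measure (an assumption, not taken from the paper):
    # the mean of the row's features, which are assumed to be pre-scaled.
    return sum(row) / len(row)


def select_k_points(data, k):
    # Hypothetical k-points rule: rank rows by weight, then take the middle
    # row of each of k equal-sized slices as an initial centroid.
    ranked = sorted(data, key=weight)
    n = len(ranked)
    return [list(ranked[(2 * i + 1) * n // (2 * k)]) for i in range(k)]


def assign(data, centroids):
    # Label each row with the index of its nearest centroid (Euclidean).
    return [
        min(range(len(centroids)), key=lambda j: math.dist(row, centroids[j]))
        for row in data
    ]


def update(data, labels, centroids):
    # Recompute each centroid as the mean of its members; an empty cluster
    # keeps its previous centroid.
    new_centroids = []
    for j, old in enumerate(centroids):
        members = [row for row, lab in zip(data, labels) if lab == j]
        if members:
            new_centroids.append([sum(col) / len(members) for col in zip(*members)])
        else:
            new_centroids.append(old)
    return new_centroids


def kmeans(data, centroids, max_iter=100):
    # Standard Lloyd iterations starting from the supplied centroids.
    labels = assign(data, centroids)
    for _ in range(max_iter):
        centroids = update(data, labels, centroids)
        new_labels = assign(data, centroids)
        if new_labels == labels:
            break
        labels = new_labels
    return centroids, labels


def sse(data, labels, centroids):
    # Sum of squared distances from each row to its assigned centroid.
    return sum(math.dist(row, centroids[lab]) ** 2 for row, lab in zip(data, labels))


# Toy, pre-scaled rows standing in for diabetes records (illustrative only).
data = [[0.0, 0.1], [0.1, 0.0], [0.2, 0.1], [0.9, 1.0], [1.0, 0.9], [0.8, 1.0]]
initial = select_k_points(data, k=2)
centroids, labels = kmeans(data, initial)
print(labels, round(sse(data, labels, centroids), 4))
```

Because the seeds are derived deterministically from the data rather than drawn at random, repeated runs on the same records give the same clusters, which makes comparisons across data sizes easier to interpret. The sum of squared distances returned by `sse` is one possible way to compare such a seeding against randomly initialised k-means.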

Keywords

K-means, KMK, PIMA, Similarity score, Centroid estimation.

Cite this article

Khan YM, Dubey AK. KMK based hybrid approach for the performance estimation in case of diabetes data. International Journal of Advanced Computer Research. 2021;11(57):116-121. DOI:10.19101/IJACR.2021.1152043
