Exponential kernelized feature map Theil-Sen regression-based deep belief neural learning classifier for drift detection with data stream
Thangam M and A. Bhuvaneswari
Abstract
Data streams are potentially large and thus data stream classification tasks are not strictly stationary. In the process of data analysis, the fundamental structure may vary over time and the changes in the primary distribution of the data are known as drift. Early drift detection achieves better detection results in the evolving data stream analysis. In order to perform accurate drift detection with minimum time, a novel deep learning technique called exponential kernelized feature map Theil-Sen regression-based deep belief neural learning classifier (EKFMTR-DBNLC) was introduced. The main aim of the proposed EKFMTR-DBNLC technique is to perform multiple drift detection from the data stream using multiple layers. The proposed deep belief network comprises of various layers such as an input layer, an output layer and two hidden layers. The input layer receives the number of features and data from the dataset. The hidden layers perform the significant feature selection to reduce the drift detection time. The exponential kernelized semantic feature mapping technique is applied for identifying the significant feature for data classifications. Then, using Theil-Sen regression (TSR) function, the drifts in the data stream are detected and classified from the selected relevant features in the next hidden layer. The regression function analyzes the distribution of the data between the two-time intervals. Based on regression analysis, multiple drifts such as incremental drift, gradual drift, sudden drift and recurring drift are identified. Experimental estimation of the proposed EKFMTR-DBNLC technique and conventional methods are performed with different factors such as classification accuracy, precision, recall, F-score and drift detection time using real-world and synthetic datasets. The analyzed numerical result confirms that the proposed technique EKFMTR-DBNLC achieves 10% higher classification accuracy and also minimizes the time consumption by 13.5% than the conventional methods.
Keyword
Data stream classification, Drift detection, Feature mapping, Neural learning classifier, Regression function.
Cite this article
Thangam M, Bhuvaneswari A.Exponential kernelized feature map Theil-Sen regression-based deep belief neural learning classifier for drift detection with data stream. International Journal of Advanced Technology and Engineering Exploration. 2022;9(90):663-675. DOI:10.19101/IJATEE.2021.874851
Refference
[1]Khamassi I, Sayed-mouchaweh M, Hammami M, Ghédira K. Discussion and review on evolving data streams and concept drift adapting. Evolving Systems. 2018; 9(1):1-23.
[2]Krawczyk B, Minku LL, Gama J, Stefanowski J, Woźniak M. Ensemble learning for data stream analysis: a survey. Information Fusion. 2017; 37:132-56.
[3]Wares S, Isaacs J, Elyan E. Data stream mining: methods and challenges for handling concept drift. SN Applied Sciences. 2019; 1(11):1-19.
[4]Gama J, Medas P, Castillo G, Rodrigues P. Learning with drift detection. In Brazilian symposium on artificial intelligence 2004 (pp. 286-95). Springer, Berlin, Heidelberg.
[5]Wang X, Chen W, Xia J, Chen Z, Xu D, Wu X, et al. ConceptExplorer: visual analysis of concept drifts in multi-source time-series data. In conference on visual analytics science and technology 2020 (pp. 1-11). IEEE.
[6]Hatamikhah N, Barari M, Kangavari MR, Keyvanrad MA. Concept drift detection via improved deep belief network. In electrical engineering Iranian conference on 2018 (pp. 1703-7). IEEE.
[7]Shah SH, Rehman A, Rashid T, Karim J, Shah S. A comparative study of ordinary least squares regression and Theil-Sen regression through simulation in the presence of outliers. Journal of Science and Technology. 2016; 137-42.
[8]Hua Y, Guo J, Zhao H. Deep belief networks and deep learning. In proceedings of 2015 international conference on intelligent computing and internet of things 2015 (pp. 1-4). IEEE.
[9]Zheng X, Li P, Hu X, Yu K. Semi-supervised classification on data streams with recurring concept drift and concept evolution. Knowledge-Based Systems. 2021.
[10]Pratama M, Pedrycz W, Webb GI. An incremental construction of deep neuro fuzzy system for continual learning of nonstationary data streams. IEEE Transactions on Fuzzy Systems. 2019; 28(7):1315-28.
[11]Yan MM. Accurate detecting concept drift in evolving data streams. ICT Express. 2020; 6(4):332-8.
[12]Prasad KS, Rao AS, Ramana AV. Ensemble framework for concept-drift detection in multidimensional streaming data. International Journal of Computers and Applications. 2020:1-8.
[13]Mahdi OA, Pardede E, Ali N. KAPPA as drift detector in data stream mining. Procedia Computer Science. 2021; 184:314-21.
[14]Namitha K, Kumar GS. Learning in the presence of concept recurrence in data stream clustering. Journal of Big Data. 2020; 7(1):1-28.
[15]Liu A, Lu J, Zhang G. Concept drift detection via equal intensity k-means space partitioning. IEEE Transactions on Cybernetics. 2020; 51(6):3198-211.
[16]Bi X, Zhang C, Zhao X, Li D, Sun Y, Ma Y. CODES: efficient incremental semi-supervised classification over drifting and evolving social streams. IEEE Access. 2020; 8:14024-35.
[17]Chen D, Yang Q, Liu J, Zeng Z. Selective prototype-based learning on concept-drifting data streams. Information Sciences. 2020; 516:20-32.
[18]Mahdi OA, Pardede E, Ali N, Cao J. Diversity measure as a new drift detection method in data streaming. Knowledge-Based Systems. 2020.
[19]Singh VK, Verma S, Kumar M. Stream processing with concept drift for event identification in sensors enabled IoT environment. IEEE Sensors Journal. 2019; 19(24):12187-95.
[20]Ancy S, Paulraj D. Handling imbalanced data with concept drift by applying dynamic sampling and ensemble classification model. Computer Communications. 2020; 153:553-60.
[21]Altendeitering M, Dübler S. Scalable detection of concept drift: a learning technique based on support vector machines. Procedia Manufacturing. 2020; 51:400-7.
[22]Liu A, Lu J, Zhang G. Concept drift detection: dealing with missing values via fuzzy distance estimations. IEEE Transactions on Fuzzy Systems. 2020; 29(11):3219-33.
[23]Yang Z, Al-Dahidi S, Baraldi P, Zio E, Montelatici L. A novel concept drift detection method for incremental learning in nonstationary environments. IEEE Transactions on Neural Networks and Learning Systems. 2019; 31(1):309-20.
[24]Jedrzejowicz J, Jedrzejowicz P. GEP-based classifier with drift detection for mining imbalanced data streams. Procedia Computer Science. 2020; 176:41-9.
[25]Mehmood H, Kostakos P, Cortes M, Anagnostopoulos T, Pirttikangas S, Gilman E. Concept drift adaptation techniques in distributed environment for real-world data streams. Smart Cities. 2021; 4(1):349-71.
[26]Yuan Y, Wang Z, Wang W. Unsupervised concept drift detection based on multi-scale slide windows. Ad Hoc Networks. 2021.
[27]Priya S, Uthra RA. Deep learning framework for handling concept drift and class imbalanced complex decision-making on streaming data. Complex & Intelligent Systems. 2021:1-17.
[28]Oikarinen E, Tiittanen H, Henelius A, Puolamäki K. Detecting virtual concept drift of regressors without ground truth values. Data Mining and Knowledge Discovery. 2021; 35(3):726-47.
[29]Mayaki MZ, Riveill M. Autoregressive based drift detection method. arXiv preprint arXiv:2203.04769. 2022.
[30]Nikpour S, Asadi S. A dynamic hierarchical incremental learning-based supervised clustering for data stream with considering concept drift. Journal of Ambient Intelligence and Humanized Computing. 2022; 13(6):2983-300.