Combined Effect of Concept Drift and Class Imbalance on Model Performance During Stream Classification

下载PDF

导出

摘要 Every application in a smart city environment like the smart grid,health monitoring, security, and surveillance generates non-stationary datastreams. Due to such nature, the statistical properties of data changes overtime, leading to class imbalance and concept drift issues. Both these issuescause model performance degradation. Most of the current work has beenfocused on developing an ensemble strategy by training a new classifier on thelatest data to resolve the issue. These techniques suffer while training the newclassifier if the data is imbalanced. Also, the class imbalance ratio may changegreatly from one input stream to another, making the problem more complex.The existing solutions proposed for addressing the combined issue of classimbalance and concept drift are lacking in understating of correlation of oneproblem with the other. This work studies the association between conceptdrift and class imbalance ratio and then demonstrates how changes in classimbalance ratio along with concept drift affect the classifier’s performance.We analyzed the effect of both the issues on minority and majority classesindividually. To do this, we conducted experiments on benchmark datasetsusing state-of-the-art classifiers especially designed for data stream classification.Precision, recall, F1 score, and geometric mean were used to measure theperformance. Our findings show that when both class imbalance and conceptdrift problems occur together the performance can decrease up to 15%. Ourresults also show that the increase in the imbalance ratio can cause a 10% to15% decrease in the precision scores of both minority and majority classes.The study findings may help in designing intelligent and adaptive solutionsthat can cope with the challenges of non-stationary data streams like conceptdrift and class imbalance.

作者 Abdul Sattar Palli Jafreezal Jaafar Manzoor Ahmed Hashmani Heitor Murilo Gomes Aeshah Alsughayyir Abdul Rehman Gilal

机构地区 Department of Computer and Information Sciences Centre for Research in Data Science High Performance Cloud Computing Centre(HPC School of Engineering and Computer Science AI Institute Anti-Narcotics Force College of Computer Science and Engineering

出处《Computers, Materials & Continua》 SCIE EI 2023年第4期1827-1845,共19页 计算机、材料和连续体（英文）

基金 The authors would like to extend their gratitude to Universiti Teknologi PETRONAS (Malaysia)for funding this research through grant number (015LA0-037).

关键词 CLASSIFICATION data streams class imbalance concept drift class imbalance ratio

分类号 TP311.1 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

1徐佳忆(译).论文工厂:COPE和STM的研究报告[J].学术出版与传播,2022,1(1):68-77.
2Manju Subedi,Naresh Kazi Tamrakar.Fluvial Geomorphology and Basin Development of Karra Khola Basin, Hetauda, Central Nepal[J].Journal of Geological Research,2020,2(4):1-13.
3ANDREW CHIN.QUEEN SEA BIG SHARK:Chapter Three of the Beijing Surfers' Adventures[J].城市漫步（GBA版）,2016(4):34-35.
4Tan Cheng,Jielong Wang.Incremental Learning Based on Data Translation and Knowledge Distillation[J].International Journal of Intelligence Science,2023,13(2):33-47.
5Alaa Eisa,Nora E.L-Rashidy,Mohammad Dahman Alshehri,Hazem M.El-bakry,Samir Abdelrazek.Incremental Learning Framework for Mining Big Data Stream[J].Computers, Materials & Continua,2022(5):2901-2921.
6Faizan Rasheed,Yasir Saleem,Kok-Lim Alvin Yau,Yung-Wey Chong,Sye Loong Keoh.The Role of Deep Learning in Parking Space Identification and Prediction Systems[J].Computers, Materials & Continua,2023(4):761-784.
7李尚泽,韩天福(指导).I Have a Big Family[J].中学生英语,2023(9):7-7.
8Lily Wang.The 2023 Two Sessions:Boosting Confidence and Embarking on A New Journey[J].China's Foreign Trade,2023(2):8-17.
9刘婧,游才印,马丽,李云,马凌,田娜.In-plane current-induced magnetization reversal of Pd/CoZr/MgO magnetic multilayers[J].Chinese Physics B,2022,31(12):511-515.
10Mengmeng Han,Qiyan Wang,Xue Wang,Yuhui Zeng,Yong Huang,Qingqiang Meng,Jun Zhang,Xunbin Wei.Near infra-red light treatment of Alzheimer's disease[J].Journal of Innovative Optical Health Sciences,2018(1):57-64.

Computers, Materials & Continua

2023年第4期

浏览历史

内容加载中请稍等...

Combined Effect of Concept Drift and Class Imbalance on Model Performance During Stream Classification

相关作者

相关机构

相关主题

浏览历史