An ensemble machine learning model to uncover potential sites of hazardous waste illegal dumping based on limited supervision experience

导出

摘要 With the soaring generation of hazardous waste(HW)during industrialization and urbanization,HW illegal dumping continues to be an intractable global issue.Particularly in developing regions with lax regulations,it has become a major source of soil and groundwater contamination.One dominant challenge for HW illegal dumping supervision is the invisibility of dumping sites,which makes HW illegal dumping difficult to be found,thereby causing a long-term adverse impact on the environment.How to utilize the limited historic supervision records to screen the potential dumping sites in the whole region is a key challenge to be addressed.In this study,a novel machine learning model based on the positive-unlabeled(PU)learning algorithm was proposed to resolve this problem through the ensemble method which could iteratively mine the features of limited historic cases.Validation of the random forest-based PU model showed that the predicted top 30%of high-risk areas could cover 68.1%of newly reported cases in the studied region,indicating the reliability of the model prediction.This novel framework will also be promising in other environmental management scenarios to deal with numerous unknown samples based on limited prior experience.

作者 Jinghua Geng Yimeng Ding Wenjun Xie Wen Fang Miaomiao Liu Zongwei Ma Jianxun Yang Jun Bi

机构地区 State Key Laboratory of Pollution Control and Resource Reuse

出处《Fundamental Research》 CAS CSCD 2024年第4期972-978,共7页 自然科学基础研究（英文版）

基金 the National Natural Science Foundation of China(71761147002,71921003,and 52270199) Jiangsu R&D Special Fund for Carbon Peaking and Carbon Neutrality(BK20220014) State Key Laboratory of Pollution Control and Resource Reuse(PCRRZZ-202109).

关键词 Hazardous waste Illegal dumping site Positive-unlabeled machine learning Probability prediction Model interpretation

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献2

1zhi-hua zhou.A brief introduction to weakly supervised learning[J].National Science Review,2018,5(1):44-53. 被引量：106
2张成,张后虎,申秀芳,赵泽华,焦少俊.长江经济带固体废物污染防治和管理研究[J].环境保护,2018,46(16):22-28. 被引量：19

二级参考文献2

1周志华.Multi-Instance Learning from Supervised View[J].Journal of Computer Science & Technology,2006,21(5):800-809. 被引量：12
2WANG Wei,ZHOU Zhi-Hua.Crowdsourcing label quality: a theoretical analysis[J].Science China(Information Sciences),2015,58(11):109-120. 被引量：6

共引文献123

1孙朝云,裴莉莉,徐磊,李伟,杜耀辉.基于DS-LOF与GA-XGBoost的路域环境感知数据智能检测与修复[J].中国公路学报,2023,36(4):15-26. 被引量：2
2罗益超,李争彦,张奇.基于句子选择的关键短语生成[J].中文信息学报,2021,35(8):64-72.
3郝昕毓,周建涛,王昊.表格单元格分类的端到端不完全监督方法[J].计算机与数字工程,2023,51(1):59-65.
4宋闯,赵佳佳,王康,梁欣凯.面向智能感知的小样本学习研究综述[J].航空学报,2020(S01):15-28. 被引量：16
5樊艺,吴章勇.WTO与我国商业银行的业务拓展[J].现代商业银行导刊,2000(6):22-25. 被引量：1
6商立军,臧益民,王四旺.耐钙心肌细胞的分离及基本电生理特性[J].第四军医大学学报,2000,21(2):247-249. 被引量：12
7朱越,姜远,周志华.一种基于多示例多标记学习的新标记学习方法[J].中国科学：信息科学,2018,48(12):1670-1680. 被引量：6
8刘尚旺,郜翔.基于深度模型迁移的细粒度图像分类方法[J].计算机应用,2018,38(8):2198-2204. 被引量：5
9熊智翔,陆青,王胤.使用少量有标签样本学习的方法[J].计算机应用,2018,38(A02):11-15.
10张慧,高吉喜,乔亚军.长江经济带生态环境形势和问题及建议[J].环境与可持续发展,2019,44(5):28-32. 被引量：33

1Reported Cases and Deaths of National Notifiable Infectious Diseases-China,April 2024[J].China CDC weekly,2024,6(25):617-618.
2Reported Cases and Deaths of National Notifiable Infectious Diseases—China,March 2024[J].China CDC weekly,2024,6(22):535-536.
3Reported Cases and Deaths of National Notifiable Infectious Diseases-China,February 2024[J].China CDC weekly,2024,6(17):383-384.
4Wang Zixuan,Miao Cheng,Xu Yuhua,Li Zeyi,Sun Zhixin,Wang Pan.Trusted Encrypted Traffic Intrusion Detection Method Based on Federated Learning and Autoencoder[J].China Communications,2024,21(8):211-235.
5Reported Cases and Deaths of National Notifiable Infectious Diseases—China,June 2024[J].China CDC weekly,2024,6(34):883-884.
6Paula Chen,Jerome Darbon,Tingwei Meng.Lax-Oleinik-Type Formulas and Efficient Algorithms for Certain High-Dimensional Optimal Control Problems[J].Communications on Applied Mathematics and Computation,2024,6(2):1428-1471.
7Wen-Xiu Ma.A combined Liouville integrable hierarchy associated with a fourth-order matrix spectral problem[J].Communications in Theoretical Physics,2024,76(7):1-8.
8Howard Heaton,Samy Wu Fung,Stanley Osher.Global Solutions to Nonconvex Problems by Evolution of Hamilton-Jacobi PDEs[J].Communications on Applied Mathematics and Computation,2024,6(2):790-810.
9Zhen Wang,Qingzhi Liu,Li Mei,Junlin Guo,Xiaolong Gao,Bi Zhang,Chang Cai,Yipeng Sun,Xiaoyu Feng,Yongqiang Wang.Risk factors and molecular epidemiology of canine rabies in Beijing[J].One Health Advances,2023(1):175-183.
10Heng Chen,Zhenhua Chen,Liwen Hu,Fengzhu Tang,Dan Kuang,Jiayi Han,Yao Wang,Xiao Zhang,Yue Cheng,Jiantong Meng,Rong Lu,Lan Zhang.Application of wastewater-based epidemiological monitoring of COVID-19 for disease surveillance in the city[J].Frontiers of Environmental Science & Engineering,2024,18(8):81-87.

Fundamental Research

2024年第4期

浏览历史

内容加载中请稍等...

An ensemble machine learning model to uncover potential sites of hazardous waste illegal dumping based on limited supervision experience

参考文献2

二级参考文献2

共引文献123

相关作者

相关机构

相关主题

浏览历史