Abstract
With the widespread deployment of new smart meters, machine learning methods can be used to accurately detect electricity theft by grid users. To this end, three commonly used machine learning algorithms for large-scale data were selected for analysis: support vector machine, random forest, and gradient boosting decision tree. The efficiency and accuracy of the three algorithms were tested by repeatedly adjusting the size of the experimental data set. The comparison shows that the running time of the random forest algorithm is roughly linear in the amount of data, so it is efficient, and its accuracy remains stable above 86%, giving it good overall performance.
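The comparison described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' experimental code: it assumes scikit-learn is available, uses synthetic imbalanced data from `make_classification` as a stand-in for real meter readings, and the function name `benchmark`, the feature counts, the 10% positive rate, and the model hyperparameters are all hypothetical choices made for the example.

```python
import time

from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC


def benchmark(sizes, seed=0):
    """Time training and score held-out accuracy for each model and data size."""
    results = []
    for n in sizes:
        # Synthetic, imbalanced stand-in for meter data (assumption):
        # roughly 10% of samples carry the positive ("theft") label.
        X, y = make_classification(
            n_samples=n, n_features=20, n_informative=8,
            weights=[0.9, 0.1], random_state=seed,
        )
        X_train, X_test, y_train, y_test = train_test_split(
            X, y, test_size=0.3, stratify=y, random_state=seed
        )
        # The three algorithm families named in the abstract,
        # with illustrative (not tuned) settings.
        models = {
            "svm": SVC(kernel="rbf"),
            "random_forest": RandomForestClassifier(n_estimators=100, random_state=seed),
            "gbdt": GradientBoostingClassifier(random_state=seed),
        }
        for name, model in models.items():
            start = time.perf_counter()
            model.fit(X_train, y_train)
            elapsed = time.perf_counter() - start
            acc = accuracy_score(y_test, model.predict(X_test))
            results.append(
                {"model": name, "n": n, "fit_seconds": elapsed, "accuracy": acc}
            )
    return results


if __name__ == "__main__":
    # Vary the data set size and print one row per (model, size) pair.
    for row in benchmark([1000, 2000, 4000]):
        print(row)
```

Plotting `fit_seconds` against `n` for each model is one way to inspect how training time scales with data size. Timings and accuracies on synthetic data will not match the figures reported in the paper.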
Source
《上海电力学院学报》
CAS
2017, No. 4, pp. 389-393 (5 pages)
Journal of Shanghai University of Electric Power
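Funding
National Natural Science Foundation of China (61403247)
Open Project of the Shanghai Key Laboratory of Integrated Administration Technologies for Information Security (AGK2015 005)
Local Capacity Building Project of the Science and Technology Commission of Shanghai Municipality (15110500700)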
Keywords
electricity theft
smart meter
random forest
support vector machine
gradient boosting decision tree