摘要
针对如何有效利用我国当前农业网站海量数据信息为我国农业生产服务的难题,结合数据挖掘的相关理论和知识,引入HTML文本抽取技术对Web农业数据信息进行抽取,从而构建一个经处理后的数据仓库,再设计数据挖掘流程对数据进行挖掘。最终,以某蔬菜网为例对上述数据挖掘模型进行验证。结果表明,引入的数据挖掘算法在农产品价格等信息的挖掘方面与实际信息相符,验证了构建的数据处理和挖掘方法的有效性,为当前农业信息服务提供了借鉴与参考依据。
Aiming at problem of how to effectively use massive data information of China' s current agricultural website to serve China's agricultural production,combined with relevant theories and knowledge of data mining,HTML text extraction technology was introduced to extract Web agricultural data information,so as to build a data warehouse after data mining process for data mining was processed and designed. Finally,above data mining model was verified by taking a vegetable web as an example.Research results showed that data mining algorithm was consistent with actual information in aspect of agricultural product price and other information mining. At the same time,validity of data processing and mining methods constructed was verified. It would provide reference and reference basis for current agricultural information service.
作者
刘彩利
LIU Caili(Xi'an International University,Xi'an Shaanxi 710077,China)
出处
《农业工程》
2018年第8期54-57,共4页
AGRICULTURAL ENGINEERING
关键词
数据挖掘
农业信息
HTML文本
抽取技术
价格预测
data mining
agricultural information
HTML text
extraction technology
price prediction