Abstract
With the rapid development of Internet technology, the methods and channels for publishing information are constantly changing. At the same time, the focus of public attention shifts in real time and the scope of Internet information supervision and management keeps expanding, which places new requirements on the collection, maintenance, and management of Internet information. This article studies the fusion of multi-source heterogeneous Internet information data: it standardizes the data, establishes a unified, integrated big data platform, and analyzes the various types of data in depth. Supported by multi-channel data sources and built according to big data construction concepts and architectural principles, the platform fuses, stores, and processes the various types of pushed data.
Authors
岳婧文
李晓霞
秦少林
Yue Jingwen; Li Xiaoxia; Qin Shaolin (National Computer Network Emergency Technical Processing and Coordination Center, Inner Mongolia Branch, Hohhot, Inner Mongolia 010070, China; Inner Mongolia Yunke Data Service Co., Ltd., Hohhot, Inner Mongolia 010070, China)
Source
《长江信息通信》
2021, Issue 9, pp. 119-122 (4 pages)
Changjiang Information & Communications
Keywords
Internet Information Management
Big Data
Data Extraction
Multi-source Heterogeneous
Data Fusion