摘要
论文分析了敏感信息过滤的重要性和常见的信息过滤手段,提出了一种基于Web挖掘的敏感信息过滤模型。该模型的主要思想是:采用Web挖掘技术对页面文字与图像内容以及用户访问行为特征进行分析,采用在线与离线分析相结合、并行处理等技术建立综合过滤体系,为建设文明、健康的网络环境提供技术保障。
This paper describes the importance of sensitive information filtering, introduces the typical methods of information filtering and proposes a sensitive information filtering model based on web mining. In order to create a health network environment, some new techniques are adopted by the new filter model. First. the text, image content and uses accessing features are extracted by using Web mining techniques, Second, some new techniques are adopted to construct integrated filtering architecture, such as on-line/off-line combination and parallel processing strategy
出处
《信息安全与通信保密》
2007年第1期69-71,共3页
Information Security and Communications Privacy
基金
北京市优秀人才培养专项经费资助项目(20042D0501504)
北京教委面上项目(KM200610005012)
关键词
敏感信息
过滤模型
WEB挖掘
sensitive information
filtering model
Web mining