期刊文献+

网页内容过滤技术中的特征提取 被引量:1

Characteristic Extraction in Web Content Filtering
下载PDF
导出
摘要 有害的网络内容日益猖獗,为封锁色情网页,论文通过统计和分析,主要从四个方面提取色情网页的特征:网页的布局,因特网内容选择平台(PICS)等级评定应用,暗示性条文和文档内容。从这四个方面的特征能几乎完全区分色情网页和非色情网页,该基本框架也适用于过滤网上除色情以外的其它不益内容。 With the proliferation of harmful internet content ,in this paper characteristic of pornographic Web pages are extracted from four aspects to block pornographic Web pages:page layout format,platform for internet content selection(PICS),indicative terms and text-content.Nonpornographic and pornographic Web pages can be distinguished almost completely,this general framework is adaptable for filtering other objectionable Web material.
出处 《计算机工程与应用》 CSCD 北大核心 2004年第31期145-146,共2页 Computer Engineering and Applications
关键词 特征 网页的布局 PICS 暗示性条文 语义识别 feature,page layout format,platform for internet content selection(PICS),indicative terms ,text-content identi-fication
  • 相关文献

参考文献7

  • 1The Effects of Pornography and Sexual Messages. National Coalition for the Protection of Children & Families, Cicinnati,Ohio,http ://www.nationalcoalition.org
  • 2Y Yang. An Evalution of Statistical Approaches to Text Categorization[J].Information Retrieval, 1999; 1:69~90
  • 3G Troina,N Walker. Document Classification and Searching:A Neural Network Approach. ESA Bull,http://esapub.esrin.esa.it/bulletin/bullet87/troina87.htm, 1996; (87)
  • 4J Morris,G Hirst.Lexical cohesion computed by thesaural relations as an indicator of the structure of text[J].Computational Linguistics, 1991;17(1 ) :21~43
  • 5Stairmond,Mark A.A Computational Analysis of Lexical Cohesion with Applications in Information Retrieval[D].Ph D thesis. Center for Computational Linguistics, UMIST, Manchester, 1999
  • 6吕宏伟,甄鹏.自动生成超文本链接[J].武警工程学院学报,2000,16(6):29-31. 被引量:2
  • 7李人厚.智能控制理论和方法[M].西安:西安交通大学出版社,1995

二级参考文献1

  • 1G. Salton and J. Allan," Selective Text Utilization and Text Traversal,"Proc.Hypertext 1993, pp[]..1993

共引文献1

同被引文献26

引证文献1

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部