
Research on Extracting E-mail Content Based on the Structure of MIME Mail (cited by: 6)
Abstract: To extract e-mail content accurately, the paper analyzes the structure of e-mail messages in detail, summarizes the characteristic features of the message body, and designs an e-mail preprocessing system based on the MIME mail structure. The system uses block-wise processing and feature recognition to cope with the irregular formatting of e-mail, and it filters reply lines and advertising lines out of the message body, so that e-mail content can be extracted quickly and accurately.
Source: 《现代图书情报技术》 (New Technology of Library and Information Service), indexed in CSSCI and the Peking University Core Journals list, 2008, No. 5, pp. 85-88 (4 pages)
Keywords: Multipurpose Internet Mail Extensions (MIME); e-mail; preprocessing
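
The block-wise processing and reply/advertising-line filtering described in the abstract can be illustrated with a minimal sketch. It is not the authors' system: the abstract does not list the features they use, so the reply-line pattern `REPLY_LINE`, the `ad_markers` keywords, and the function names `body_parts` and `extract_content` are all assumptions made for illustration. Only Python's standard `email` package is used.

```python
import re
from email import message_from_string

# Assumed reply-line features: quoted lines, "On ... wrote:" headers,
# and "-----Original Message-----" separators. The paper's own features
# are not given in the abstract.
REPLY_LINE = re.compile(
    r"^\s*(?:>|On .+ wrote:\s*$|-{2,}\s*Original Message\s*-{2,})",
    re.IGNORECASE,
)

def body_parts(msg):
    """Yield the decoded text of each text/plain block in a MIME message."""
    for part in msg.walk():
        if part.is_multipart():
            continue  # containers carry no body text themselves
        if part.get_content_type() != "text/plain":
            continue  # skip HTML alternatives, images, and so on
        if part.get_content_disposition() == "attachment":
            continue  # skip attached text files
        payload = part.get_payload(decode=True)  # undo base64 / quoted-printable
        charset = part.get_content_charset() or "utf-8"
        try:
            yield payload.decode(charset, errors="replace")
        except LookupError:  # unknown charset label in a malformed message
            yield payload.decode("utf-8", errors="replace")

def extract_content(raw, ad_markers=("unsubscribe", "advertisement")):
    """Return body text with reply lines and (assumed) ad lines removed."""
    msg = message_from_string(raw)
    kept = []
    for text in body_parts(msg):
        for line in text.splitlines():
            if REPLY_LINE.match(line):
                continue
            if any(marker in line.lower() for marker in ad_markers):
                continue
            kept.append(line)
    return "\n".join(kept).strip()

SAMPLE = """\
From: a@example.com
To: b@example.com
Subject: Re: meeting
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="XX"

--XX
Content-Type: text/plain; charset="utf-8"

See you at 10.
On Mon, Bob wrote:
> Can we meet?
Click here to unsubscribe.
--XX
Content-Type: text/html; charset="utf-8"

<p>See you at 10.</p>
--XX--
"""

if __name__ == "__main__":
    print(extract_content(SAMPLE))
```

Walking the MIME tree first and filtering lines second keeps the two concerns separate: structural irregularities are handled per block, and the line-level features can be swapped out without touching the parsing step.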

References (3)

  • 1. Borenstein N, Freed N. MIME (Multipurpose Internet Mail Extensions) Part One: Mechanisms for Specifying and Describing the Format of Internet Message Bodies [S]. 1994.
  • 2. RFC 822: Standard for ARPA Internet Text Messages [EB/OL]. [2007-09-28]. http://www.ietf.org/rfc/rfc0822.txt?number=822.
  • 3. Carvalho V R, Cohen W W. Learning to Extract Signature and Reply Lines from Email [EB/OL]. [2007-09-28]. http://www.cs.cmu.edu/~wcohen/postscript/email-2004.pdf.

Co-cited documents: 43

Citing documents: 6

Second-level citing documents: 9
