
Automatic Summarization Method Based on Extracting Sentences from Local Topics (cited by: 5)
Abstract Automatic summarization is an important task in natural language processing. This paper proposes a method for automatic summarization of Chinese text based on extracting key sentences from local topics. The document is first divided into local topic units by hierarchical segmentation, and a fixed number of sentences is then extracted from each unit to serve as summary sentences. Because the semantic structure of the document is analyzed in advance, the method avoids redundant summary sentences and does not overlook topics that occupy only a small part of the text. Experimental results show that the method is effective.
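Source Computer Engineering (《计算机工程》), indexed in CAS, CSCD, and the Peking University Core Journals list; 2008, No. 22, pp. 49-51 (3 pages)
Funding National Natural Science Foundation of China (grants 60773167, 60673040)
Keywords automatic summarization; topic segmentation; local topic unit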
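
The abstract describes a two-stage pipeline: segment the document into local topic units, then pick representative sentences from each unit. The record gives no algorithmic detail, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes sentences are already tokenized (Chinese word segmentation is out of scope), that segmentation merges the most similar adjacent segments bottom-up, and that a sentence is "representative" when its term vector is close to its unit's centroid. The names `cosine`, `segment`, and `summarize` and the parameters `num_units` and `per_unit` are hypothetical.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
        math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def segment(sentences, num_units):
    """Merge adjacent segments bottom-up until num_units remain.

    sentences: list of token lists (assumed pre-tokenized).
    Returns (start, end) index ranges, end exclusive, in document order.
    """
    bounds = [(i, i + 1) for i in range(len(sentences))]
    vectors = [Counter(s) for s in sentences]
    while len(bounds) > max(1, num_units):
        # Pick the most similar pair of neighbouring segments and merge it.
        best = max(range(len(bounds) - 1),
                   key=lambda i: cosine(vectors[i], vectors[i + 1]))
        bounds[best:best + 2] = [(bounds[best][0], bounds[best + 1][1])]
        vectors[best:best + 2] = [vectors[best] + vectors[best + 1]]
    return bounds


def summarize(sentences, num_units=2, per_unit=1):
    """Return indices of the chosen summary sentences, in document order."""
    chosen = []
    for start, end in segment(sentences, num_units):
        # The unit centroid is the summed term frequencies of its sentences.
        centroid = Counter()
        for s in sentences[start:end]:
            centroid.update(s)
        ranked = sorted(range(start, end),
                        key=lambda i: cosine(Counter(sentences[i]), centroid),
                        reverse=True)
        chosen.extend(ranked[:per_unit])
    return sorted(chosen)
```

Taking a fixed quota from every unit, rather than ranking all sentences globally, is what keeps a topic that covers only a few sentences represented in the summary, which is the balance property the abstract claims.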

References (5)

  • 1. Gong Y, Liu X. Generic Text Summarization Using Relevance Measure and Latent Semantic Analysis[C]//Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New Orleans, Louisiana, USA: [s. n.], 2001.
  • 2. Yang Xiaolan, Zhong Yixin. Research and Implementation of an Automatic Summarization System Based on Text Understanding[J]. Acta Electronica Sinica, 1998, 26(7): 155-158. Cited by: 17
  • 3. Hu Po, He Tingting, Ji Donghong. Research on Chinese Automatic Summarization Based on Topic Region Discovery[J]. Computer Science, 2005, 32(1): 177-181. Cited by: 5
  • 4. Yaari Y. Segmentation of Expository Texts by Hierarchical Agglomerative Clustering[C]//Proceedings of the RANLP'97. Tzigov Chark, Bulgaria: [s. n.], 1997.
  • 5. Mani I. Summarization Evaluation: An Overview[C]//Proceedings of the NTCIR Workshop Evaluation of Chinese and Japanese Text Retrieval and Text Summarization. Tokyo, Japan: National Institute of Informatics, 2001.

Secondary references (14)

  • 1. Liu Jianzhou, He Tingting, Ji Donghong. Automatic Extraction of Chinese Terms from Open Corpora[A]. Proceedings of the 20th International Conference on Computer Processing of Oriental Languages[C]. 2003: 43-49.
  • 2. Nomoto T, Matsumoto Yuji. A New Approach to Unsupervised Text Summarization. In: Proc. of ACM SIGIR'01, 2001. 26-34
  • 3. Gong Yihong, Liu Xin. Generic Text Summarization Using Relevance Measure and Latent Semantic Analysis. In: Proc. of ACM SIGIR'01, 2001. 19-25
  • 4. Pantel P, Lin Dekang. Document Clustering with Committees. In: Proc. of ACM SIGIR'02, 2002. 199-206
  • 5. Mitra P, Murthy C A, Pal S K. Unsupervised Feature Selection Using Feature Similarity. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002. 1-13
  • 6. Mani I. Summarization Evaluation: An Overview. In: Proc. of the NTCIR Workshop 2 Meeting on Evaluation of Chinese and Japanese Text Retrieval and Text Summarization. Tokyo: National Institute of Informatics, 2001
  • 7. Mani I. Recent Developments in Text Summarization. In: Proc. of CIKM'01, 2001. 529-531
  • 8. Kaufmann L, Rousseeuw P J. Clustering by Means of Medoids. In: Dodge Y, ed. Statistical Data Analysis Based on the L1 Norm. Amsterdam, 1987. 405-416
  • 9. Rissanen J. Modeling by the shortest description. Automatica, 1978(14): 465-471
  • 10. Wang Jianbo. Journal of Chinese Information Processing, 1992, 6(2)

Co-citing documents (20)

Co-cited documents (41)

Citing documents (5)

Secondary citing documents (28)
