
Multi-Document Summarization Based on the Term-Sentence-Document Tri-layer Graph Model

Cited by: 6
Abstract: Graph models are widely used in multi-document summarization: sentences serve as vertices and the similarity between sentences serves as the edge weight, forming an undirected graph. This model, however, does not fully account for the weights of the terms within a sentence or for the document to which a sentence belongs. To address this, the paper proposes a tri-layer graph model based on terms, sentences, and documents, which makes full use of term-weight information and document-membership information when computing sentence similarity. Experimental results on the DUC 2003 and DUC 2004 data sets show that the proposed method outperforms the LexRank model and the document-sensitive graph model.
Source: Journal of Chinese Information Processing (《中文信息学报》), CSCD, Peking University Core Journal, 2014, No. 6, pp. 201-207 (7 pages)
Funding: National Natural Science Foundation of China (61272212, 61163006, 61203313)
Keywords: graph model; multi-document summarization; sentence similarity; term-sentence-document graph
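
The baseline that the abstract describes (sentences as vertices, pairwise similarity as edge weights) can be sketched roughly as follows. This is an illustrative LexRank-style approximation and not the paper's tri-layer model: the whitespace tokenizer, the raw term-frequency cosine similarity, the damping factor, and the function names `cosine` and `rank_sentences` are all assumptions made for this example.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_sentences(sentences, damping=0.85, iterations=50):
    """Score sentences with a PageRank-style walk over the similarity graph.

    Assumption: plain whitespace tokenization and unweighted term counts;
    the paper's term weights and document layer are NOT modeled here.
    """
    n = len(sentences)
    if n == 0:
        return []
    vectors = [Counter(s.lower().split()) for s in sentences]
    # Undirected weighted graph: weights[i][j] is the similarity of i and j.
    weights = [
        [cosine(vectors[i], vectors[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    out_sum = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        updated = []
        for i in range(n):
            # Each neighbor j passes on its score in proportion to edge weight.
            incoming = sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n)
                if out_sum[j] > 0
            )
            updated.append((1 - damping) / n + damping * incoming)
        scores = updated
    return scores
```

In this sketch a sentence that shares no terms with any other sentence keeps only the baseline score `(1 - damping) / n`, so it ranks below sentences that are similar to their neighbors. Per the abstract, the tri-layer model would replace the plain sentence-to-sentence similarity with one that also draws on term weights and document membership.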
