Abstract
This paper proposes an automatic text summarization method that takes the sentence as the basic extraction unit and weights each sentence by the interest-topic terms it contains. Sentences are clustered using Latent Semantic Indexing (LSI) to obtain the latent semantic structure of the document, which plays an important role in improving summary quality. A relatively objective and effective method for evaluating automatic summaries is also presented. Experiments show that the proposed approach is effective.
Source
Journal of Computer Applications (《计算机应用》)
CSCD
Peking University Core Journals (北大核心)
2007, No. 2, pp. 459-462, 465 (5 pages)
Funding
Key Science and Technology Project of the Science and Technology Commission of Shanghai Municipality (055115001)
Keywords
interest topic
automatic text summarization
semantic structure
summarization evaluation