期刊文献+

面向倾向性分析的基于词聚类的基准词选择方法 被引量:7

Paradigm words selecting method based on word clustering for sentiments analysis
下载PDF
导出
摘要 现有的基准词选择方法存在着随机性和主观性的缺陷,提出了一种基于词聚类的基准词的选择方法:从目标领域本体中选出一组初始种子词进行扩展,聚类得出二代种子词,对二代种子词再进行扩展、聚类,依次迭代直至得到最优的聚类种子词,并作为最终选取的基准词。实验结果表明该方法提取的基准词在词的情感倾向分类中具有较高的准确率。 This paper put forward a method of selecting paradigm words, which was based on the existing randomness and sub- jectivity issue. Firstly, it expanded words by a group of selected initial seed words;secondly, it obtained the second generation of seed words by means of hierarchical clustering. According to the similarity between two different expanded words, then it ex- panded and clustered the second generation seed words. At last it orderly iterated by same procedure to get the optimal cluste- ring seed words as the final selected paradigm words. The experiment result indicates that the new method has a higher accuracy in selecting paradigm words while classifying the different emotional proclivities.
出处 《计算机应用研究》 CSCD 北大核心 2011年第1期114-116,共3页 Application Research of Computers
基金 高等院校博士点基金资助项目(20090111110016) 合肥工业大学科学研究发展基金资助项目(2010HGXJ0009)
关键词 基准词 词汇情感倾向 词的相似度 词的聚类 领域本体 paradigm word word sentiment orientation word similarity word clustering domain ontology
  • 相关文献

参考文献9

二级参考文献72

  • 1朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:326
  • 2曹泽文,钱杰,张维明,邓苏.一种改进的本体映射方法[J].科学技术与工程,2006,6(19):3078-3082. 被引量:11
  • 3徐琳宏,林鸿飞,杨志豪.基于语义理解的文本倾向性识别机制[J].中文信息学报,2007,21(1):96-100. 被引量:123
  • 4许伟.句法-语义一体化的汉语句法分析研究[硕士学位论文].北京:清华大学,1997..
  • 5边肇祺.模式识别[M].北京:清华大学出版社,1997..
  • 6王根,赵军.中文褒贬义词语倾向性的分析[C].第三届学生计算语言学研讨会论集,2006:81-85.
  • 7PETER D.Turney.Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews[C]//Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL)//Philadelphia,PA,USA.2002; 417-424.
  • 8PETER D.Turney and MICHAEL L.Littman.Measuring praise and criticism:inference of semantic orientation from association[J].ACM Transactions on Information Systems,2003,21(4):315-346.
  • 9PETER D.Turney and MICHAEL L.Littman.Unsupervised learning of semantic orientation from a hundred-billion-word corpus[R].Tech.Rep.EGB-1094,National Research Council Canada:2002.
  • 10DAVE K.,LAWRENCE S.,and PENNOCK D..Mining the peanut gallery.,opinion extraction and semantic classification of product reviews[C]//Proceedings of the 22nd International World Wide Web Conference.Budapest,Hungary:2003.

共引文献392

同被引文献56

  • 1朱嫣岚,闵锦,周雅倩,黄萱菁,吴立德.基于HowNet的词汇语义倾向计算[J].中文信息学报,2006,20(1):14-20. 被引量:326
  • 2路斌,万小军,杨建武,等.基于同义词词林的词汇褒贬计算[C]//中国计算技术与语言问题研究-第七届中文信息处理国际会议论文集.北京:电子工业出版社,2007:17-23.
  • 3董振东,董强.知网[DB/OL].[2009-03-15].http://www.keenage.com.
  • 4刘群,李素建.基于《知网》的词汇语义相似度的计算[C].台北:第三届汉语词汇语义学研讨会,2002.
  • 5PETER D T. Thumbs up or thumbs down? Semantic orienta- tion applied to unsupervised classification of reviews[ C]//Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. Philadelphia, USA, 2002: 417-424.
  • 6Ku L W,Lo Y S,Chen H H.Using Polarity Scores of Words for Sentence-level Opinion Extraction [C]//Proceedings of the 6th NTCIR Workshop Meeting.Tokyo,Japan:[s.n.],2007:316-322.
  • 7董振东,董强.知网[EB/ OL].(2011-06-23).http:// www.keenage.com.
  • 8Kang J H,Lerman K,Plangprasopchok A.Analyzing Microblogs with Affinity Propagation[C]//Proceedings of the 1st KDD Workshop on Social Media Analytic.New York,USA:ACM Press,2010:67-70.
  • 9Ramage D,Dumais S,Liebling D.Characterizing Microblogs with Topic Models [C]//Proceedings of International AAAI Conference on Weblogs and Social Media.Menlo Park,USA:AAAI Press,2010:130-137.
  • 10Kaji N,Kitsuregawa M.Building Lexicon for Sentiment Analysis from Massive Collection of HTML Documents[C]// Proceedings of EMNLP-CoNLL 2007.Prague,Czech:[s.n.],2007:1075-1083.

引证文献7

二级引证文献23

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部