
A Survey on Word Clustering Technique (词聚类技术研究综述)

Cited by: 2
Abstract: Word clustering is a word-oriented clustering technique that is widely applied across natural language processing (NLP) tasks. This survey groups a selection of existing word clustering methods into three categories, according to whether they rely on grammatical, semantic, or pragmatic features, and summarizes the methods in each category.
Source: Digital Library Forum (《数字图书馆论坛》), 2010, No. 5, pp. 15-19 (5 pages)
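Funding: Supported by two projects of the National Key Technology R&D Program of the 11th Five-Year Plan, "Research and Implementation of the Integration and Service System of Knowledge Organization Systems" (2006BAH03803) and "Key Technology Research and Application Demonstration of Scientific and Technical Literature Information Service Systems" (2006BAH05806), and by the key work project of the Institute of Scientific and Technical Information of China, "Chinese Scientific and Technical Vocabulary System Construction and Application Project: Refinement of the New Energy Vehicle Domain and Domain Extension" (2009KP01-3-2).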
Keywords: word clustering; grammatical feature; semantic feature; pragmatic feature

References (38)

  • 1. THOM J A, ZOBEL J. A Model for Word Clustering[J]. Journal of the American Society for Information Science and Technology, 1992.
  • 2. BROWN P F, DELLA PIETRA V J, DESOUZA P V, LAI J C, MERCER R L. Class-Based n-gram Models of Natural Language[J]. Computational Linguistics, 1992.
  • 3. CHEN Langzhou, HUANG Taiyi. A Novel Word Clustering Algorithm and Variable-Length Statistical Language Model[J]. Chinese Journal of Computers, 1999, 22(9): 942-948. (in Chinese) Cited by: 17
  • 4. MORI S, NAGAO M. A Stochastic Language Model Using Dependency and Its Improvement by Word Clustering[C]// Université de Montréal, Government of Canada. Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics. Morristown, NJ, USA: Association for Computational Linguistics, 1998: 898-904.
  • 5. MCMAHON J G, SMITH F J. Improving Statistical Language Model Performance with Automatically Generated Word Hierarchies[J]. Computational Linguistics, 1996, 22(2): 217-247.
  • 6. BASSIOU N K, KOTROPOULOS C L. Interpolated Distanced Bigram Language Models for Robust Word Clustering[C]// Nonlinear Signal and Image Processing. [publisher unknown], 2005.
  • 7. MORI S, NISHIMURA M, ITOH N. Language Model Adaptation Using Word Clustering[J]. Joho Shori Gakkai Kenkyu Hokoku, 2003, 2003(14): 89-94.
  • 8. TUFIŞ D, ION R, IDE N. Fine-Grained Word Sense Disambiguation Based on Parallel Corpora, Word Alignment, Word Clustering and Aligned Wordnets[C]// Proceedings of the 20th International Conference on Computational Linguistics, Geneva, Switzerland. Association for Computational Linguistics, 2004: 1312-1318.
  • 9. JIN P, SUN X, WU Y, YU S. Word Clustering for Collocation-Based Word Sense Disambiguation[C]// Proceedings of the 8th International Conference on Computational Linguistics, Geneva, Switzerland. Association for Computational Linguistics, 2004.
  • 10. CHEN Jiong, ZHANG Yongkui. A Chinese Text Topic Extraction Method Based on Word Clustering[J]. Journal of Computer Applications, 2005, 25(4): 754-756. (in Chinese) Cited by: 17



