摘要
针对网络舆情检测的关键技术及应用做了介绍。按照舆情监控的处理流程对网络爬虫、网页消重、网页去噪、文本分类、文本聚类等技术做了细致的介绍。对各种技术分类介绍了一些常用的算法。接着对舆情挖掘的应用方向话题跟踪与检测和情感倾向分析做了介绍。最后分析了舆情监测在理论研究和应用上的发展趋势。
In this paper, the applications and key technologies of internet public sentiment were reviewed. The technologies of WebCrawler, duplicated webpage deletion, webpage cleaning, text categorization and text clustering were introduced. Presentations of these key technologies focus on reports of many algorithms that were applied to intemet public sentiment. Then TDT(Topic Detection and Tracking) and Opinion Mining were presented as applications of internet public sentiment. The future development trend of theory and application in internet public sentiment were analyzed finally.
出处
《软件》
2012年第12期322-326,共5页
Software
基金
海南省教育厅基金项目(Hjkj2011-37)
三亚市院地合作项目(2011YD19)