Abstract
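1 Introduction
Today, the information superhighway, with the Internet as its backbone, continues to spread and develop. The volume of information on the Internet far exceeds what people imagine. Faced with this ocean of information, people often feel helpless and at a loss, which is the so-called "information overload" problem. How can we help people effectively select and use the information that interests them, while protecting their right to privacy in their choice of information? This question has become a focus of great concern in both academia and industry. Therefore.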
This paper describes the background and future of text filtering and presents a concept-based Chinese text filtering model. The main idea of the model is as follows: users supply original keywords, and concept expansion is then performed automatically on these keywords to construct the user profiles. A user profile consists of sub-profiles, and the sub-profile matching ratio and the sub-profile similarity are defined. Together, these weaken the Boolean constraints, so that a text can still be matched when some sub-profiles do not match it, and they restrict the Vector Space Model so that it must match more sub-profiles. In addition, a passage-matching mechanism is applied to improve the efficiency of the filtering model.
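The abstract does not give the model's formulas, so the following is only a minimal sketch of the idea under stated assumptions. The concept table `CONCEPTS`, the function names, the overlap-based similarity, the `min_ratio` threshold, and the line-based passage split are all hypothetical choices made for illustration, not the authors' definitions. English tokens stand in for segmented Chinese words.

```python
import re

# Hypothetical concept table standing in for the concept expansion step
# (assumption: the paper's actual concept resource is not described here).
CONCEPTS = {
    "chinese": {"chinese", "mandarin"},
    "text": {"text", "document", "article"},
    "filtering": {"filtering", "filter", "screening"},
}

def expand(keyword):
    """Expand one user keyword into a sub-profile (a set of related terms)."""
    return CONCEPTS.get(keyword, set()) | {keyword}

def build_profile(keywords):
    """A user profile is a list of sub-profiles, one per original keyword."""
    return [expand(k) for k in keywords]

def tokenize(passage):
    """Assumed tokenizer; real Chinese text would need word segmentation."""
    return set(re.findall(r"\w+", passage.lower()))

def sub_profile_similarity(sub_profile, tokens):
    """Assumed similarity: share of the sub-profile's terms found in the passage."""
    return len(sub_profile & tokens) / len(sub_profile)

def score_passage(profile, tokens):
    """Return (matching ratio, mean similarity of the matched sub-profiles)."""
    if not profile:
        return (0.0, 0.0)
    sims = [sub_profile_similarity(sp, tokens) for sp in profile]
    matched = [s for s in sims if s > 0]
    ratio = len(matched) / len(profile)
    mean_sim = sum(matched) / len(matched) if matched else 0.0
    return (ratio, mean_sim)

def filter_text(profile, text, min_ratio=0.5):
    """Passage matching: score each non-empty line and keep the best passage.

    min_ratio = 1.0 behaves like a strict Boolean AND over sub-profiles;
    lower values let a text pass even when some sub-profiles do not match.
    """
    passages = [p for p in text.split("\n") if p.strip()]
    best = max((score_passage(profile, tokenize(p)) for p in passages),
               default=(0.0, 0.0))
    return best[0] >= min_ratio, best
```

For example, with `build_profile(["chinese", "text", "filtering"])`, a passage that mentions "filter" and "document" but nothing about Chinese matches two of the three sub-profiles. It is accepted at `min_ratio=0.5` and rejected at `min_ratio=1.0`, which illustrates how a matching ratio can sit between a strict Boolean match and an unconstrained vector-space score.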
Source
Computer Science (《计算机科学》)
CSCD
Peking University Core Journal (北大核心)
2000, No. 2, pp. 88-90, 82 (4 pages)
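Funding
National Natural Science Foundation of China, Grant No. 69675019
Doctoral Program Foundation of the State Education Commission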
Keywords
Chinese text
Text filtering model
Concept expansion
Information filtering
Text filtering, Boolean constraints, Vector space model, Concept expansion, Passage matching, User profiles, Fuzzy logic