摘要
经典Rough集理论是基于完备信息系统的。然而在实际应用中,由于数据存取或数据处理方面的原因,决策表经常是不完备的,即存在缺值。为了处理不完备信息系统,Kryszkiewicz提出了基于容差关系的Rough集模型。在该模型下进行知识约简时,现有的算法一般都采用构造区分矩阵和相应区分函数的方法。该方法虽然可以求得所有约简,然而业己证明这是一个NP-hard问题,因此实践中更为可行的方法是利用启发式搜索算法求出最优或次最优约简。在文中提出属性的重要性定义,并以此作为启发式信息,设计一种完备的知识约简算法。
The classic theory of Rough sets is based on incomplete information systems. In practicing, decision tables are, however, usually incomplete due to the causes of data outputting or processing. That is to say, there are often default values. In order to deal with incomplete systems, Kryszkiewicz puts a Rough sets model on the basis of error tolerance relations. According to this model, constructing discernibility matrixes and discernibility functions are the familiar approach by the current knowledge reduction algorithms. By this means, all reductions can work out. But it has been proved that it is a problem of "NT-hard". So it is more effective when a heuristic search algorithm is used to attain the most optimized or the second most optimized reduction. In this paper, the importance of attributes is defined and used as heuristic information. Then a complete knowledge reduction algorithm is put forward.
出处
《计算机与现代化》
2010年第3期170-172,共3页
Computer and Modernization
基金
国家863资助项目(2007FJ4080)