Abstract
With the rapid growth of networked applications, XML data has become pervasive, making XML a mainstream data format and the de facto standard for data exchange and representation on the Internet. Because the real world is complex, uncertainty is a common intrinsic property of data, and uncertain information is therefore ubiquitous. Such information is usually represented as probability values in an XML document (called a probabilistic XML document), so the representation and processing of probabilistic XML data is emerging as a new research field. Since 2001, research on probabilistic XML data management has produced a series of results. This paper surveys progress in probabilistic XML data management from several perspectives, including probabilistic XML data models, PXML algebra, querying, and prototype systems, and discusses the main open problems and directions for further research.
Source
《计算机科学》
CSCD
Peking University Core Journal (北大核心)
2009, No. 11, pp. 14-17 (4 pages)
Computer Science
Funding
Supported by the Natural Science Foundation of Heilongjiang Province (F200601)