摘要
XML标记语言是新一代的WEB标记语言,一些应用面对的都是在线的、持续的高速数据流.而XML是自描述的,XML数据流中存在大量的冗余数据.如何压缩XML数据流成为一个新的研究领域.从XML的结构入手,分析XML数据的特点,提出了一种基于动态哈夫曼编码的XML数据流压缩算法(DHFXSC).利用SAX解析器对XML Schema进行解析,获得相应的结构事件流,动态构建哈夫曼树,输出与XML事件流匹配的哈夫曼编码,实时完成XML数据流的压缩和解压缩.
XML has become the web mark language for the new age whose data are online and continuous high-speed stream in some cases of applications.Tbere is much redundant structural information in the self-descrlbed XML data stream. A new area of research deals with the methods of compressing XML data stream. Starting with the structure of XML, the feature of XML was analyzed, and a new method of compression was proposed on the basis of dynamic Huffman codes .The XML schema was parsed with SAX parser, and the corresponding event sequence of elements and attributes were obtained. The Huffman tree were generated dynamically, and the Huffman codes matching with XML event sequence were outputted. The compression and decompression of XML data stream were completed in real time.
出处
《内蒙古科技大学学报》
CAS
2007年第4期331-336,共6页
Journal of Inner Mongolia University of Science and Technology
基金
国家社会科学基金资助项目(07XTQ003)