
Dynamic Huffman code-based compressing technology over XML data stream

Cited by: 3
Abstract: XML is the new generation of web markup language, and some applications deal with online, continuous, high-speed data streams. Because XML is self-describing, XML data streams contain a large amount of redundant structural data, and how to compress them has become a new research area. Starting from the structure of XML and analyzing the characteristics of XML data, the paper proposes an XML data stream compression algorithm based on dynamic Huffman coding (DHFXSC). A SAX parser parses the XML Schema to obtain the corresponding structural event stream of elements and attributes, a Huffman tree is built dynamically, and the Huffman codes matching the XML event stream are output, so that compression and decompression of the XML data stream are completed in real time.
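The pipeline described in the abstract (SAX events in, adaptive Huffman codes out) can be illustrated with a minimal Python sketch. It is not the DHFXSC algorithm itself, whose details the abstract does not give. As a stated simplification, the code table is rebuilt from running frequency counts before every symbol instead of using an incremental FGK or Vitter tree update, and the sketch parses an XML document rather than an XML Schema. The `NYT` escape symbol, the `S:`/`A:`/`E` event labels, and all function names are assumptions made for illustration.

```python
import heapq
import itertools
import xml.sax

NYT = "<NYT>"  # assumed escape symbol announcing a not-yet-seen event


class EventCollector(xml.sax.ContentHandler):
    """Collects structural SAX events (tags and attribute names) as symbols."""

    def __init__(self):
        super().__init__()
        self.events = []

    def startElement(self, name, attrs):
        self.events.append("S:" + name)
        for attr in attrs.getNames():
            self.events.append("A:" + attr)

    def endElement(self, name):
        self.events.append("E")


def structural_events(xml_text):
    """Run the SAX parser and return the list of structural event symbols."""
    handler = EventCollector()
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return handler.events


def build_codes(freq):
    """Rebuild a Huffman code table from the current frequency counts."""
    tie = itertools.count()  # unique tie-breaker so heap never compares nodes
    heap = [(w, next(tie), sym) for sym, w in sorted(freq.items())]
    heapq.heapify(heap)
    if len(heap) == 1:
        return {heap[0][2]: "0"}
    while len(heap) > 1:
        w1, _, left = heapq.heappop(heap)
        w2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (w1 + w2, next(tie), (left, right)))
    codes, stack = {}, [(heap[0][2], "")]
    while stack:
        node, prefix = stack.pop()
        if isinstance(node, tuple):  # internal node: (left, right)
            stack.append((node[0], prefix + "0"))
            stack.append((node[1], prefix + "1"))
        else:  # leaf: a symbol string
            codes[node] = prefix
    return codes


def encode(events):
    """Return (bit string, literals); each new event is sent once as a literal."""
    freq, bits, literals = {NYT: 1}, [], []
    for ev in events:
        codes = build_codes(freq)
        if ev in freq:
            bits.append(codes[ev])
        else:
            bits.append(codes[NYT])
            literals.append(ev)
            freq[ev] = 0
        freq[ev] += 1
    return "".join(bits), literals


def decode(bits, literals):
    """Mirror the encoder's model updates to recover the event sequence."""
    freq, events, pos = {NYT: 1}, [], 0
    pending = iter(literals)
    while pos < len(bits):
        inverse = {code: sym for sym, code in build_codes(freq).items()}
        end = pos + 1
        while bits[pos:end] not in inverse:
            end += 1
            if end > len(bits):
                raise ValueError("truncated or corrupt bit stream")
        sym, pos = inverse[bits[pos:end]], end
        if sym == NYT:
            sym = next(pending)
            freq[sym] = 0
        freq[sym] += 1
        events.append(sym)
    return events
```

For example, `encode(structural_events("<lib><book id='1'/><book id='2'/></lib>"))` returns a bit string plus the list of first-seen events, and `decode` applied to that pair reproduces the original event list. Because encoder and decoder update identical counts in the same order, no code table has to be transmitted, which is what makes single-pass streaming use possible.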
Source: Journal of Inner Mongolia University of Science and Technology (《内蒙古科技大学学报》), CAS, 2007, No. 4, pp. 331-336 (6 pages)
Funding: National Social Science Fund of China (07XTQ003)
Keywords: XML data stream; compression; dynamic Huffman; SAX; XML Schema

References (7)

  • [1] Hartmut L, Dan S. XMill: An efficient compressor for XML data [A]. Proc of the SIGMOD 2000 [C]. Texas: ACM Press, 2000. 153-164.
  • [2] Pankaj MT, Jayant RH. XGRIND: A query friendly XML compressor [A]. Proc of the ICDE 2002 [C]. San Jose: IEEE Computer Society, 2002. 225-234.
  • [3] Jun K M, Myung J P, Chin W C. XPRESS: A queriable compression for XML data [A]. Proc of the SIGMOD 2003 [C]. San Diego: ACM Press, 2003. 122-133.
  • [4] Wang Tengjiao, Gao Jun, Yang Dongqing, Tang Shiwei, Liu Yunfeng. An XML data stream compression method oriented to XPath execution [J]. Journal of Software, 2005, 16(5): 869-877. Cited by: 17
  • [5] Jeffery S V. Design and analysis of dynamic Huffman codes [J]. Journal of the ACM, 1987, 34(4): 825-845.
  • [6] Jacob Z, Abraham L. A universal algorithm for sequence data compression [J]. IEEE Trans. on Information Theory, 1977, 23(3): 337-343.
  • [7] Green T J, Miklau G, Onizuka M, et al. Processing XML streams with deterministic automata [A]. Proc of the Int'l Conf on Database Theory, LNCS 2572 [G]. Springer-Verlag, 2003. 173-189.

Secondary references (10)

  • [1] Hartmut L, Dan S. XMill: An efficient compressor for XML data. In: Weidong C, Jeffrey F, eds. Proc. of the SIGMOD 2000. Texas: ACM Press, 2000. 153-164.
  • [2] Pankaj MT, Jayant RH. XGRIND: A query friendly XML compressor. In: Proc. of the ICDE 2002. San Jose: IEEE Computer Society, 2002. 225-234.
  • [3] Jun KM, Myung JP, Chin WC. XPRESS: A queriable compression for XML data. In: Alon Y, Zachary G, eds. Proc. of the SIGMOD 2003. San Diego: ACM Press, 2003. 122-133.
  • [4] Jacob Z, Abraham L. A universal algorithm for sequence data compression. IEEE Trans. on Information Theory, 1977, 23(3): 337-343.
  • [5] Jeffery SV. Design and analysis of dynamic Huffman codes. Journal of the ACM, 1987, 34(4): 825-845.
  • [6] Jean LG. GZIP. 2003. HTTP://www.gzip.com
  • [7] SwissProt Data Set. 1998. http://www.cs.washington.edu/research/xmldatasets/data/SwissProt/SwissProt.xml
  • [8] NASA Data Set. 2001. http://www.cs.washington.edu/research/xmldatasets/data/nasa/nasa.xml
  • [9] Tree Bank Data Set. 2002. http://www.cs.washington.edu/research/xmldatasets/data/treebank/treebank_e.xml
  • [10] Angel LD, Douglas L. XML generator. 1999. http://www.alphaworks.ibm.com/tech/xmlgenerator

Co-citing literature (16)

Co-cited literature (20)

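  • [1] Gao Ningbo, Jin Hong, Wang Hong'an. Research on real-time compression methods for historical data [J]. Computer Engineering and Applications, 2004, 40(28): 167-170. Cited by: 13
  • [2] Wang Tengjiao, Gao Jun, Yang Dongqing, Tang Shiwei, Liu Yunfeng. An XML data stream compression method oriented to XPath execution [J]. Journal of Software, 2005, 16(5): 869-877. Cited by: 17
  • [3] Zhong Shiming, Shao Rui, Zhang Sheng, Zhu Cailian. An XML data stream compression method for location-based service systems [J]. Journal of Wuhan University of Technology (Transportation Science & Engineering), 2006, 30(1): 29-32. Cited by: 9
  • [4] MCCOWN S. Twelve major events in the future of the networking industry. Network World (网络世界), 2007, (8): 11.
  • [5] JAGADISH H V, MADAR J, NG R. Semantic compression and pattern extraction with fascicles [C]. In Proc. of the 25th Intl. Conf. on Very Large Data Bases. 1999.
  • [6] JAGADISH H V, NG R T, OOI B C, et al. ItCompress: an iterative semantic compression algorithm [C]. 20th International Conference on Data Engineering (ICDE'04). 2004: 646-657.
  • [7] BABU S, GAROFALAKIS M, RASTOGI R. Spartan: a model-based semantic compression system for massive data tables [C]. In Proc of the ACM SIGMOD'2001 International Conference on Management of Data. May, 2001.
  • [8] Hashemian R. Condensed table of Huffman coding, a new approach to efficient decoding [J]. IEEE Transactions on Communications, 2004, 52(1): 6-8.
  • [9] Sharma M. Compression using Huffman coding [J]. International Journal of Computer Science and Network Security, 2010, 10(5): 133-141.
  • [10] Wang Lei, Meng Zhaopeng, Liu Yaqiong. An improvement to a BWT compression algorithm based on LFU replacement [J]. Microcomputer Applications, 2008, 29(3): 80-83. Cited by: 3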

Citing literature (3)

Secondary citing literature (10)
