期刊文献+

流体系结构指令存储器优化设计研究 被引量:3

Optimized Design Research of Instruction Memory for Stream Architecture
下载PDF
导出
摘要 针对流体系结构中VLIW代码体积对指令存储器的容量和功耗带来的问题,本文通过分析流处理器的指令特征,提出了一种新的VLIW分域压缩技术.在此基础上,本文为流体系结构设计了分布式的片上指令存储器,并提出了SIMD流水的执行模式.实验结果证明,该技术减少了38%的片外指令访存,降低约65%的片上指令存储器空间需求;分布式指令存储器减少了约37%的片上指令存储器面积,使得MASA的系统面积降低了8.92%,并降低了61%的指令存储器功耗. The huge VLlW code size has brought serious problems to the capacity and energy consumption of instruction memory in stream architecture. Through analyzing the characteristics of instruction, this paper proposes a novel domain-divided VLIW compression scheme, designs a distributed on-chip instruction memory for slream architecture and proposes a new SIMD pipeline execution model. The experiment results show that about 38 % of the off-chip instruction accessing and 65 % of the on-chip instruction memory space demand can be reduced by the compression; the distributed instruction memory depresses about 37% of the area of on-chip instruction memory, and about 8.92 % of the area of MASA stream processor. At the same time, the energy con- sumption of the instruction memory is reduced by 61%.
出处 《电子学报》 EI CAS CSCD 北大核心 2012年第7期1379-1385,共7页 Acta Electronica Sinica
基金 国家自然科学基金重点项目(No.61033008) 国家自然科学基金(No.60903041 No.61103080) 教育部博士点基金(No.20104307110002) 国防科学技术大学杰出青年基金(No.CJ11-06-01)
关键词 流体系结构 分布式指令存储器 VLIW压缩 stream architecture distributed instruction memory very long instruction word compression
  • 相关文献

参考文献17

  • 1R Banakar, et al. Scratchpad memory: design alternative for cache on-chip memory in embedded systems[A]. Proceedings of the 10th International Symposium on Hardware/Software Codesign[C]. Estes Park, USA: IEEE Computer Society, 2002. 73 - 78.
  • 2袁国兴,邵京云.评几种高档微处理器在运算科学计算问题时的性能[EB/OL].北京应用物理与计算数学研究所,http://www.ccw.com.cn,2003,4.
  • 3Mattan Erez. MERRIMAC high-performance and highly-efficient scientific computing with streants [D]. Palo Alto, CA: Stanford University ,2007.
  • 4James Balfour, et al. An energy-efficient processor architecture for embedded systems [J]. IEEE Computer Architecture Letters,2008,7(1) :29 - 32.
  • 5J Liu, et al. Analysis and characterization of Intel Itanium instruction bundles for improving VLIW processor performance [A] .Proceedings of the First International Multi-Symposiums on Computer and Computational Sciences[C]. NY, USA: IEEE Computer Society, 2006.389 - 396.
  • 6YuriV Ivanov, C J Bleakley. Dynamic complexity scaling for real-time H.264/AVC video encoding[A] .Proceedings of the 2007 ACM International Conference on MultiMedia[C]. Augsburg, Germany. ACM Press, 2007.962 - 970.
  • 7赖明澈,王志英,戴葵,高蕾.基于代码特征分析的TTA指令压缩技术与解压部件实现[J].电子学报,2008,36(11):2234-2238. 被引量:3
  • 8Talal Bonny, Jorg Henkel. LICT- Left-uncompressed instructions compression technique to improve the decoding performance of VLlW processors [ A ]. Proceedings of the 46th ACM/IEEE Design Automation Conference[ C]. San Francisco, USA: ACM Press,2009.903 - 906.
  • 9Zhiguo Ge, et al. A DVS-based pipelined reconfigurable instruction memory [ A ]. Proceedings of the 46th ACM/IEEE Design Automation Conference [C]. San Francisco, USA: ACM Press,2(D9. 897 - 902.
  • 10Stefan Metzlaff, et al. A dynamic instruction scratchpad memory for embedded processors managed by hardware[ A ]. Proceedings of the 24th International Conference on Architecture of Computing Systems[C]. Lecture Notes of Computer Science 6566,2011.122 - 134.

二级参考文献9

  • 1赵学秘,王志英,岳虹,陆洪毅,戴葵.TTA-EC:一种基于传输触发体系结构的ECC整体算法处理器[J].计算机学报,2007,30(2):225-233. 被引量:4
  • 2J L Hennessy, D A. Patterson, Computer Architecture: A Quantitative Approach[M].3rd Edition,Morgan Kaufmann Publishing Co, 2002.
  • 3Henk Corporaal et al. Microprocessor Architecture from VLIW to TTA[ M]. West Sussex, England: John Wiley & Sons Ltd, 1999.
  • 4Kuukkanen, et al. Bitwise and dictionary modeling for code compression on transport triggered architectures [ J ]. WSEAS Transactions on Circuits and Systems, 2004, 9 ( 3 ) : 1750 - 1755.
  • 5J Heikkinen, et al. Evaluating template-based instruction compression on transport triggered architectures[ A]. Proceedings of 3rd IWSOC [ C]. IEEE Computer Society Press, 2003. 192 - 195.
  • 6J Heikkinen, et al. Dictionary-based program compression on transport triggered architectures [ A ]. Proceedings of ISCAS [ C]. IEEE Computer Society Press, 2005. 1122- 1125.
  • 7Yuan Xie, et al. Code compression for VLIW processors using variable-to-flxed coding [A ]. Proceedings of ISSS [ C ]. IEEE Computer Society Press, 2002.138 - 143.
  • 8S J Nam, et al. Improving dictionary-based code compression in VLIW architectures[ J]. IEICE Trans Fundamentals of Eleclronics, Communication and Computer Sciences, 1999, E82-A(11) : 2318 - 2124.
  • 9S Aditya, et al. Automatic Design of VLIW and EPIC Instruction Formats[R]. Hewlett-Packard Laboratories, 2000.

共引文献2

同被引文献14

  • 1李勇,王志英,赵学秘,岳虹.配置流驱动计算体系结构指导下的ASIP设计[J].计算机研究与发展,2007,44(4):714-721. 被引量:3
  • 2G E Suh,D Clarke,B Gassend,M van Dijk,S Devadas. AEGIS:Architecture for tamper-evident and tamper resistant processing[A].New York,USA:ACM Press,2003.160-171.
  • 3Chenghuai Lu,et al,M-Tree. A high efficiency security archi-tecture for protecting integrity and privacy of software[J].Jour-nal of Parallel and Distributed Computing,2006,(09):1116-1128.
  • 4Hall,W E,Jutla,C S. Parallelizable authentication trees[A].Kingston,Canada:Springer Berlin Heidelberg Press,2006.95-109.
  • 5Reouven Elbaz. TEC-Tree.A low cost and parallelizable tree for efficient defense against memory replay attacks[A].Vienna,Austria:Springer Berlin Heidelberg Press,2007.289-302.
  • 6Todd Austin,Eric Larson,Dan Ernst. SimpleScalar:An infras-tructure for computer system modeling[J].{H}COMPUTER,2002,(02):59-67.
  • 7Hussein Al-Zoubi. Performance evaluation of cache re-placement policies for the SPEC CPU2000 benchmark suite[A].New York,USA:ACM Press,2004.267-272.
  • 8Brian Rogers. Single-level integrity and confidentiality protection for distributed shared memory multiprocessors[A].Salt Lake City,UT:IEEE Computer Society,2008.161-172.
  • 9王超,张惠臻,周学海,马宏星.异质存储系统中的高速缓存机制研究[J].电子学报,2011,39(6):1267-1271. 被引量:6
  • 10刘文松,朱恩,王健,徐龙涛,林叶.JPEG2000算术编码器的算法优化和VLSI设计[J].电子学报,2011,39(11):2486-2491. 被引量:5

引证文献3

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部