期刊文献+

多核处理器目录缓存结构设计 被引量:3

Directory Cache Design for Multi-Core Processor
下载PDF
导出
摘要 随着物联网、云计算与网络舆情分析等应用的快速发展,大数据处理的应用已经成为数据中心的核心负载.数据中心服务器普遍采用多核处理器,而目录缓存作为多核处理器结构中维护缓存一致性的关键部件,对其结构研究(如稀疏目录)更多地关注于目录缓存的容量与可扩展性,更适合处理高性能计算等计算密集型应用.然而,当多核处理器执行延迟敏感的大数据应用程序时,目录缓存的高访存延迟严重制约了数据中心的服务质量.针对该问题,新型主从目录缓存结构优化了数据访问过程中的一致性协议通路,其中主目录区分共享与私有数据,管理私有数据的访存操作,降低私有数据的访存延迟,提高了从目录的容量利用率;从目录维护共享数据的缓存一致性,采用有限位标签结构,提高了从目录的存储效率.实验在Simics+GEMS模拟平台上对大数据程序测试集Cloudsuite-v1.0进行评估.结果表明在以大数据应用程序为主的运行环境下,与2倍容量的稀疏目录相比,主从目录缓存结构降低了24.39%的硬件开销,降低了28.45%的缓存缺失延时,提升了3.5%的处理器IPC;与缓存内目录相比,主从目录结构虽然损失了5.14%的缓存缺失延时与1.1%的处理器IPC,但是降低了42.59%的硬件开销. With the development of Internet of things,cloud computing and Internet public opinion analysis,big data applications are growing into the critical workloads in current data center.Directory cache is used to guarantee cache coherence in chip multi-processor,which is massively deployed in data centers.Previous researches proposed all kinds of innovation to improve the utilization of directory cache capacity and scalability,making it more suitable for high-performance computing.Big data workloads are timing sensitive,which is not satisfied by previous works.To meet the requirement of big data workloads,master-salve directory is a novel directory cache design,which can optimize the path of memory instruction.In the novel directory cache design,master directory picks up private data accesses and provides services for them to reduce miss-latency,and slave directory provides cache coherence for shared memory space to improve the utilization of cache capacity and the scalability of chip multi-processor.Our experiment benchmark is CloudSuite-v1.0,running on Simics+GEMS simulator.Compared with sparse directory with 2×capacity,the experimental results show that master-slave directory can reduce hardware overhead by 24.39%,and reduce miss-latency by 28.45%,and improve IPC by 3.5%.Compared with in-cache directory,the results show that master-slave directory sacrifices 5.14% miss-latency and 1.1%IPC,but reduces hardware overhead by 42.59%.
出处 《计算机研究与发展》 EI CSCD 北大核心 2015年第6期1242-1253,共12页 Journal of Computer Research and Development
基金 国家"八六三"高技术研究发展计划基金项目(2013AA011701)
关键词 大数据 多核处理器 缓存一致性 目录缓存 稀疏目录 big data multi-core processor cache coherence directory cache sparse directory
  • 相关文献

参考文献19

  • 1Ferdman M, Adileh A, Kocberber O, et al. Clearing the clouds~ A study of emerging scale out workloads on modernhardware [C] //Proc of the 17th Conf on Architecture Support for Programming Languages and Operating Systems (ASPLOS). New York: ACM, 2012:37-48.
  • 2Sorin D J, Hill M D, Wood D A. A Primer on Memory Consistency and Cache Coherence [M]. San Rafael, CA: Morgan & Claypool Publishers, 2011.
  • 3Barroso L A, Gharachorloo K, McNamara R, et al. Piranha: A scalable architecture based on single chip multiprocessing [C] //Proc of the 27th Annual Int Symp on Computer Architecture (ISCA). New York; ACM, 2000 282 -293.
  • 4Sun, OpenSPARCTM T2 core microarchitecture specification [R/OL]. Sun MicroSysmtes, Inc, 2007 [2015 -04-20]. http~//www, oracle, com/technetwork/systems/opensparc/t2 06 opensparet2 core-microarch 1537749. html.
  • 5Singhal R. Inside Intel next generation Nehalem micorarchlteeture [R/OL]. Intel Corporation, 2008 [2015- 04-20]. http://weblab, cs. uml. edu/-bill/cs515/Intel Nehalem Processor. pdf.
  • 6Ferdman M, Pejman L K, Balet K, et al. Cuckoo directory: A scalable directory for many-core systems [C] //Proc of the 17th Int Syrup on High Performance Computer Architecture (HPCA). New York: ACM, 2011:169-180.
  • 7Cuesta B A, Ros A, Gomez M E,' et al. Increasing the effectiveness of directory caches by deactivating coherence for private memory blocks [C] //Proe of the 38th Annual Int Syrup on Computer Architecture (ISCA). New York: ACM, 2011, 93-104.
  • 8Pejman L K, Grot B, Fredman M, et al. Scale-out Processors [C] //Proc of the 39th Annual Int Symp on Computer Architecture (ISCA). Piscataway, NJ: IEEE, 2012:500-511.
  • 9Gupta A, Weber W, Mowry T. Reducing memory and traffic requirements for scalable directory-based cache coherence schemes [C] //Proe of the 1990 Int Conf on Paraliel Processing (ICPP). Berlin: Springer, 1992:167-192.
  • 10Martin M M K, Hill M D, Sorin D J. Why on-chip cache coherence is here to stay [J]. Communications of the ACM, 2012, 55(7): 78-89.

同被引文献16

引证文献3

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部