Directory Cache Design for Multi-Core Processor (cited by: 3)


Abstract: With the rapid growth of applications such as the Internet of Things, cloud computing, and online public opinion analysis, big data processing has become the core workload of data centers. Data center servers commonly use multi-core processors, in which the directory cache is the key component for maintaining cache coherence. Prior work on directory cache structures, such as the sparse directory, focuses mainly on capacity and scalability, which suits compute-intensive workloads such as high-performance computing. When a multi-core processor runs latency-sensitive big data applications, however, the high access latency of the directory cache severely limits the quality of service of the data center. To address this problem, a new master-slave directory cache structure optimizes the coherence protocol path taken during data accesses. The master directory distinguishes shared data from private data and handles accesses to private data, which reduces private-data access latency and improves the capacity utilization of the slave directory. The slave directory maintains cache coherence for shared data and uses a limited-bit tag structure, which improves its storage efficiency. The design is evaluated on the Simics+GEMS simulation platform with the CloudSuite-v1.0 big data benchmark suite. In an environment dominated by big data applications, compared with a sparse directory of 2× the capacity, the master-slave directory cache reduces hardware overhead by 24.39%, reduces cache miss latency by 28.45%, and improves processor IPC by 3.5%. Compared with an in-cache directory, it gives up 5.14% in cache miss latency and 1.1% in processor IPC but reduces hardware overhead by 42.59%.
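The split between the two directories described in the abstract can be illustrated with a toy model. The sketch below is only an illustration under stated assumptions, not the paper's design: the class name `MasterSlaveDirectory`, the `access` method, and the rule that a block becomes shared as soon as a second core touches it are all hypothetical. It does not model the limited-bit tag structure, directory capacity, evictions, or latency.

```python
from dataclasses import dataclass, field


@dataclass
class MasterSlaveDirectory:
    """Toy model: the master tracks private blocks, the slave tracks shared ones."""

    # Hypothetical bookkeeping, not taken from the paper.
    master: dict = field(default_factory=dict)  # block address -> owning core
    slave: dict = field(default_factory=dict)   # block address -> set of sharer cores

    def access(self, core: int, block: int) -> str:
        """Record an access and report which directory handled it."""
        if block in self.slave:
            # Shared data: the slave directory tracks every sharer.
            self.slave[block].add(core)
            return "shared"
        owner = self.master.get(block)
        if owner is None:
            # First touch: assume the block is private to this core.
            self.master[block] = core
            return "private"
        if owner == core:
            # Private data served by the master directory alone.
            return "private"
        # Assumed promotion rule: a second core makes the block shared,
        # so tracking moves from the master to the slave directory.
        del self.master[block]
        self.slave[block] = {owner, core}
        return "promoted"


directory = MasterSlaveDirectory()
for core, block in [(0, 0x40), (0, 0x40), (1, 0x40), (2, 0x40)]:
    print(core, hex(block), directory.access(core, block))
```

In this model a block that only one core ever touches never occupies a slave entry, which is the intuition behind the claim that filtering private data improves the slave directory's capacity utilization.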

Authors: Wang Endong, Tang Shibin, Chen Jicheng, Wang Hongwei, Ni Fan, Zhao Yaqian

Affiliation: State Key Laboratory of High-End Server & Storage Technology (Inspur Group Co., Ltd.)

Source: Journal of Computer Research and Development, 2015, No. 6, pp. 1242-1253 (12 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.

Funding: National High Technology Research and Development Program of China (863 Program), grant 2013AA011701

Keywords: big data; multi-core processor; cache coherence; directory cache; sparse directory

Classification: TP303 [Automation and Computer Technology: Computer System Architecture]

