面向大规模计算集群的多轨分割网络被引量：2

A Sliced Multi-Rail Interconnection Network for Large-Scale Clusters

下载PDF

导出

摘要在千万亿次规模的系统中,互连网络设计面临新的挑战.高性能节点和大规模是构建千万亿次系统的主要技术趋势,不断提高的节点计算能力要求互连网络提供更高的性能,而不断增大的规模又对互连网络扩展性提出了更高的要求.此外,随着系统规模的增大,集合通信的执行时间也在不断增长,制约了应用的扩展性,集合通信的性能需要得到进一步优化.除性能之外,可靠性问题也随着系统规模的扩大而日益严重.而随着计算节点性能的不断提高,互连网络逐渐成为限制大规模计算机系统性能的瓶颈.互连网络核心部件交换芯片可提供的聚合网络带宽受到工艺和封装技术的限制.从网络结构与交换机结构的协同设计思想出发,提出了一种在交换机聚合带宽限定的条件下多轨分割网络结构和设计方法.通过数学建模和网络模拟仿真,分析了该多轨分割网络的性能边界.评测结果表明:该网络可将短消息(长度小于128B)的平均延迟性能提高10倍以上,为以短消息占多数的数据中心网络的性能优化提供了新思路. In largescale clusters, the design of interconnection network is facing greater challenges. Firstly, the increasing computing capacity of a single node requires the network providing higher bandwidth and lower latency. Secondly, the increasing number of nodes requires the network to have extremely better scalability. Thirdly, the increasing scale of system leads to worse performance of collective communication, which is harmful to the performance and scalability of applications. Fourthly, the increasing number of devices requires the network to have better reliability. As the performance of computing nodes keeps increasing, interconnection network has gradually become the bottleneck of largescale computing system. However, switch chip, the core component of interconnection network, can offer limited aggregate bandwidth because of the constraint of physical processes and packaging technologies. With the codesign of network architecture and switch microarchitecture, this paper proposes a sliced multirail network architecture regarding the given aggregate bandwidth. Through mathematical modeling and network simulation, we studies the performance boundaries of sliced multirail network. Evaluation results show that the average latency of the short message （less than 128B）can be increased by more than 10 times.

作者邵恩元国军郇志轩曹政孙凝晖

机构地区计算机体系结构国家重点实验室(中国科学院计算技术研究所) 中国科学院大学

出处《计算机研究与发展》 EI CSCD 北大核心 2017年第11期2534-2546,共13页 Journal of Computer Research and Development

基金国家重点研发计划项目(2016YFB0200300 2016YFGX030148 2016YFB0200205 2016GZKF0JT006) 国家自然科学基金项目(61572464 61331008 61402444) 国家"八六三"高技术研究发展计划基金项目(2015AA01A301) 华为科研基金项目(YB2015070066) 中国科学院战略性先导科技专项(XDB24060600)~~

关键词大规模计算集群多轨网络带宽分割数据中心网络大规模网络模拟 largescale clusters multirail network bandwidth division data center network largescale network simulation

分类号 TP303 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献1

1王达伟,曹政,刘新春,游定山,孙凝晖.高性能互联网络交换机研究与设计[J].计算机研究与发展,2008,45(12):2069-2078. 被引量：3

二级参考文献21

1Salvador C. A strategy for efficient and scalable collective communication in the quadrics network [D]. Valencia, Spain: Electronic Engineering Departrnent, Technical University of Valencia, 2005.
2Riesen R. Communication patterns [C] //Proc of Workshop on Communication Architecture for Clusters. Los Alamitos, CA: IEEE Computer Society, 2006.
3Quadrics QsNetII: A network for supereomputing applications [OL]. [2007-10-12]. http://www. quadrics. com/.
4Scott S, Abts D, Kim J, et al. The BlackWidow high-radix Clos network [C] //Proc of the 33rd Annual Int Symp on Computer Architecture. Los Alamitos, CA: IEEE Computer Society, 2006:16-28.
5Liu Jiuxing, Mamidala Amith R, Dhabaleswar K Panda. Fast and scalable MPI-level broadcast using InfiniBand's hardware multicast Support [C] //Proc of Int Parallel and Distributed Processing Symposium ( IPDPS 2004 ). Piscataway, NJ: IEEE, 2004.
6Myricom. Myrinet Overview [OL]. [2007-05-17]. http:// www. myricom. com/myrinet/overview/.
7Filch J, Lopez P, Malumbres M P, et al. Boosting the performance of Myrinet networks [J]. IEEE Trans on Parallel and Distributed Systems, 2002, 13(11): 1166-1182.
8Mellanox. Mellanox InfiniScaleTM III [OL]. http://www. mellanox. com/pdf/products/silicon/InfiniScalelII. pdf.
9The Message Passing Interface (MPI) standard [OL]. [2007-05-18]. http://www-unix. mcs. anl. gov/mpi/.
10Berkeley UPC-Unified Parallel C [OL]. [2007-05-18]. http://upc. lbl. gov/.

共引文献2

1王文龙,杨贵,刘明慧.智能变电站过程层用交换机的研制[J].电力系统自动化,2011,35(18):72-76. 被引量：25
2杨贵,高红亮,彭安,张喜铭,李莉,潘磊.智能变电站过程层交换机设计及实现[J].电力工程技术,2017,36(5):128-135. 被引量：5

同被引文献4

1王小宁,肖海力,曹荣强.面向高性能计算环境的作业优化调度模型的设计与实现[J].计算机工程与科学,2017,39(4):619-626. 被引量：17
2陈琳,南洋.一种面向大规模系统域网络性能管理系统[J].计算机工程与科学,2017,39(9):1588-1593. 被引量：3
3王强,李亭,孟浩华.基于流量分析的电力企业应用性能监控系统的设计与实现[J].计算机与数字工程,2017,45(6):1170-1174. 被引量：5
4刘开南.云数据中心基于遗传算法的虚拟机迁移模型[J].计算机应用研究,2020,37(4):1115-1118. 被引量：10

引证文献2

1陈斌,宫本超.大规模计算机互联网络性能监控模型的设计与实现[J].科学技术创新,2018(6):85-86. 被引量：1
2蔡文伟,朱嘉贤,张会兵.任务序列强度感知的大规模集群服务器控制模型[J].计算机应用研究,2020,37(12):3753-3756. 被引量：1

二级引证文献2

1朱振伸,范黎林,赵敬云.多媒体网络中基于QoS的自适应SPC仿真[J].计算机仿真,2022,39(1):213-217. 被引量：1
2陈灵毓.大规模计算机互联网络性能监控模型的设计研究[J].信息记录材料,2024,25(9):66-68.

1邵冰.《节点计算》教学中学生的思维能力培养初探[J].读与写（教育教学刊）,2017,14(4).
2黄秋兰,李海波,石京燕,孙震宇,伍文静,程耀东,程振京.基于Openstack的高能物理虚拟计算集群系统及应用[J].计算机科学,2017,44(10):59-63. 被引量：4
3李梅生,肖文俊,赖正文,张占英,韩冬.一种具有小世界性常数度的数据中心网[J].华南理工大学学报（自然科学版）,2017,45(7):63-68. 被引量：1
4孙震宇,石京燕,姜晓巍,邹佳恒,杜然.大型高能物理计算集群资源管理方法的评测[J].计算机科学,2017,44(10):85-90. 被引量：7
5黄鹤.成品油储运中存在的风险与技术趋势[J].化工管理,2017(26):276-276. 被引量：3
62018年《计算机研究与发展》专题(正刊)征文通知--网络功能虚拟化[J].计算机研究与发展,2017,54(11):2557-2557.
7徐文远,毛力,王晓锋.基于流量约简的网络模拟方法[J].计算机工程,2017,43(1):120-125. 被引量：1
8王烨.基于单片机的语音交换单元的设计与制作[J].数字技术与应用,2017,35(9):3-4.
9吴冰冰,余冰雁,伍剑,林金桐.2017年欧洲光通信会议述评[J].电信科学,2017,33(10):155-162. 被引量：3
10王德强,王敢甫.基于Mininet的胖树SDN网络仿真[J].软件,2017,38(9):46-50. 被引量：3

计算机研究与发展

2017年第11期

浏览历史

内容加载中请稍等...

面向大规模计算集群的多轨分割网络被引量：2

参考文献1

二级参考文献21

共引文献2

同被引文献4

引证文献2

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

面向大规模计算集群的多轨分割网络 被引量：2

参考文献1

二级参考文献21

共引文献2

同被引文献4

引证文献2

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

面向大规模计算集群的多轨分割网络被引量：2