摘要
【目的】为了加快国产CFD软件的计算速度,本文设计并实现了基于国产加速器的加速版本。【方法】基于CCFD V3.0版本,将软件的核心算法移植到国产加速器,并采用多种方法进行优化。【结果】使用128*128*128大小的网格进行实验,移植后的程序模拟结果与原CPU版本基本一致,单加速卡相比于单CPU核心,对流项计算部分取得了166倍的加速,ADI迭代计算部分取得了59倍的加速。【局限】由于CFD软件模块较多,未对整个程序进行移植优化,未来会将软件都移植到国产加速器上进行加速。【结论】本文实现了基于国产加速器的CFD核心算法并行优化,取得了较好的加速效果,为以后CFD软件的移植与优化工作提供了经验和参考。
[Objective]In order to accelerate the calculation of domestic CFD software,this paper designs and implements an accelerated version of CFD core algorithms based on the domestic processor.[Methods]Based on the CCFD V3.0 version,the core algorithms of the software were ported to the domestic processor and optimized by various methods.[Results]Using a 128*128*128 grid for experiments,the simulation results of the ported program are basically the same as the original CPU version.Compared with a single CPU core,one acceleration card has achieved 166 times acceleration on convection calculations,and 59 times acceleration on ADI iterative calculations.[Limitations]Because there are many modules in CFD software,the target CFD program has not been ported and optimized entirely.In the future,the software will be ported completely to the domestic processors for acceleration.[Conclusions]This article has realized the parallel optimization of the CFD core algorithms based on domestic processors and achieved good acceleration results,which provides experience and reference for porting and optimizing CFD software in the future.
作者
曹义魁
陆忠华
张鉴
刘夏真
袁武
梁姗
CAO Yikui;LU Zhonghua;ZHANG Jian;LIU Xiazhen;YUAN Wu;LIANG Shan(Computer Network Information Center,Chinese Academy of Sciences,Beijing 100190,China;University of Chinese Academy of Sciences,Beijing 100049,China)
出处
《数据与计算发展前沿》
CSCD
2021年第4期93-103,共11页
Frontiers of Data & Computing
基金
国家重点研发计划(2017YFB0202803)。
关键词
国产加速器
CFD
移植
高性能计算
并行计算
优化
domestic processors
CFD
transplantation
high-performance computing
parallel computing
optimization