期刊文献+
共找到23篇文章
< 1 2 >
每页显示 20 50 100
Turbo Message Passing Based Burst Interference Cancellation for Data Detection in Massive MIMO-OFDM Systems
1
作者 Wenjun Jiang Zhihao Ou +1 位作者 Xiaojun Yuan Li Wang 《China Communications》 SCIE CSCD 2024年第2期143-154,共12页
This paper investigates the fundamental data detection problem with burst interference in massive multiple-input multiple-output orthogonal frequency division multiplexing(MIMO-OFDM) systems. In particular, burst inte... This paper investigates the fundamental data detection problem with burst interference in massive multiple-input multiple-output orthogonal frequency division multiplexing(MIMO-OFDM) systems. In particular, burst interference may occur only on data symbols but not on pilot symbols, which means that interference information cannot be premeasured. To cancel the burst interference, we first revisit the uplink multi-user system and develop a matrixform system model, where the covariance pattern and the low-rank property of the interference matrix is discussed. Then, we propose a turbo message passing based burst interference cancellation(TMP-BIC) algorithm to solve the data detection problem, where the constellation information of target data is fully exploited to refine its estimate. Furthermore, in the TMP-BIC algorithm, we design one module to cope with the interference matrix by exploiting its lowrank property. Numerical results demonstrate that the proposed algorithm can effectively mitigate the adverse effects of burst interference and approach the interference-free bound. 展开更多
关键词 burst interference cancellation data detection massive multiple-input multiple-output(MIMO) message passing orthogonal frequency division multiplexing(OFDM)
下载PDF
Vector Approximate Message Passing with Sparse Bayesian Learning for Gaussian Mixture Prior 被引量:1
2
作者 Chengyao Ruan Zaichen Zhang +3 位作者 Hao Jiang Jian Dang Liang Wu Hongming Zhang 《China Communications》 SCIE CSCD 2023年第5期57-69,共13页
Compressed sensing(CS)aims for seeking appropriate algorithms to recover a sparse vector from noisy linear observations.Currently,various Bayesian-based algorithms such as sparse Bayesian learning(SBL)and approximate ... Compressed sensing(CS)aims for seeking appropriate algorithms to recover a sparse vector from noisy linear observations.Currently,various Bayesian-based algorithms such as sparse Bayesian learning(SBL)and approximate message passing(AMP)based algorithms have been proposed.For SBL,it has accurate performance with robustness while its computational complexity is high due to matrix inversion.For AMP,its performance is guaranteed by the severe restriction of the measurement matrix,which limits its application in solving CS problem.To overcome the drawbacks of the above algorithms,in this paper,we present a low complexity algorithm for the single linear model that incorporates the vector AMP(VAMP)into the SBL structure with expectation maximization(EM).Specifically,we apply the variance auto-tuning into the VAMP to implement the E step in SBL,which decrease the iterations that require to converge compared with VAMP-EM algorithm when using a Gaussian mixture(GM)prior.Simulation results show that the proposed algorithm has better performance with high robustness under various cases of difficult measurement matrices. 展开更多
关键词 sparse Bayesian learning approximate message passing compressed sensing expectation propagation
下载PDF
Speech Enhancement Based on Approximate Message Passing 被引量:1
3
作者 Chao Li Ting Jiang Sheng Wu 《China Communications》 SCIE CSCD 2020年第8期187-198,共12页
To overcome the limitations of conventional speech enhancement methods, such as inaccurate voice activity detector(VAD) and noise estimation, a novel speech enhancement algorithm based on the approximate message passi... To overcome the limitations of conventional speech enhancement methods, such as inaccurate voice activity detector(VAD) and noise estimation, a novel speech enhancement algorithm based on the approximate message passing(AMP) is adopted. AMP exploits the difference between speech and noise sparsity to remove or mute the noise from the corrupted speech. The AMP algorithm is adopted to reconstruct the clean speech efficiently for speech enhancement. More specifically, the prior probability distribution of speech sparsity coefficient is characterized by Gaussian-model, and the hyper-parameters of the prior model are excellently learned by expectation maximization(EM) algorithm. We utilize the k-nearest neighbor(k-NN) algorithm to learn the sparsity with the fact that the speech coefficients between adjacent frames are correlated. In addition, computational simulations are used to validate the proposed algorithm, which achieves better speech enhancement performance than other four baseline methods-Wiener filtering, subspace pursuit(SP), distributed sparsity adaptive matching pursuit(DSAMP), and expectation-maximization Gaussian-model approximate message passing(EM-GAMP) under different compression ratios and a wide range of signal to noise ratios(SNRs). 展开更多
关键词 speech enhancement approximate message passing Gaussian model expectation maximization algorithm
下载PDF
Message Passing Based Detection for Orthogonal Time Frequency Space Modulation
4
作者 YUAN Zhengdao LIU Fei +1 位作者 GUO Qinghua WANG Zhongyong 《ZTE Communications》 2021年第4期34-44,共11页
The orthogonal time frequency space(OTFS)modulation has emerged as a promis⁃ing modulation scheme for wireless communications in high-mobility scenarios.An efficient detector is of paramount importance to harvesting t... The orthogonal time frequency space(OTFS)modulation has emerged as a promis⁃ing modulation scheme for wireless communications in high-mobility scenarios.An efficient detector is of paramount importance to harvesting the time and frequency diversities promised by OTFS.Recently,some message passing based detectors have been developed by exploiting the features of the OTFS channel matrices.In this paper,we provide an overview of some re⁃cent message passing based OTFS detectors,compare their performance,and shed some light on potential research on the design of message passing based OTFS receivers. 展开更多
关键词 OTFS DETECTION message passing belief propagation approximate message pass⁃ing(AMP) unitary AMP(UAMP)
下载PDF
A residual-based message passing algorithm for constraint satisfaction problems
5
作者 Chun-Yan Zhao Yan-Rong Fu Jin-Hua Zhao 《Communications in Theoretical Physics》 SCIE CAS CSCD 2022年第3期77-86,共10页
Message passing algorithms,whose iterative nature captures complicated interactions among interconnected variables in complex systems and extracts information from the fixed point of iterated messages,provide a powerf... Message passing algorithms,whose iterative nature captures complicated interactions among interconnected variables in complex systems and extracts information from the fixed point of iterated messages,provide a powerful toolkit in tackling hard computational tasks in optimization,inference,and learning problems.In the context of constraint satisfaction problems(CSPs),when a control parameter(such as constraint density)is tuned,multiple threshold phenomena emerge,signaling fundamental structural transitions in their solution space.Finding solutions around these transition points is exceedingly challenging for algorithm design,where message passing algorithms suffer from a large message fiuctuation far from convergence.Here we introduce a residual-based updating step into message passing algorithms,in which messages with large variation between consecutive steps are given high priority in the updating process.For the specific example of model RB(revised B),a typical prototype of random CSPs with growing domains,we show that our algorithm improves the convergence of message updating and increases the success probability in finding solutions around the satisfiability threshold with a low computational cost.Our approach to message passing algorithms should be of value for exploring their power in developing algorithms to find ground-state solutions and understand the detailed structure of solution space of hard optimization problems. 展开更多
关键词 constraint satisfaction problems model RB message passing algorithms residuals of messages
原文传递
An End-To-End Hyperbolic Deep Graph Convolutional Neural Network Framework
6
作者 Yuchen Zhou Hongtao Huo +5 位作者 Zhiwen Hou Lingbin Bu Yifan Wang Jingyi Mao Xiaojun Lv Fanliang Bu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第4期537-563,共27页
Graph Convolutional Neural Networks(GCNs)have been widely used in various fields due to their powerful capabilities in processing graph-structured data.However,GCNs encounter significant challenges when applied to sca... Graph Convolutional Neural Networks(GCNs)have been widely used in various fields due to their powerful capabilities in processing graph-structured data.However,GCNs encounter significant challenges when applied to scale-free graphs with power-law distributions,resulting in substantial distortions.Moreover,most of the existing GCN models are shallow structures,which restricts their ability to capture dependencies among distant nodes and more refined high-order node features in scale-free graphs with hierarchical structures.To more broadly and precisely apply GCNs to real-world graphs exhibiting scale-free or hierarchical structures and utilize multi-level aggregation of GCNs for capturing high-level information in local representations,we propose the Hyperbolic Deep Graph Convolutional Neural Network(HDGCNN),an end-to-end deep graph representation learning framework that can map scale-free graphs from Euclidean space to hyperbolic space.In HDGCNN,we define the fundamental operations of deep graph convolutional neural networks in hyperbolic space.Additionally,we introduce a hyperbolic feature transformation method based on identity mapping and a dense connection scheme based on a novel non-local message passing framework.In addition,we present a neighborhood aggregation method that combines initial structural featureswith hyperbolic attention coefficients.Through the above methods,HDGCNN effectively leverages both the structural features and node features of graph data,enabling enhanced exploration of non-local structural features and more refined node features in scale-free or hierarchical graphs.Experimental results demonstrate that HDGCNN achieves remarkable performance improvements over state-ofthe-art GCNs in node classification and link prediction tasks,even when utilizing low-dimensional embedding representations.Furthermore,when compared to shallow hyperbolic graph convolutional neural network models,HDGCNN exhibits notable advantages and performance enhancements. 展开更多
关键词 Graph neural networks hyperbolic graph convolutional neural networks deep graph convolutional neural networks message passing framework
下载PDF
Multi-Panel Extra-Large Scale MIMO Based Joint Activity Detection and Channel Estimation for Near-Field Massive IoT Access 被引量:1
7
作者 Zhen Gao Hanlin Xiu +4 位作者 Yikun Mei Anwen Liao Malong Ke Chun Hu Mohamed-Slim Alouini 《China Communications》 SCIE CSCD 2023年第5期232-243,共12页
The extra-large scale multiple-input multiple-output(XL-MIMO)for the beyond fifth/sixth generation mobile communications is a promising technology to provide Tbps data transmission and stable access service.However,th... The extra-large scale multiple-input multiple-output(XL-MIMO)for the beyond fifth/sixth generation mobile communications is a promising technology to provide Tbps data transmission and stable access service.However,the extremely large antenna array aperture arouses the channel near-field effect,resulting in the deteriorated data rate and other challenges in the practice communication systems.Meanwhile,multi-panel MIMO technology has attracted extensive attention due to its flexible configuration,low hardware cost,and wider coverage.By combining the XL-MIMO and multi-panel array structure,we construct multi-panel XL-MIMO and apply it to massive Internet of Things(IoT)access.First,we model the multi-panel XL-MIMO-based near-field channels for massive IoT access scenarios,where the electromagnetic waves corresponding to different panels have different angles of arrival/departure(AoAs/AoDs).Then,by exploiting the sparsity of the near-field massive IoT access channels,we formulate a compressed sensing based joint active user detection(AUD)and channel estimation(CE)problem which is solved by AMP-EM-MMV algorithm.The simulation results exhibit the superiority of the AMP-EM-MMV based joint AUD and CE scheme over the baseline algorithms. 展开更多
关键词 extra-large scale MIMO massive IoT access active user detection channel estimation multipanel approximate message passing
下载PDF
Iterative Receiver for Orthogonal Time Frequency Space with Index Modulation via Structured Prior-Based Hybrid Belief and Expectation Propagation
8
作者 Haoyang Li Bin Li +2 位作者 Tingting Zhang Yuan Feng Nan Wu 《China Communications》 SCIE CSCD 2023年第1期66-78,共13页
Orthogonal Time Frequency Space(OTFS)signaling with index modulation(IM)is a promising transmission scheme characterized by high transmission efficiency for high mobility scenarios.In this paper,we study the receiver ... Orthogonal Time Frequency Space(OTFS)signaling with index modulation(IM)is a promising transmission scheme characterized by high transmission efficiency for high mobility scenarios.In this paper,we study the receiver for coded OTFS-IM system.First,we construct the corresponding factor graph,on which the structured prior incorporating activation pattern constraint and channel coding is devised.Then we develop a iterative receiver via structured prior-based hybrid belief propagation(BP)and expectation propagation(EP)algorithm,named as StrBP-EP,for the coded OTFS-IM system.To reduce the computational complexity of discrete distribution introduced by structured prior,Gaussian approximation conducted by EP is adopted.To further reduce the complexity,we derive two variations of the proposed algorithm by using some approximations.Simulation results validate the superior performance of the proposed algorithm. 展开更多
关键词 OTFS index modulation message passing belief propagation expectation propagation
下载PDF
Model-Driven Deep Learning for Massive Space-Domain Index Modulation MIMO Detection
9
作者 Ping Yang Qin Yi +3 位作者 Yiqian Huang Jialiang Fu Yue Xiao Wanbin Tang 《China Communications》 SCIE CSCD 2023年第10期43-57,共15页
In this paper,a powerful model-driven deep learning framework is exploited to overcome the challenge of multi-domain signal detection in spacedomain index modulation(SDIM)based multiple input multiple output(MIMO)syst... In this paper,a powerful model-driven deep learning framework is exploited to overcome the challenge of multi-domain signal detection in spacedomain index modulation(SDIM)based multiple input multiple output(MIMO)systems.Specifically,we use orthogonal approximate message passing(OAMP)technique to develop OAMPNet,which is a novel signal recovery mechanism in the field of compressed sensing that effectively uses the sparse property from the training SDIM samples.For OAMPNet,the prior probability of the transmit signal has a significant impact on the obtainable performance.For this reason,in our design,we first derive the prior probability of transmitting signals on each antenna for SDIMMIMO systems,which is different from the conventional massive MIMO systems.Then,for massive MIMO scenarios,we propose two novel algorithms to avoid pre-storing all active antenna combinations,thus considerably improving the memory efficiency and reducing the related overhead.Our simulation results show that the proposed framework outperforms the conventional optimization-driven based detection algorithms and has strong robustness under different antenna scales. 展开更多
关键词 deep learning generalized spatial modulation index modulation massive MIMO message passing orthogonal approximate
下载PDF
UAMP-Based Delay-Doppler Channel Estimation for OTFS Systems
10
作者 Zhongjie Li Weijie Yuan +2 位作者 Qinghua Guo Nan Wu Ji Zhang 《China Communications》 SCIE CSCD 2023年第10期70-84,共15页
Orthogonal time frequency space(OTFS)technique,which modulates data symbols in the delay-Doppler(DD)domain,presents a potential solution for supporting reliable information transmission in highmobility vehicular netwo... Orthogonal time frequency space(OTFS)technique,which modulates data symbols in the delay-Doppler(DD)domain,presents a potential solution for supporting reliable information transmission in highmobility vehicular networks.In this paper,we study the issues of DD channel estimation for OTFS in the presence of fractional Doppler.We first propose a channel estimation algorithm with both low complexity and high accuracy based on the unitary approximate message passing(UAMP),which exploits the structured sparsity of the effective DD domain channel using hidden Markov model(HMM).The empirical state evolution(SE)analysis is then leveraged to predict the performance of our proposed algorithm.To refine the hyperparameters in the proposed algorithm,we derive the update criterion for the hyperparameters through the expectation-maximization(EM)algorithm.Finally,Our simulation results demonstrate that our proposed algorithm can achieve a significant gain over various baseline schemes. 展开更多
关键词 orthogonal time frequency space(OTFS) channel estimation hidden Markov model(HMM) unitary approximate message passing(UAMP)
下载PDF
A Survey on Low Complexity Detectors for OTFS Systems 被引量:3
11
作者 ZHANG Zhengquan LIU Heng +1 位作者 WANG Qianli FAN Pingzhi 《ZTE Communications》 2021年第4期3-15,共13页
The newly emerging orthogonal time frequency space(OTFS)modulation can ob⁃tain delay-Doppler diversity gain to significantly improve the system performance in high mobility wireless communication scenarios such as veh... The newly emerging orthogonal time frequency space(OTFS)modulation can ob⁃tain delay-Doppler diversity gain to significantly improve the system performance in high mobility wireless communication scenarios such as vehicle-to-everything(V2X),high-speed railway and unmanned aerial vehicles(UAV),by employing inverse symplectic finite Fouri⁃er transform(ISFFT)and symplectic finite Fourier transform(SFFT).However,OTFS modu⁃lation will dramatically increase system complexity,especially at the receiver side.Thus,de⁃signing low complexity OTFS receiver is a key issue for OTFS modulation to be adopted by new-generation wireless communication systems.In this paper,we review low complexity OTFS detectors and provide some insights on future researches.We firstly present the OTFS system model and basic principles,followed by an overview of OTFS detector structures,classifications and comparative discussion.We also survey the principles of OTFS detection algorithms.Furthermore,we discuss the design of hybrid OTFS and orthogonal frequency di⁃vision multiplexing(OFDM)detectors in single user and multi-user multi-waveform commu⁃nication systems.Finally,we address the main challenges in designing low complexity OT⁃FS detectors and identify some future research directions. 展开更多
关键词 high mobility wireless communications OTFS ISFFT SFFT delay-Doppler di⁃versity iterative maximum ratio combining(MRC)detection message passing detection
下载PDF
Coupling analysis of transmission lines excited by space electromagnetic fields based on time domain hybrid method using parallel technique 被引量:1
12
作者 叶志红 吴小林 李尧尧 《Chinese Physics B》 SCIE EI CAS CSCD 2020年第9期249-254,共6页
We present a time domain hybrid method to realize the fast coupling analysis of transmission lines excited by space electromagnetic fields, in which parallel finite-difference time-domain (FDTD) method, interpolation ... We present a time domain hybrid method to realize the fast coupling analysis of transmission lines excited by space electromagnetic fields, in which parallel finite-difference time-domain (FDTD) method, interpolation scheme, and Agrawal model-based transmission line (TL) equations are organically integrated together. Specifically, the Agrawal model is employed to establish the TL equations to describe the coupling effects of space electromagnetic fields on transmission lines. Then, the excitation fields functioning as distribution sources in TL equations are calculated by the parallel FDTD method through using the message passing interface (MPI) library scheme and interpolation scheme. Finally, the TL equations are discretized by the central difference scheme of FDTD and assigned to multiple processors to obtain the transient responses on the terminal loads of these lines. The significant feature of the presented method is embodied in its parallel and synchronous calculations of the space electromagnetic fields and transient responses on the lines. Numerical simulations of ambient wave acting on multi-conductor transmission lines (MTLs), which are located on the PEC ground and in the shielded cavity respectively, are implemented to verify the accuracy and efficiency of the presented method. 展开更多
关键词 Agrawal model transmission line equations parallel FDTD method message passing interface(MPI)library
原文传递
Large-scale high performance computation on 3D explosion and shockproblems
13
作者 费广磊 马天宝 郝莉 《Applied Mathematics and Mechanics(English Edition)》 SCIE EI 2011年第3期375-382,共8页
Explosion and shock often involve large deformation, interface treatment between multi-material, and strong discontinuity. The Eulerian method has advantages for solving these problems. In parallel computation of the ... Explosion and shock often involve large deformation, interface treatment between multi-material, and strong discontinuity. The Eulerian method has advantages for solving these problems. In parallel computation of the Eulerian method, the physical quantities of the computaional cells do not change before the disturbance reaches to these cells. Computational efficiency is low when using fixed partition because of load imbalance. To solve this problem, a dynamic parallel method in which the computation domain expands with disturbance is used. The dynamic parallel program is designed based on the generally used message passing interface model. The numerical test of dynamic parallel program agrees well with that of the original parallel program, also agrees with the actual situation. 展开更多
关键词 explosion and shock dynamic parallel message passing interface airexplosion
下载PDF
Real-space parallel density matrix renormalization group with adaptive boundaries
14
作者 陈富州 程晨 罗洪刚 《Chinese Physics B》 SCIE EI CAS CSCD 2021年第8期191-197,共7页
We propose an improved real-space parallel strategy for the density matrix renormalization group(DMRG)method,where boundaries of separate regions are adaptively distributed during DMRG sweeps.Our scheme greatly improv... We propose an improved real-space parallel strategy for the density matrix renormalization group(DMRG)method,where boundaries of separate regions are adaptively distributed during DMRG sweeps.Our scheme greatly improves the parallel efficiency with shorter waiting time between two adjacent tasks,compared with the original real-space parallel DMRG with fixed boundaries.We implement our new strategy based on the message passing interface(MPI),and dynamically control the number of kept states according to the truncation error in each DMRG step.We study the performance of the new parallel strategy by calculating the ground state of a spin-cluster chain and a quantum chemical Hamiltonian of the water molecule.The maximum parallel efficiencies for these two models are 91%and 76%in 4 nodes,which are much higher than the real-space parallel DMRG with fixed boundaries. 展开更多
关键词 density matrix renormalization group strongly correlated systems message passing interface
原文传递
Deep Learning-Based AMP for Massive MIMO Detection
15
作者 Yang Yang Shaoping Chen Xiqi Gao 《China Communications》 SCIE CSCD 2022年第10期69-77,共9页
Low-complexity detectors play an essential role in massive multiple-input multiple-output (MIMO) transmissions. In this work, we discuss the perspectives of utilizing approximate message passing (AMP) algorithm to the... Low-complexity detectors play an essential role in massive multiple-input multiple-output (MIMO) transmissions. In this work, we discuss the perspectives of utilizing approximate message passing (AMP) algorithm to the detection of massive MIMO transmission. To this end, we need to efficiently reduce the divergence occurrence in AMP iterations and bridge the performance gap that AMP has from the optimum detector while making use of its advantage of low computational load. Our solution is to build a neural network to learn and optimize AMP detection with four groups of specifically designed learnable coefficients such that divergence rate and detection mean squared error (MSE) can be significantly reduced. Moreover, the proposed deep learning-based AMP has a much faster converging rate, and thus a much lower computational complexity than conventional AMP, providing an alternative solution for the massive MIMO detection. Extensive simulation experiments are provided to validate the advantages of the proposed deep learning-based AMP. 展开更多
关键词 approximate message passing CONVERGENCE machine learning
下载PDF
Static Analysis Techniques for Fixing Software Defects in MPI-Based Parallel Programs
16
作者 Norah Abdullah Al-Johany Sanaa Abdullah Sharaf +1 位作者 Fathy Elbouraey Eassa Reem Abdulaziz Alnanih 《Computers, Materials & Continua》 SCIE EI 2024年第5期3139-3173,共35页
The Message Passing Interface (MPI) is a widely accepted standard for parallel computing on distributed memorysystems.However, MPI implementations can contain defects that impact the reliability and performance of par... The Message Passing Interface (MPI) is a widely accepted standard for parallel computing on distributed memorysystems.However, MPI implementations can contain defects that impact the reliability and performance of parallelapplications. Detecting and correcting these defects is crucial, yet there is a lack of published models specificallydesigned for correctingMPI defects. To address this, we propose a model for detecting and correcting MPI defects(DC_MPI), which aims to detect and correct defects in various types of MPI communication, including blockingpoint-to-point (BPTP), nonblocking point-to-point (NBPTP), and collective communication (CC). The defectsaddressed by the DC_MPI model include illegal MPI calls, deadlocks (DL), race conditions (RC), and messagemismatches (MM). To assess the effectiveness of the DC_MPI model, we performed experiments on a datasetconsisting of 40 MPI codes. The results indicate that the model achieved a detection rate of 37 out of 40 codes,resulting in an overall detection accuracy of 92.5%. Additionally, the execution duration of the DC_MPI modelranged from 0.81 to 1.36 s. These findings show that the DC_MPI model is useful in detecting and correctingdefects in MPI implementations, thereby enhancing the reliability and performance of parallel applications. TheDC_MPImodel fills an important research gap and provides a valuable tool for improving the quality ofMPI-basedparallel computing systems. 展开更多
关键词 High-performance computing parallel computing software engineering software defect message passing interface deadlock
下载PDF
High Performance MPI over the Slingshot Interconnect
17
作者 Kawthar Shafie Khorassani Chen-Chun Chen +3 位作者 Bharath Ramesh Aamir Shafi Hari Subramoni Dhabaleswar K.Panda 《Journal of Computer Science & Technology》 SCIE EI CSCD 2023年第1期128-145,共18页
The Slingshot interconnect designed by HPE/Cray is becoming more relevant in high-performance computing with its deployment on the upcoming exascale systems.In particular,it is the interconnect empowering the first ex... The Slingshot interconnect designed by HPE/Cray is becoming more relevant in high-performance computing with its deployment on the upcoming exascale systems.In particular,it is the interconnect empowering the first exascale and highest-ranked supercomputer in the world,Frontier.It offers various features such as adaptive routing,congestion control,and isolated workloads.The deployment of newer interconnects sparks interest related to performance,scalability,and any potential bottlenecks as they are critical elements contributing to the scalability across nodes on these systems.In this paper,we delve into the challenges the Slingshot interconnect poses with current state-of-the-art MPI(message passing interface)libraries.In particular,we look at the scalability performance when using Slingshot across nodes.We present a comprehensive evaluation using various MPI and communication libraries including Cray MPICH,Open-MPI+UCX,RCCL,and MVAPICH2 on CPUs and GPUs on the Spock system,an early access cluster deployed with Slingshot-10,AMD MI100 GPUs and AMD Epyc Rome CPUs to emulate the Frontier system.We also evaluate preliminary CPU-based support of MPI libraries on the Slingshot-11 interconnect. 展开更多
关键词 AMD GPU interconnect technology MPI(message passing interface) Slingshot
原文传递
Denoising enabled channel estimation for underwater acoustic communications:A sparsity-aware model-driven learning approach
18
作者 Sicong Liu Younan Mou +2 位作者 Xianyao Wang Danping Su Ling Cheng 《Intelligent and Converged Networks》 EI 2023年第1期1-14,共14页
It has always been difficult to achieve accurate information of the channel for underwater acoustic communications because of the severe underwater propagation conditions,including frequency-selective property,high re... It has always been difficult to achieve accurate information of the channel for underwater acoustic communications because of the severe underwater propagation conditions,including frequency-selective property,high relative mobility,long propagation latency,and intensive ambient noise,etc.To this end,a deep unfolding neural network based approach is proposed,in which multiple layers of the network mimic the iterations of the classical iterative sparse approximation algorithm to extract the inherent sparse features of the channel by exploiting deep learning,and a scheme based on the Sparsity-Aware DNN(SA-DNN)for UAC estimation is proposed to improve the estimation accuracy.Moreover,we propose a Denoising Sparsity-Aware DNN(DeSA-DNN)based enhanced method that integrates a denoising CNN module in the sparsity-aware deep network,so that the degradation brought by intensive ambient noise could be eliminated and the estimation accuracy can be further improved.Simulation results demonstrate that the performance of the proposed schemes is superior to the state-of-the-art compressed sensing based and iterative sparse recovery schems in the aspects of channel recovery precision,pilot overhead,and robustness,particularly under unideal circumstances of intensive ambient noise or inadequate measurement pilots. 展开更多
关键词 Orthogonal Frequency Division Multiplexing(OFDM) Underwater Acoustic Communications(UAC) sparse recovery deep learning sparse learning DENOISING approximate message passing
原文传递
PMODTRAN:a parallel implementation based on MODTRAN for massive remote sensing data processing
19
作者 Fang Huang Ji Zhou +3 位作者 Jian Tao Xicheng Tan Shunlin Liang Jie Cheng 《International Journal of Digital Earth》 SCIE EI CSCD 2016年第9期819-834,共16页
MODerate resolution atmospheric TRANsmission(MODTRAN)is a commercial remote sensing(RS)software package that has been widely used to simulate radiative transfer of electromagnetic radiation through the Earth’s atmosp... MODerate resolution atmospheric TRANsmission(MODTRAN)is a commercial remote sensing(RS)software package that has been widely used to simulate radiative transfer of electromagnetic radiation through the Earth’s atmosphere and the radiation observed by a remote sensor.However,when very large RS datasets must be processed in simulation applications at a global scale,it is extremely time-consuming to operate MODTRAN on a modern workstation.Under this circumstance,the use of parallel cluster computing to speed up the process becomes vital to this time-consuming task.This paper presents PMODTRAN,an implementation of a parallel task-scheduling algorithm based on MODTRAN.PMODTRAN was able to reduce the processing time of the test cases used here from over 4.4 months on a workstation to less than a week on a local computer cluster.In addition,PMODTRAN can distribute tasks with different levels of granularity and has some extra features,such as dynamic load balancing and parameter checking. 展开更多
关键词 Parallel computing message passing interface MODTRAN thermal infrared remote sensing land-surface temperature retrieval
原文传递
An MPI+OpenACC-Based PRM Scalar Advection Scheme in the GRAPES Model over a Cluster with Multiple CPUs and GPUs
20
作者 Huadong Xiao Yang Lu +1 位作者 Jianqiang Huang Wei Xue 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2022年第1期164-173,共10页
A moisture advection scheme is an essential module of a numerical weather/climate model representing the horizontal transport of water vapor.The Piecewise Rational Method(PRM) scalar advection scheme in the Global/Reg... A moisture advection scheme is an essential module of a numerical weather/climate model representing the horizontal transport of water vapor.The Piecewise Rational Method(PRM) scalar advection scheme in the Global/Regional Assimilation and Prediction System(GRAPES) solves the moisture flux advection equation based on PRM.Computation of the scalar advection involves boundary exchange,and computation of higher bandwidth requirements is complicated and time-consuming in GRAPES.Recently,Graphics Processing Units(GPUs) have been widely used to solve scientific and engineering computing problems owing to advancements in GPU hardware and related programming models such as CUDA/OpenCL and Open Accelerator(OpenACC).Herein,we present an accelerated PRM scalar advection scheme with Message Passing Interface(MPI) and OpenACC to fully exploit GPUs’ power over a cluster with multiple Central Processing Units(CPUs) and GPUs,together with optimization of various parameters such as minimizing data transfer,memory coalescing,exposing more parallelism,and overlapping computation with data transfers.Results show that about 3.5 times speedup is obtained for the entire model running at medium resolution with double precision when comparing the scheme’s elapsed time on a node with two GPUs(NVIDIA P100) and two 16-core CPUs(Intel Gold 6142).Further,results obtained from experiments of a higher resolution model with multiple GPUs show excellent scalability. 展开更多
关键词 Graphics Processing Unit(GPU)computing Open Accelerator(OpenACC) message passing Interface(MPI) Global/Regional Assimilation and Prediction System(GRAPES) Piecewise Rational Method(PRM)scalar advection scheme
原文传递
上一页 1 2 下一页 到第
使用帮助 返回顶部