摘要
云底高度是地气系统辐射收支以及飞行安全的重要影响因素。介绍了利用FY-4A卫星的数据产品反演云底高度的方法,设计了两种云底高度反演方案:第一种方案先将云划分为卷云(Ci)、高层云(As)、高积云(Ac)、层云/层积云(St/Sc)、积云(Cu)、雨层云(Ns)、深对流云(Dc)和多层云(Multi)等8种云类型,再分别采用独立的集成学习模型反演这8类云的云底高度;第二种方案不区分云的类型,采用统一的集成学习模型反演云底高度。将CloudSat探测的云底高度作为参考值,以129515个样本对两种方案进行评估,结果表明方案一的反演模型效果更好,均方根误差(RMSE)为1304.7 m,平均绝对误差(MAE)为898.4 m,相关系数(R)为0.9214。
Objective Cloud base height(CBH)is a crucial cloud parameter affecting the water cycle and radiation budget of the earth-atmosphere system.Additionally,CBH has a great impact on aviation safety.Low CBH often leads to a decrease in visibility,which poses a great threat to flight safety.Therefore,it is meaningful to acquire accurate CBH for related scientific research and meteorological services.It is valuable but challenging to use satellite passive remote sensing data to retrieve CBH.Some cloud products such as cloud top height(CTH)and cloud optical thickness(COT)are often used in previous research,related to CBH retrieval,from which two ideas to retrieve CBH can be summarized.The first idea employed independent methods to obtain CBH of different types of clouds respectively,and the second one directly retrieves CBH using cloud products of satellites without regarding cloud types.At present,there is no CBH products of FY-4A.Therefore,a CBH retrieval method for FY-4A is introduced in this paper.According to the two ideas mentioned above,two schemes of CBH retrieval are designed,which are compared to find more suitable ideas to retrieve CBH for FY-4A and to provide reference for subsequent development of FY-4A CBH products.Methods A CBH retrieval method based on ensemble learning is proposed in this paper.CTH,COT,and cloud effective radius(CER)from FY-4A are used.Additionally,CBH and cloud types from CloudSat are employed for their widely recognized data quality.First,data of FY-4A and CloudSat are matched spatiotemporally and are divided into training data,validation data,and test data.Second,CBH retrieval models are built based on two ensemble learning algorithms,random forest(RF),and gradient boosting tree(GBT).Two schemes of CBH retrieval are designed in this paper.In the first scheme,matched data are divided into eight types according to the eight cloud types of CloudSat.For each type of cloud,two retrieval models are built based on RF and GBT using training data and validation data through tenfold cross validation. The optimal model is selected according to the models' results on test data. In the second scheme,retrieval models are built without regarding cloud types. Training data of the eight cloud types are combined together.Validation data and test data are processed similarly. The three data sets are used to obtain the RF model and GBT model,and to select the optimal retrieval model. Finally, the optimal scheme and model of CBH retrieval for FY-4A are selectedaccording to the models' performance.Results and Discussions Root mean squared error (RMSE), mean absolute error (MAE), correlation coefficient (R), andmean relative error (MRE) are used to evaluate models' performance. In the first scheme, the GBT model is the optimalretrieval model for Cirrus (Ci), Altostratus (As), and Altostratus (Ac). RF model is the optimal retrieval model forStratus/Stratocumulus (St/Sc), Cumulus (Cu), Nimbostratus (Ns), deep convective cloud (Dc), and multilayer cloud(Multi). In the second scheme, the GBT model is the optimal retrieval model. The models of the two schemes arecompared on test data with 129515 samples. Overall, the retrieval model of the first scheme outperforms that of the secondscheme. Specifically, RMSE of the model in the first scheme is 1304. 7 m. MAE is 898. 3 m, R is 0. 9214, and MRE is63. 93%. For the eight types of clouds, RMSE, MAE, R, and MRE of the model in the first scheme are also superior tothose of the model in the second scheme. Although the first scheme can obtain better results, the retrieval model of thefirst scheme still needs to be improved in the future. For example, the performance of the retrieval model for Dc is not apatch on that of other types of clouds. Additionally, the paper discusses how to apply the proposed method to practice.First, level 1 data (i. e. reflectance and brightness temperature) and level 2 data (i. e. CTH, COT, and CER) of FY-4Acan be used to acquire the eight cloud types according to a cloud type classification model proposed by Yu et al. Second,according to the cloud type classification results, the retrieval models of the first scheme can be adopted to retrieve CBHfor the eight types of clouds respectively.Conclusions CBH is a critical cloud parameter, but there are no CBH products of geostationary meteorological satellitescurrently. Thus, a CBH retrieval method for FY-4A based on ensemble learning is introduced in this paper. Two schemesof CBH retrieval are designed, and corresponding CBH retrieval models are built based on two ensemble learningalgorithms, namely, RF and GBT. Data of CTH, COT, and CER from FY-4A are used in this paper. The first schemeemploys eight independent models to retrieve CBH for eight types of clouds (i. e. Ci, As, Ac, St/Sc, Cu, Ns, Dc, andMulti) respectively. Specifically, for Ci, As, and Ac, the GBT model is used to retrieve CBH. For the other five types ofcloud, the RF model is used to retrieve CBH. The second scheme uses a GBT model to retrieve CBH without regardingcloud types. CBH from CloudSat is used to evaluate the results of the two schemes, and the retrieval model of the firstscheme outperforms that of the second scheme. For the eight types of clouds, the retrieval model of the first scheme alsoobtains better results.
作者
余茁夫
王雅
马烁
艾未华
严卫
Yu Zhuofu;Wang Ya;Ma Shuo;Ai Weihua;Yan Wei(College of Meteorology and Oceanography,National University of Defense Technology,Changsha 410000,Hunan,China;National Satellite Meteorological Center,China Meteorological Administration,Beijing 100081,China)
出处
《光学学报》
EI
CAS
CSCD
北大核心
2023年第6期41-54,共14页
Acta Optica Sinica
基金
国家自然科学基金(41705007)。
关键词
大气光学
云底高度反演
FY-4A
云顶高度
云光学厚度
云粒子有效半径
集成学习
atmospheric optics
cloud base height retrieval
FY-4A
cloud top height
cloud optical thickness
cloud effective radius
ensemble learning