期刊文献+
共找到9篇文章
< 1 >
每页显示 20 50 100
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications 被引量:4
1
作者 Ding Wang Ning Gao +2 位作者 derong liu Jinna Li Frank L.Lewis 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2024年第1期18-36,共19页
Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and ... Reinforcement learning(RL) has roots in dynamic programming and it is called adaptive/approximate dynamic programming(ADP) within the control community. This paper reviews recent developments in ADP along with RL and its applications to various advanced control fields. First, the background of the development of ADP is described, emphasizing the significance of regulation and tracking control problems. Some effective offline and online algorithms for ADP/adaptive critic control are displayed, where the main results towards discrete-time systems and continuous-time systems are surveyed, respectively.Then, the research progress on adaptive critic control based on the event-triggered framework and under uncertain environment is discussed, respectively, where event-based design, robust stabilization, and game design are reviewed. Moreover, the extensions of ADP for addressing control problems under complex environment attract enormous attention. The ADP architecture is revisited under the perspective of data-driven and RL frameworks,showing how they promote ADP formulation significantly.Finally, several typical control applications with respect to RL and ADP are summarized, particularly in the fields of wastewater treatment processes and power systems, followed by some general prospects for future research. Overall, the comprehensive survey on ADP and RL for advanced control applications has d emonstrated its remarkable potential within the artificial intelligence era. In addition, it also plays a vital role in promoting environmental protection and industrial intelligence. 展开更多
关键词 Adaptive dynamic programming(ADP) advanced control complex environment data-driven control event-triggered design intelligent control neural networks nonlinear systems optimal control reinforcement learning(RL)
下载PDF
Optimal Constrained Self-learning Battery Sequential Management in Microgrid Via Adaptive Dynamic Programming 被引量:16
2
作者 Qinglai Wei derong liu +1 位作者 Yu liu Ruizhuo Song 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2017年第2期168-176,共9页
This paper concerns a novel optimal self-learning battery sequential control scheme for smart home energy systems. The main idea is to use the adaptive dynamic programming U+0028 ADP U+0029 technique to obtain the opt... This paper concerns a novel optimal self-learning battery sequential control scheme for smart home energy systems. The main idea is to use the adaptive dynamic programming U+0028 ADP U+0029 technique to obtain the optimal battery sequential control iteratively. First, the battery energy management system model is established, where the power efficiency of the battery is considered. Next, considering the power constraints of the battery, a new non-quadratic form performance index function is established, which guarantees that the value of the iterative control law cannot exceed the maximum charging/discharging power of the battery to extend the service life of the battery. Then, the convergence properties of the iterative ADP algorithm are analyzed, which guarantees that the iterative value function and the iterative control law both reach the optimums. Finally, simulation and comparison results are given to illustrate the performance of the presented method. © 2017 Chinese Association of Automation. 展开更多
关键词 Adaptive control systems Automation Battery management systems Control theory Electric batteries Energy management Energy management systems Intelligent buildings Iterative methods Number theory Secondary batteries
下载PDF
Residential Energy Scheduling for Variable Weather Solar Energy Based on Adaptive Dynamic Programming 被引量:15
3
作者 derong liu Yancai Xu +1 位作者 Qinglai Wei Xinliang liu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2018年第1期36-46,共11页
The residential energy scheduling of solar energy is an important research area of smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid and renewable ener... The residential energy scheduling of solar energy is an important research area of smart grid. On the demand side, factors such as household loads, storage batteries, the outside public utility grid and renewable energy resources, are combined together as a nonlinear, time-varying, indefinite and complex system, which is difficult to manage or optimize. Many nations have already applied the residential real-time pricing to balance the burden on their grid. In order to enhance electricity efficiency of the residential micro grid, this paper presents an action dependent heuristic dynamic programming(ADHDP) method to solve the residential energy scheduling problem. The highlights of this paper are listed below. First,the weather-type classification is adopted to establish three types of programming models based on the features of the solar energy. In addition, the priorities of different energy resources are set to reduce the loss of electrical energy transmissions.Second, three ADHDP-based neural networks, which can update themselves during applications, are designed to manage the flows of electricity. Third, simulation results show that the proposed scheduling method has effectively reduced the total electricity cost and improved load balancing process. The comparison with the particle swarm optimization algorithm further proves that the present method has a promising effect on energy management to save cost. 展开更多
关键词 Action dependent heuristic dynamic programming adaptive dynamic programming control strategy residential energy management smart grid
下载PDF
Data-based Fault Tolerant Control for Affine Nonlinear Systems Through Particle Swarm Optimized Neural Networks 被引量:15
4
作者 Haowei Lin Bo Zhao +1 位作者 derong liu Cesare Alippi 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2020年第4期954-964,共11页
In this paper, a data-based fault tolerant control(FTC) scheme is investigated for unknown continuous-time(CT)affine nonlinear systems with actuator faults. First, a neural network(NN) identifier based on particle swa... In this paper, a data-based fault tolerant control(FTC) scheme is investigated for unknown continuous-time(CT)affine nonlinear systems with actuator faults. First, a neural network(NN) identifier based on particle swarm optimization(PSO) is constructed to model the unknown system dynamics. By utilizing the estimated system states, the particle swarm optimized critic neural network(PSOCNN) is employed to solve the Hamilton-Jacobi-Bellman equation(HJBE) more efficiently.Then, a data-based FTC scheme, which consists of the NN identifier and the fault compensator, is proposed to achieve actuator fault tolerance. The stability of the closed-loop system under actuator faults is guaranteed by the Lyapunov stability theorem. Finally, simulations are provided to demonstrate the effectiveness of the developed method. 展开更多
关键词 Adaptive dynamic programming(ADP) critic neural network data-based fault tolerant control(FTC) particle swarm optimization(PSO)
下载PDF
Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control 被引量:9
5
作者 Mingming Ha Ding Wang derong liu 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2022年第7期1262-1272,共11页
The core task of tracking control is to make the controlled plant track a desired trajectory.The traditional performance index used in previous studies cannot eliminate completely the tracking error as the number of t... The core task of tracking control is to make the controlled plant track a desired trajectory.The traditional performance index used in previous studies cannot eliminate completely the tracking error as the number of time steps increases.In this paper,a new cost function is introduced to develop the value-iteration-based adaptive critic framework to solve the tracking control problem.Unlike the regulator problem,the iterative value function of tracking control problem cannot be regarded as a Lyapunov function.A novel stability analysis method is developed to guarantee that the tracking error converges to zero.The discounted iterative scheme under the new cost function for the special case of linear systems is elaborated.Finally,the tracking performance of the present scheme is demonstrated by numerical results and compared with those of the traditional approaches. 展开更多
关键词 Adaptive critic design adaptive dynamic programming(ADP) approximate dynamic programming discrete-time nonlinear systems reinforcement learning stability analysis tracking control value iteration(VI)
下载PDF
Guest Editorial for Special Issue on Autonomous Control of Unmanned Aerial Vehicles
6
作者 derong liu Changyin Sun Bin Xian 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI 2015年第1期1-1,共1页
IN recent years,unmanned aerial vehicles(UAVs)have been widely employed in different applications,both military and civilian.Especially,a fast growing civil UAV market is predicted over the next decades.However,most c... IN recent years,unmanned aerial vehicles(UAVs)have been widely employed in different applications,both military and civilian.Especially,a fast growing civil UAV market is predicted over the next decades.However,most currently developed UAVs depend on simple control strategy.They require exact modeling of the UAVs dynamics and are vulnerable to external disturbance.Therefore,there is great 展开更多
关键词 for in ET IS UAV Guest Editorial for Special Issue on Autonomous Control of Unmanned Aerial Vehicles on of
下载PDF
ROS-responsive nanoparticle delivery of ferroptosis inhibitor prodrug to facilitate mesenchymal stem cell-mediated spinal cord injury repair
7
作者 Renshuai Hua Chenxi Zhao +7 位作者 Zhengyu Xu derong liu Wenyuan Shen Wenlu Yuan Yan Li Jun Ma Zhishuo Wang Shiqing Feng 《Bioactive Materials》 SCIE CSCD 2024年第8期438-454,共17页
Spinal cord injury(SCI)is a traumatic condition that results in impaired motor and sensory function.Ferroptosis is one of the main causes of neural cell death and loss of neurological function in the spinal cord,and f... Spinal cord injury(SCI)is a traumatic condition that results in impaired motor and sensory function.Ferroptosis is one of the main causes of neural cell death and loss of neurological function in the spinal cord,and ferroptosis inhibitors are effective in reducing inflammation and repairing SCI.Although human umbilical cord mesenchymal stem cells(Huc-MSCs)can ameliorate inflammatory microenvironments and promote neural regeneration in SCI,their efficacy is greatly limited by the local microenvironment after SCI.Therefore,in this study,we constructed a drug-release nanoparticle system with synergistic Huc-MSCs and ferroptosis inhibitor,in which we anchored Huc-MSCs by a Tz-A6 peptide based on the CD44-targeting sequence,and combined with the reactive oxygen species(ROS)-responsive drug nanocarrier mPEG-b-Lys-BECI-TCO at the other end for SCI repair.Meanwhile,we also modified the classic ferroptosis inhibitor Ferrostatin-1(Fer-1)and synthesized a new prodrug Feborastatin-1(Feb-1).The results showed that this treatment regimen significantly inhibited the ferroptosis and inflammatory response after SCI,and promoted the recovery of neurological function in rats with SCI.This study developed a combination therapy for the treatment of SCI and also provides a new strategy for the construction of a drug-coordinated cell therapy system. 展开更多
关键词 Spinal cord injury Huc-MSCs ROS-Responsive nanoparticles Ferroptosis inhibitor
原文传递
News Keyword Extraction Algorithm Based on Semantic Clustering and Word Graph Model 被引量:9
8
作者 Ao Xiong derong liu +3 位作者 Hongkang Tian Zhengyuan liu Peng Yu Michel Kadoch 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2021年第6期886-893,共8页
The internet is an abundant source of news every day. Thus, efficient algorithms to extract keywords from the text are important to obtain information quickly. However, the precision and recall of mature keyword extra... The internet is an abundant source of news every day. Thus, efficient algorithms to extract keywords from the text are important to obtain information quickly. However, the precision and recall of mature keyword extraction algorithms need improvement. TextRank, which is derived from the PageRank algorithm, uses word graphs to spread the weight of words. The keyword weight propagation in Text Rank focuses only on word frequency. To improve the performance of the algorithm, we propose Semantic Clustering TextRank(SCTR), a semantic clustering news keyword extraction algorithm based on TextRank. Firstly, the word vectors generated by the Bidirectional Encoder Representation from Transformers(BERT) model are used to perform k-means clustering to represent semantic clustering. Then, the clustering results are used to construct a TextRank weight transfer probability matrix. Finally,iterative calculation of word graphs and extraction of keywords are performed. The test target of this experiment is a Chinese news library. The results of the experiment conducted on this text set show that the SCTR algorithm has greater precision, recall, and F1 value than the traditional TextRank and Term Frequency-Inverse Document Frequency(TF-IDF) algorithms. 展开更多
关键词 keyword extraction TextR ank SEMANTICS word vector
原文传递
State of the Art of Adaptive Dynamic Programming and Reinforcement Learning
9
作者 derong liu Mingming Ha Shan Xue 《CAAI Artificial Intelligence Research》 2022年第2期93-110,共18页
This article introduces the state-of-the-art development of adaptive dynamic programming and reinforcement learning(ADPRL).First,algorithms in reinforcement learning(RL)are introduced and their roots in dynamic progra... This article introduces the state-of-the-art development of adaptive dynamic programming and reinforcement learning(ADPRL).First,algorithms in reinforcement learning(RL)are introduced and their roots in dynamic programming are illustrated.Adaptive dynamic programming(ADP)is then introduced following a brief discussion of dynamic programming.Researchers in ADP and RL have enjoyed the fast developments of the past decade from algorithms,to convergence and optimality analyses,and to stability results.Several key steps in the recent theoretical developments of ADPRL are mentioned with some future perspectives.In particular,convergence and optimality results of value iteration and policy iteration are reviewed,followed by an introduction to the most recent results on stability analysis of value iteration algorithms. 展开更多
关键词 adaptive dynamic programming approximate dynamic programming adaptive critic designs neuro-dynamic programming neural dynamic programming reinforcement learning intelligent control learning control optimal control
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部