REVISTA
Algorithms

TODAS

Redirigiendo al acceso original de articulo en 16 segundos...

Inicio / Algorithms / Vol: 12 Par: 6 (2019) / Artículo

ARTÍCULO

TITULO

Learning Output Reference Model Tracking for Higher-Order Nonlinear Systems with Unknown Dynamics

Mircea-Bogdan Radac and Timotei Lala

Resumen

This work suggests a solution for the output reference model (ORM) tracking control problem, based on approximate dynamic programming. General nonlinear systems are included in a control system (CS) and subjected to state feedback. By linear ORM selection, indirect CS feedback linearization is obtained, leading to favorable linear behavior of the CS. The Value Iteration (VI) algorithm ensures model-free nonlinear state feedback controller learning, without relying on the process dynamics. From linear to nonlinear parameterizations, a reliable approximate VI implementation in continuous state-action spaces depends on several key parameters such as problem dimension, exploration of the state-action space, the state-transitions dataset size, and a suitable selection of the function approximators. Herein, we find that, given a transition sample dataset and a general linear parameterization of the Q-function, the ORM tracking performance obtained with an approximate VI scheme can reach the performance level of a more general implementation using neural networks (NNs). Although the NN-based implementation takes more time to learn due to its higher complexity (more parameters), it is less sensitive to exploration settings, number of transition samples, and to the selected hyper-parameters, hence it is recommending as the de facto practical implementation. Contributions of this work include the following: VI convergence is guaranteed under general function approximators; a case study for a low-order linear system in order to generalize the more complex ORM tracking validation on a real-world nonlinear multivariable aerodynamic process; comparisons with an offline deep deterministic policy gradient solution; implementation details and further discussions on the obtained results.

Palabras claves

approximate dynamic programming - reinforcement learning - data-driven control - model-free control - reference trajectory tracking - output reference model - multivariable control - aerodynamic rotor system - neural networks - learning systems

Acceso

PÁGINAS

pp. 0 - 0

NÚMERO

Volumen: 12 Parte: 6 (2019)

MATERIAS

INGENIERÍA Y CONSTRUCCIÓN CIVIL
TECNOLOGÍA

REVISTAS SIMILARES

Aerospace
Applied Sciences
AI

DOI

https://doi.org/10.3390/a12060121

Artículos similares

Correntropy-Based Constructive One Hidden Layer Neural Network

Acceso

Mojtaba Nayyeri, Modjtaba Rouhani, Hadi Sadoghi Yazdi, Marko M. Mäkelä, Alaleh Maskooki and Yury Nikulin

One of the main disadvantages of the traditional mean square error (MSE)-based constructive networks is their poor performance in the presence of non-Gaussian noises. In this paper, we propose a new incremental constructive network based on the correntro... ver más

Revista: Algorithms

Optimizing Reinforcement Learning Using a Generative Action-Translator Transformer

Acceso

Jiaming Li, Ning Xie and Tingting Zhao

In recent years, with the rapid advancements in Natural Language Processing (NLP) technologies, large models have become widespread. Traditional reinforcement learning algorithms have also started experimenting with language models to optimize training. ... ver más

Revista: Algorithms

Solar Irradiance Forecasting with Natural Language Processing of Cloud Observations and Interpretation of Results with Modified Shapley Additive Explanations

Acceso

Pavel V. Matrenin, Valeriy V. Gamaley, Alexandra I. Khalyasmaa and Alina I. Stepanova

Forecasting the generation of solar power plants (SPPs) requires taking into account meteorological parameters that influence the difference between the solar irradiance at the top of the atmosphere calculated with high accuracy and the solar irradiance ... ver más

Revista: Algorithms

Delving into Causal Discovery in Health-Related Quality of Life Questionnaires

Acceso

Maria Ganopoulou, Efstratios Kontopoulos, Konstantinos Fokianos, Dimitris Koparanis, Lefteris Angelis, Ioannis Kotsianidis and Theodoros Moysiadis

Questionnaires on health-related quality of life (HRQoL) play a crucial role in managing patients by revealing insights into physical, psychological, lifestyle, and social factors affecting well-being. A methodological aspect that has not been adequately... ver más

Revista: Algorithms

ZWNet: A Deep-Learning-Powered Zero-Watermarking Scheme with High Robustness and Discriminability for Images

Acceso

Can Li, Hua Sun, Changhong Wang, Sheng Chen, Xi Liu, Yi Zhang, Na Ren and Deyu Tong

In order to safeguard image copyrights, zero-watermarking technology extracts robust features and generates watermarks without altering the original image. Traditional zero-watermarking methods rely on handcrafted feature descriptors to enhance their per... ver más

Revista: Applied Sciences

Revistas destacadas

Acceso directo a los números publicados en la revista Infrastructures

Infrastructures

Acceso directo a los números publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los números publicados en la revista BiT

Acceso directo a los números publicados en la revista Revista de la Construcción

Revista de la Construcción

Ver todas las revistas