
Result: Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators.

Title:

Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems using Online Approximators.

Authors:

Yang, Qinmin¹, Jagannathan, Sarangapani²

Source:

IEEE Transactions on Systems, Man & Cybernetics: Part B. Apr2012, Vol. 42 Issue 2, p377-390. 14p.

Subject Terms:

*APPROXIMATION theory, *DYNAMIC programming, *SIMULATION methods & models, REINFORCEMENT learning, NONLINEAR systems, DISCRETE-time systems, ONLINE algorithms, HEURISTIC algorithms, FEEDBACK control systems, FUZZY logic

Database:

Business Source Premier

More Information

In this paper, reinforcement learning state- and output-feedback-based adaptive critic controller designs are proposed, using online approximators (OLAs), for general multi-input multi-output affine unknown nonlinear discrete-time systems in the presence of bounded disturbances. The proposed controller design has two entities: an action network that is designed to produce an optimal control signal and a critic network that evaluates the performance of the action network. The critic estimates the cost-to-go function and is tuned online using recursive equations derived from heuristic dynamic programming. Here, neural networks (NNs) are used for both the action and the critic networks, although any OLA, such as radial basis functions, splines, or fuzzy logic, can be utilized. For the output-feedback counterpart, an additional NN is designated as the observer to estimate the unavailable system states, and thus the separation principle is not required. The NN weight tuning laws for the controller schemes are also derived while ensuring uniform ultimate boundedness of the closed-loop system using Lyapunov theory. Finally, the effectiveness of the two controllers is tested in simulation on a pendulum balancing system and a two-link robotic arm system. [ABSTRACT FROM AUTHOR]
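
The action/critic structure described in the abstract can be illustrated with a minimal sketch. It is not the paper's tuning laws, which the abstract does not state. The sketch assumes linear-in-the-weights approximators, a single control input, a quadratic control penalty, a discount factor `gamma`, and, unlike the paper's unknown-dynamics setting, a known input gain `g_k` in the action update. All function and parameter names (`critic_update`, `actor_update`, `phi_k`, `sigma_k`, `dJ_dx_next`, `r_weight`) are hypothetical.

```python
from typing import List, Sequence, Tuple


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Inner product of two equally sized vectors."""
    return sum(x * y for x, y in zip(a, b))


def critic_update(
    w_c: Sequence[float],
    phi_k: Sequence[float],
    phi_next: Sequence[float],
    cost: float,
    gamma: float = 0.9,
    lr: float = 0.1,
) -> Tuple[List[float], float]:
    """One heuristic-dynamic-programming-style critic step (illustrative).

    The cost-to-go estimate is J_hat(x) = w_c . phi(x). The weights move
    along phi(x_k) to shrink the temporal-difference residual
    cost + gamma * J_hat(x_{k+1}) - J_hat(x_k).
    """
    td_error = cost + gamma * dot(w_c, phi_next) - dot(w_c, phi_k)
    new_w = [w + lr * td_error * p for w, p in zip(w_c, phi_k)]
    return new_w, td_error


def actor_update(
    w_a: Sequence[float],
    sigma_k: Sequence[float],
    u_k: float,
    dJ_dx_next: float,
    g_k: float,
    r_weight: float = 1.0,
    gamma: float = 0.9,
    lr: float = 0.1,
) -> List[float]:
    """One gradient step for the action weights (illustrative).

    The control is u = w_a . sigma(x). For x_{k+1} = f(x_k) + g(x_k) u_k,
    the gradient of r_weight * u^2 + gamma * J_hat(x_{k+1}) with respect
    to u is 2 * r_weight * u + gamma * dJ_hat/dx(x_{k+1}) * g(x_k).
    ASSUMPTION: g_k is known here; the paper treats unknown dynamics.
    """
    grad_u = 2.0 * r_weight * u_k + gamma * dJ_dx_next * g_k
    return [w - lr * grad_u * s for w, s in zip(w_a, sigma_k)]
```

In a closed loop, one would evaluate the features at the current and next state, call `critic_update` with the observed stage cost, and then call `actor_update` with the critic's gradient at the next state. Stability guarantees such as the uniform ultimate boundedness claimed in the abstract depend on the paper's specific tuning laws and are not implied by this sketch.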

Copyright of IEEE Transactions on Systems, Man & Cybernetics: Part B is the property of IEEE and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
