Verbindung von approximativer dynamischer Programmierung, prädiktiver Regelung und Dissipativität: Neue adaptive Ansätze zur Regelung nichtlinearer Systeme

2026-03-122026-03-12https://hdl.handle.net/11420/62050Approximative Dynamische Programmierung (ADP), modellprädiktive Regelung (MPC) und Dissipativität sind etablierte Konzepte, die über Optimalitätsbetrachtungen eng miteinander verbunden sind. Tatsächlich gibt es ein ausgereiftes Verständnis der MPC, z.B. deren nominale Stabilität und Performanz. Neben den nominalen Ansätzen gibt es robuste Ansätze, welche jedoch oft inhärent konservativ sind, und stochastische Ansätze, welche einen erheblichen mathematischen Überbau erfordern. Daher sind Ansätze für nichtlineare Systeme mit Modellunsicherheiten, die eine Online-Anpassung von Modellen und/oder Rückkopplungsgesetzen ermöglichen, von wachsendem Interesse. Im MPC-Kontext ist die eng verbundene ADP vielversprechend. Etablierte Analysemethoden für MPC verwenden jedoch häufig obere Schranken für die Cost-to-Go (z.B. bei Tracking MPC mit Endbeschränkungen) oder sie stützen sich auf untere Schranken für die Cost-to-Go (z.B. bei dissipativitätsbasierten Ansätzen). Wir möchten den verfügbaren Methodensatz erweitern, indem Endkosten betrachtet werden, die lediglich eine Näherung der Cost-to-Go darstellen. Konkret werden Ansätze zur adaptiven Steuerung nichtlinearer Systeme entworfen, die ADP, MPC und Dissipativität miteinander verbinden. Wir konzentrieren uns dabei auf zwei Forschungsfragen: i) Wie kann ADP für den Entwurf von MPC-Reglern für nichtlineare Systeme mit Modellunsicherheit genutzt werden? ii) Wie können Dissipativitätskonzepte in ADP ausgenutzt werden? Die Untersuchungen zu i) zielen auf das Verständnis der Modellunsicherheit in ADP und die Analyse der Approximationsfehler ab. Dafür wird das Zusammenspiel von Regelkreisstabilität und Approximationsfehlern und die Auswirkung von Modellfehlern untersucht. Diese Überlegungen bilden die Grundlage für die Analyse der Robustheit der MPC, bei der die Endkosten keine Obergrenze für die Cost-to-Go darstellen, sondern eine bloße Annäherung davon. Zudem wird die Performanz von MPC mit approximierter Cost-to-Go im geschlossenen Regelkreis untersucht. Im Mittelpunkt unserer Untersuchungen zu ii) steht die Beziehung zw. optimaler Steuerung und der Dissipativität offener Systeme. Insbesondere nutzen wir die Verbindungen zw. Speicherfunktionen, die für Dissipativität benötigt werden, und Wertfunktionen für optimale Steuerungsprobleme mitunendlichem Horizont, um Verfahren für dissipativitätsinformierte Online-ADP zu entwickeln. Zu den hilfreichen Beziehungen zwischen Speicher- und Wertefunktionen gehören die Eigenschaften der Gradientenapproximation und die Tatsache, dass die Speicherfunktion eine untere Schranke für die Wertefunktion darstellt. Darüber hinaus kombinieren wir die Erkenntnisse aus i) mit den obigen Ausführungen zur dissipativitätsinformierten Online-Anpassung in einem MPC-Kontext. Unsere Erkenntnisse werden anhand von Fallstudien zur Regelung von Biogasreaktoren validiert, welche aufgrund des wachsenden Bedarfs an erneuerbaren Energien zunehmend in den Fokus der Forschung geraten.Approximate Dynamic Programming (ADP), Model Predictive Control (MPC), and dissipativity are established concepts, which are closely related through optimality considerations. Indeed, there is mature understanding of many aspects of MPC such as, e.g., nominal stability and performance. Beyond nominal considerations, robust designs are often intrinsically conservative and stochastic methods tend to induce a substantial mathematical overhead. Hence, in the face of uncertainty, control approaches for nonlinear systems which allow for online adaptation of models, feedback laws, or both are of growing interest. In the context of MPC, ADP concepts are promising guiding principles for online adaptation due to their close mutual relation. Yet, established analysis techniques for MPC often use upper-bounds on the cost-to-go (e.g. in tracking MPC with terminal ingredients) or they rely on lower-bounds on the cost-to-go (e.g. in dissipativity-based designs). Conceptually, we aim to broaden the scope, i.e., we consider terminal penalties that are mere approximations of the cost-to-go. Specifically, this project explores new approaches to adaptive control of nonlinear systems linking ADP, MPC, and dissipativity. We focus on two fundamental research questions: i) How to leverage ADP for the design of MPC controllers for nonlinear systems subject to model uncertainty? ii) How to exploit dissipativity concepts in ADP? The investigations with respect to i) target the understanding of model uncertainty in ADP and the analysis of approximations errors of ADP. For this, we investigate the interplay of closed-loop stability and approximation errors and the effect of model errors on ADP control policies. In turn, these considerations provide the basis to analyze the robustness of MPC wherein the terminal penalty is not an upper-bound on the cost-to-go but a mere approximation of it. Moreover, we are interested in analyzing the closed-loop performance of MPC with approximated cost-to-go. At the core of our investigations of ii) is the deep relation between optimal control and the dissipativity notion of open systems. Specifically, we leverage the links between storage functions needed for dissipativity and infinite-horizon value functions of optimal control problems to develop novel schemes for dissipativity-informed online ADP. The helpful relations between storage and value functions include gradient approximation properties and the fact that the storage provides a lower bound on the value function. Moreover, we combine the insights of our investigations on i) with the above towards dissipativity-informed online adaptation in MPC. Our findings are validated on case studies from biogas reactors, which is subject to growing research attention due to the increasing need for renewable energy sources.Verbindung von approximativer dynamischer Programmierung, prädiktiver Regelung und Dissipativität: Neue adaptive Ansätze zur Regelung nichtlinearer SystemeLinking Approximate Dynamic Programming, Predictive Control, and Dissipativity: Novel Adaptive Control Approaches for Nonlinear Systems