ترغب بنشر مسار تعليمي؟ اضغط هنا

Feedback from Pixels: Output Regulation via Learning-Based Scene View Synthesis

248   0   0.0 ( 0 )
 نشر من قبل Murad Abu-Khalaf
 تاريخ النشر 2021
والبحث باللغة English




اسأل ChatGPT حول البحث

We propose a novel controller synthesis involving feedback from pixels, whereby the measurement is a high dimensional signal representing a pixelated image with Red-Green-Blue (RGB) values. The approach neither requires feature extraction, nor object detection, nor visual correspondence. The control policy does not involve the estimation of states or similar latent representations. Instead, tracking is achieved directly in image space, with a model of the reference signal embedded as required by the internal model principle. The reference signal is generated by a neural network with learning-based scene view synthesis capabilities. Our approach does not require an end-to-end learning of a pixel-to-action control policy. The approach is applied to a motion control problem, namely the longitudinal dynamics of a car-following problem. We show how this approach lend itself to a tractable stability analysis with associated bounds critical to establishing trustworthiness and interpretability of the closed-loop dynamics.

قيم البحث

اقرأ أيضاً

86 - Mahmoud Khaled , Kuize Zhang , 2020
Symbolic control is a an abstraction-based controller synthesis approach that provides, algorithmically, certifiable-by-construction controllers for cyber-physical systems. Current methodologies of symbolic control usually assume that full-state info rmation is available. This is not suitable for many real-world applications with partially-observable states or output information. This article introduces a framework for output-feedback symbolic control. We propose relations between original systems and their symbolic models based on outputs. They enable designing symbolic controllers and refining them to enforce complex requirements on original systems. To demonstrate the effectiveness of the proposed framework, we provide three different methodologies. They are applicable to a wide range of linear and nonlinear systems, and support general logic specifications.
323 - Daniele Astolfi 2021
In linear systems theory its a well known fact that a regulator given by the cascade of an oscillatory dynamics, driven by some regulated variables, and of a stabiliser stabilising the cascade of the plant and of the oscillators has the ability of bl ocking on the steady state of the regulated variables any harmonics matched with the ones of the oscillators. This is the well-celebrated internal model principle. In this paper we are interested to follow the same design route for a controlled plant that is a nonlinear and periodic system with period T : we add a bunch of linear oscillators, embedding n o harmonics that are multiple of 2$pi$/T , driven by a regulated variable of the nonlinear system, we look for a stabiliser for the nonlinear cascade of the plant and the oscillators, and we study the asymptotic properties of the resulting closedloop regulated variable. In this framework the contributions of the paper are multiple: for specific class of minimum-phase systems we present a systematic way of designing a stabiliser, which is uniform with respect to n o , by using a mix of high-gain and forwarding techniques; we prove that the resulting closed-loop system has a periodic steady state with period T with a domain of attraction not shrinking with n o ; similarly to the linear case, we also show that the spectrum of the steady state closed-loop regulated variable does not contain the n harmonics embedded in the bunch of oscillators and that the L 2 norm of the regulated variable is a monotonically decreasing function of n o. The results are robust, namely the asymptotic properties on the regulated variable hold also in presence of any uncertainties in the controlled plant not destroying closed-loop stability.
This study considers the problem of periodic event-triggered (PET) cooperative output regulation for a class of linear multi-agent systems. The advantage of the PET output regulation is that the data transmission and triggered condition are only need ed to be monitored at discrete sampling instants. It is assumed that only a small number of agents can have access to the system matrix and states of the leader. Meanwhile, the PET mechanism is considered not only in the communication between various agents, but also in the sensor-to-controller and controller-to-actuator transmission channels for each agent. The above problem set-up will bring some challenges to the controller design and stability analysis. Based on a novel PET distributed observer, a PET dynamic output feedback control method is developed for each follower. Compared with the existing works, our method can naturally exclude the Zeno behavior, and the inter-event time becomes multiples of the sampling period. Furthermore, for every follower, the minimum inter-event time can be determined textit{a prior}, and computed directly without the knowledge of the leader information. An example is given to verify and illustrate the effectiveness of the new design scheme.
The suspension regulation is critical to the operation of medium-low-speed maglev trains (mlsMTs). Due to uncertain environment, strong disturbances and high nonlinearity of the system dynamics, this problem cannot be well solved by most of the model -based controllers. In this paper, we propose a model-free controller by reformulating it as a continuous-state, continuous-action Markov decision process (MDP) with unknown transition probabilities. With the deterministic policy gradient and neural network approximation, we design reinforcement learning (RL) algorithms to solve the MDP and obtain a state-feedback controller by using sampled data from the suspension system. To further improve its performance, we adopt a double Q-learning scheme for learning the regulation controller. We illustrate that the proposed controllers outperform the existing PID controller with a real dataset from the mlsMT in Changsha, China and is even comparable to model-based controllers, which assume that the complete information of the model is known, via simulations.
This paper provides a protocol to address the robust output feedback consensus problem for networked heterogeneous nonlinear negative-imaginary (NI) systems with free body dynamics. We extend the definition of nonlinear NI systems to allow for system s with free body motion. A new stability result is developed for the interconnection of a nonlinear NI system and a nonlinear output strictly negative-imaginary (OSNI) system. Also, a class of networked nonlinear OSNI controllers is proposed to achieve output feedback consensus for heterogeneous networked nonlinear NI systems. We show that in this control framework, the system outputs converge to the same limit trajectory. This consensus protocol is illustrated by a numerical example.
التعليقات
جاري جلب التعليقات جاري جلب التعليقات
سجل دخول لتتمكن من متابعة معايير البحث التي قمت باختيارها
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا