How Are Learned Perception-Based Controllers Impacted by the Limits of Robust Control?

176 0 0.0 ( 0 )

تحميل البحث استخدام كمرجع

نشر من قبل Jingxi Xu

تاريخ النشر 2021

مجال البحث الهندسة المعلوماتية

والبحث باللغة English

تأليف Jingxi Xu - Bruce Lee - Nikolai Matni

قم بزيارة صفحتنا على فيسبوك

‎Shamra Academia - شمرا أكاديميا‎

اسأل ChatGPT حول البحث

الملخص بالعربية الملخص بالإنكليزية

The difficulty of optimal control problems has classically been characterized in terms of system properties such as minimum eigenvalues of controllability/observability gramians. We revisit these characterizations in the context of the increasing popularity of data-driven techniques like reinforcement learning (RL), and in control settings where input observations are high-dimensional images and transition dynamics are unknown. Specifically, we ask: to what extent are quantifiable control and perceptual difficulty metrics of a task predictive of the performance and sample complexity of data-driven controllers? We modulate two different types of partial observability in a cartpole stick-balancing problem -- (i) the height of one visible fixation point on the cartpole, which can be used to tune fundamental limits of performance achievable by any controller, and by (ii) the level of perception noise in the fixation point position inferred from depth or RGB images of the cartpole. In these settings, we empirically study two popular families of controllers: RL and system identification-based $H_infty$ control, using visually estimated system state. Our results show that the fundamental limits of robust control have corresponding implications for the sample-efficiency and performance of learned perception-based controllers. Visit our project website https://jxu.ai/rl-vs-control-web for more information.

قيم البحث

329 - Akshay Mehra , Bhavya Kailkhura , Pin-Yu Chen 2020

Predictions of certifiably robust classifiers remain constant in a neighborhood of a point, making them resilient to test-time attacks with a guarantee. In this work, we present a previously unrecognized threat to robust machine learning models that highlights the importance of training-data quality in achieving high certified adversarial robustness. Specifically, we propose a novel bilevel optimization-based data poisoning attack that degrades the robustness guarantees of certifiably robust classifiers. Unlike other poisoning attacks that reduce the accuracy of the poisoned models on a small set of target points, our attack reduces the average certified radius (ACR) of an entire target class in the dataset. Moreover, our attack is effective even when the victim trains the models from scratch using state-of-the-art robust training methods such as Gaussian data augmentationcite{cohen2019certified}, MACERcite{zhai2020macer}, and SmoothAdvcite{salman2019provably} that achieve high certified adversarial robustness. To make the attack harder to detect, we use clean-label poisoning points with imperceptible distortions. The effectiveness of the proposed method is evaluated by poisoning MNIST and CIFAR10 datasets and training deep neural networks using previously mentioned training methods and certifying the robustness with randomized smoothing. The ACR of the target class, for models trained on generated poison data, can be reduced by more than 30%. Moreover, the poisoned data is transferable to models trained with different training methods and models with different architectures.

التعلم الآلي

Certainty Equivalent Perception-Based Control

284 - Sarah Dean , Benjamin Recht 2020

In order to certify performance and safety, feedback control requires precise characterization of sensor errors. In this paper, we provide guarantees on such feedback systems when sensors are characterized by solving a supervised learning problem. We show a uniform error bound on nonparametric kernel regression under a dynamically-achievable dense sampling scheme. This allows for a finite-time convergence rate on the sub-optimality of using the regressor in closed-loop for waypoint tracking. We demonstrate our results in simulation with simplified unmanned aerial vehicle and autonomous driving examples.

التعلم الآلي التحسين والتحكم التعلم الالي

Guaranteeing Safety of Learned Perception Modules via Measurement-Robust Control Barrier Functions

132 - Sarah Dean , Andrew J. Taylor , Ryan K. Cosner 2020

Modern nonlinear control theory seeks to develop feedback controllers that endow systems with properties such as safety and stability. The guarantees ensured by these controllers often rely on accurate estimates of the system state for determining co ntrol actions. In practice, measurement model uncertainty can lead to error in state estimates that degrades these guarantees. In this paper, we seek to unify techniques from control theory and machine learning to synthesize controllers that achieve safety in the presence of measurement model uncertainty. We define the notion of a Measurement-Robust Control Barrier Function (MR-CBF) as a tool for determining safe control inputs when facing measurement model uncertainty. Furthermore, MR-CBFs are used to inform sampling methodologies for learning-based perception systems and quantify tolerable error in the resulting learned models. We demonstrate the efficacy of MR-CBFs in achieving safety with measurement model uncertainty on a simulated Segway system.

أنظمة وتحكم التعلم الآلي أنظمة وتحكم

How Sensitive are Meta-Learners to Dataset Imbalance?

272 - Mateusz Ochal , Massimiliano Patacchiola , Amos Storkey 2021

Meta-Learning (ML) has proven to be a useful tool for training Few-Shot Learning (FSL) algorithms by exposure to batches of tasks sampled from a meta-dataset. However, the standard training procedure overlooks the dynamic nature of the real-world whe re object classes are likely to occur at different frequencies. While it is generally understood that imbalanced tasks harm the performance of supervised methods, there is no significant research examining the impact of imbalanced meta-datasets on the FSL evaluation task. This study exposes the magnitude and extent of this problem. Our results show that ML methods are more robust against meta-dataset imbalance than imbalance at the task-level with a similar imbalance ratio ($rho<20$), with the effect holding even in long-tail datasets under a larger imbalance ($rho=65$). Overall, these results highlight an implicit strength of ML algorithms, capable of learning generalizable features under dataset imbalance and domain-shift. The code to reproduce the experiments is released under an open-source license.

التعلم الآلي الرؤية الحاسوبية وتمييز الأنماط

Limits of optimal control yields achievable with quantum controllers

390 - Re-Bing Wu , Constantin Brif , Matthew R. James 2014

In quantum optimal control theory, kinematic bounds are the minimum and maximum values of the control objective achievable for any physically realizable system dynamics. For a given initial state of the system, these bounds depend on the nature and s tate of the controller. We consider a general situation where the controlled quantum system is coupled to both an external classical field (referred to as a classical controller) and an auxiliary quantum system (referred to as a quantum controller). In this general situation, the kinematic bound is between the classical kinematic bound (CKB), corresponding to the case when only the classical controller is available, and the quantum kinematic bound (QKB), corresponding to the ultimate physical limit of the objectives value. Specifically, when the control objective is the expectation value of a quantum observable (a Hermitian operator on the systems Hilbert space), the QKBs are the minimum and maximum eigenvalues of this operator. We present, both qualitatively and quantitatively, the necessary and sufficient conditions for surpassing the CKB and reaching the QKB, through the use of a quantum controller. The general conditions are illustrated by examples in which the system and controller are initially in thermal states. The obtained results provide a basis for the design of quantum controllers capable of maximizing the control yield and reaching the ultimate physical limit.

فيزياء الكم