أوراق بحثية, رسائل ماجستير ودكتوراه منشورة من قبل Yichen Yang

Regional Adversarial Training for Better Robust Generalization

107 - Chuanbiao Song , Yanbo Fan , Yichen Yang 2021

Adversarial training (AT) has been demonstrated as one of the most promising defense methods against various adversarial attacks. To our knowledge, existing AT-based methods usually train with the locally most adversarial perturbed points and treat a ll the perturbed points equally, which may lead to considerably weaker adversarial robust generalization on test data. In this work, we introduce a new adversarial training framework that considers the diversity as well as characteristics of the perturbed points in the vicinity of benign samples. To realize the framework, we propose a Regional Adversarial Training (RAT) defense method that first utilizes the attack path generated by the typical iterative attack method of projected gradient descent (PGD), and constructs an adversarial region based on the attack path. Then, RAT samples diverse perturbed training points efficiently inside this region, and utilizes a distance-aware label smoothing mechanism to capture our intuition that perturbed points at different locations should have different impact on the model performance. Extensive experiments on several benchmark datasets show that RAT consistently makes significant improvement on standard adversarial training (SAT), and exhibits better robust generalization.

الرؤية الحاسوبية وتمييز الأنماط التعلم الآلي

An Approximate Analytical Solution to Knudsen Layers

93 - Ruo Li , Yichen Yang 2021

We apply moment methods to obtaining an approximate analytical solution to Knudsen layers. Based on the hyperbolic regularized moment system for the Boltzmann equation with the Shakhov collision model, we derive a linearized hyperbolic moment system to model the scenario with the Knudsen layer vicinity to a solid wall with Maxwell boundary condition. We find that the reduced system is in an even-odd parity form that the reduced system proves to be well-posed under all accommodation coefficients. We show that the system may capture the temperature jump coefficient and the thermal Knudsen layer well with only a few moments. With the increasing number of moments used, qualitative convergence of the approximate solution is observed.

الفيزياء الرياضية الفيزياء الرياضية

Program Synthesis Guided Reinforcement Learning

113 - Yichen Yang , Jeevana Priya Inala , Osbert Bastani 2021

A key challenge for reinforcement learning is solving long-horizon planning and control problems. Recent work has proposed leveraging programs to help guide the learning algorithm in these settings. However, these approaches impose a high manual burd en on the user since they must provide a guiding program for every new task they seek to achieve. We propose an approach that leverages program synthesis to automatically generate the guiding program. A key challenge is how to handle partially observable environments. We propose model predictive program synthesis, which trains a generative model to predict the unobserved portions of the world, and then synthesizes a program based on samples from this model in a way that is robust to its uncertainty. We evaluate our approach on a set of challenging benchmarks, including a 2D Minecraft-inspired ``craft environment where the agent must perform a complex sequence of subtasks to achieve its goal, a box-world environment that requires abstract reasoning, and a variant of the craft environment where the agent is a MuJoCo Ant. Our approach significantly outperforms several baselines, and performs essentially as well as an oracle that is given an effective program.

الذكاء الاصطناعي

Equality Saturation for Tensor Graph Superoptimization

275 - Yichen Yang , Phitchaya Mangpo Phothilimtha , Yisu Remy Wang 2021

One of the major optimizations employed in deep learning frameworks is graph rewriting. Production frameworks rely on heuristics to decide if rewrite rules should be applied and in which order. Prior research has shown that one can discover more opti mal tensor computation graphs if we search for a better sequence of substitutions instead of relying on heuristics. However, we observe that existing approaches for tensor graph superoptimization both in production and research frameworks apply substitutions in a sequential manner. Such sequential search methods are sensitive to the order in which the substitutions are applied and often only explore a small fragment of the exponential space of equivalent graphs. This paper presents a novel technique for tensor graph superoptimization that employs equality saturation to apply all possible substitutions at once. We show that our approach can find optimized graphs with up to 16% speedup over state-of-the-art, while spending on average 48x less time optimizing.

الذكاء الاصطناعي النظم الموزعة والتوازية والحوسبة العنقودية

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد