أوراق بحثية, رسائل ماجستير ودكتوراه منشورة من قبل Yunfeng Xin

Solving The Lunar Lander Problem under Uncertainty using Reinforcement Learning

46 - Soham Gadgil , Yunfeng Xin , Chengzhe Xu 2020

Reinforcement Learning (RL) is an area of machine learning concerned with enabling an agent to navigate an environment with uncertainty in order to maximize some notion of cumulative long-term reward. In this paper, we implement and analyze two diffe rent RL techniques, Sarsa and Deep QLearning, on OpenAI Gyms LunarLander-v2 environment. We then introduce additional uncertainty to the original problem to test the robustness of the mentioned techniques. With our best models, we are able to achieve average rewards of 170+ with the Sarsa agent and 200+ with the Deep Q-Learning agent on the original problem. We also show that these techniques are able to overcome the additional uncertainities and achieve positive average rewards of 100+ with both agents. We then perform a comparative analysis of the two techniques to conclude which agent peforms better.

التعلم الآلي

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد