Giter VIP home page Giter VIP logo

gym-ddqn-atari's Introduction

CS294-112 HW 3: Q-Learning

Dependencies:

  • Python 3.5
  • Numpy version 1.14.5
  • TensorFlow version 1.10.5
  • MuJoCo version 1.50 and mujoco-py 1.50.1.56
  • OpenAI Gym version 0.10.5
  • seaborn
  • Box2D==2.3.2
  • OpenCV
  • ffmpeg

Before doing anything, first replace gym/envs/box2d/lunar_lander.py with the provided lunar_lander.py file.

The only files that you need to look at are dqn.py and train_ac_f18.py, which you will implement.

See the HW3 PDF for further instructions.

The starter code was based on an implementation of Q-learning for Atari generously provided by Szymon Sidor from OpenAI.

Our Answers

Uğur İpekdüzen was my teammate in this homework.

Question 1

The plot showing the performance of our implementation on the game Pong.

Question 2

We chose learning rate and ran three other settings of this hyperparameter, and plotted all four values on the same graph including the original one (lr=1.0e-4). Our choices are 0.9e-4, 1.1e-4 and 1.5e-4.

As a result lr=1.5e-4 is the best performer among these options in 2.5 million steps.

Note: x axis' values should be multiplied by 1e4

gym-ddqn-atari's People

Contributors

karascr avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.