Giter VIP home page Giter VIP logo

modem's Introduction

MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations

Original PyTorch implementation of MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations by

Nicklas Hansen, Yixin Lin, Hao Su, Xiaolong Wang, Vikash Kumar, Aravind Rajeswaran (Meta AI, UC San Diego)

[Paper]โ€ƒ[Website]

Method

Our model-based method, MoDem, solves challenging visuo-motor control tasks with sparse rewards and high-dimensional action spaces in 100K interaction steps given only 5 demonstrations.

Citation

If you use this repo in your research, please consider citing the paper as follows:

@article{hansen2022modem,
  title={MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations},
  author={Nicklas Hansen and Yixin Lin and Hao Su and Xiaolong Wang and Vikash Kumar and Aravind Rajeswaran},
  journal={arXiv preprint},
  year={2022}
}

Instructions

We assume that your machine has a CUDA-enabled GPU, a local copy of MuJoCo 2.1.x installed (required for the Adroit/Meta-World domains), and at least 80GB of memory. Then, create a conda environment with conda env create -f environment.yml, and add /<path>/<to>/<your>/modem/tasks/mj_envs to your PYTHONPATH (required for the Adroit domain). No additional setup required for the DMControl domain. You will also need to configure wandb and your demonstration/logging directories in cfgs/config.yaml. Demonstrations are made available here. Once setup is complete, you should be able to run the following commands.

To train MoDem on a task from Adroit:

python train.py suite=adroit task=adroit-door

To train MoDem on a task from Meta-World:

python train.py suite=mw task=mw-assembly

To train MoDem on a task from DMControl:

python train.py suite=dmcontrol task=quadruped-run

License & Acknowledgements

This codebase is based on the original TD-MPC implementation. MoDem, TD-MPC and Meta-World are licensed under the MIT license. MuJoCo, DeepMind Control Suite, and mj_envs (Adroit) are licensed under the Apache 2.0 license. We thank the DrQv2 authors for their implementation of DMControl wrappers.

modem's People

Contributors

aravindr93 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

modem's Issues

Integraing Algorithm with Different Benchmarks

Hello,

I am interested in integrating this RL algorithm with different benchmark environments. Could you please guide me on whether this is feasible? If yes, what specific changes and additions would I need to implement to achieve this integration successfully?

Thank you.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.