# Reset-Free Reinforcement Learning via Multi-Task Learning

Code for *Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention*. Please see the LICENSE file for details.

Project website: https://sites.google.com/view/mtrf
## Setup

- Clone this repo with pre-populated submodule dependencies:

  ```
  $ git clone --recursive [email protected]:vikashplus/r3l.git
  ```

- Update the submodules:

  ```
  $ cd MTRF
  $ git submodule update --remote
  ```

- Create the conda environment:

  ```
  $ conda env create -f environment.yml
  ```
- Environment creation might complain and ask you to add nvidia-*** to your Python path in `.bashrc`; just follow the instructions given to resolve this.
- Install the remaining dependencies:

  ```
  $ pip install -r requirements.txt
  $ pip install -U git+https://github.com/hartikainen/serializable.git@76516385a3a716ed4a2a9ad877e2d5cbcf18d4e6
  ```

  This repository depends on definitions in this specific `serializable` package.
- Add the `MTRF` repository to your python_path:
  - Option 1: `conda develop MTRF`
  - Option 2: manually add <MTRF_folder_path> to python_path
- Enter the `algorithms` directory and run `pip install -e .` to install `softlearning`.
- Run an example command (see below).
## Example commands

Basket task:

```
$ softlearning run_example_local examples.development --exp-name=replicate_basket_results --algorithm=PhasedSAC --num-samples=1 --trial-gpus=1 --trial-cpus=2 --universe=gym --domain=SawyerDhandInHandDodecahedron --task=BasketPhased-v0 --task-evaluation=BasketPhasedEval-v0 --video-save-frequency=0 --save-training-video-frequency=5 --vision=False --preprocessor-type="None" --checkpoint-frequency=50 --checkpoint-replay-pool=False
```

Bulb task:

```
$ softlearning run_example_local examples.development --exp-name=replicate_bulb_results --algorithm=PhasedSAC --num-samples=1 --trial-gpus=1 --trial-cpus=2 --universe=gym --domain=SawyerDhandInHandDodecahedron --task=BulbPhased-v0 --task-evaluation=BulbPhasedEval-v0 --video-save-frequency=0 --save-training-video-frequency=5 --vision=False --preprocessor-type="None" --checkpoint-frequency=50 --checkpoint-replay-pool=False
```
- Add `export CUDA_VISIBLE_DEVICES="0,1"` in front of the command to specify which GPUs to use.
- Change `--num-samples=X` to run X seeds of the same experiment.
- Change `--trial-gpus=X` to specify X GPUs per trial.
- Find results in `~/ray_results/<universe>/<domain>/<task>/<experiment_name>`.
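Putting the options above together, a hypothetical invocation of the basket experiment pinned to two GPUs and running three seeds might look like the following (only `CUDA_VISIBLE_DEVICES` and `--num-samples` differ from the example command above; the seed count is illustrative):

```shell
# Pin the run to GPUs 0 and 1, launch 3 seeds of the basket
# experiment, and give each trial one GPU and two CPUs.
export CUDA_VISIBLE_DEVICES="0,1"
softlearning run_example_local examples.development \
    --exp-name=replicate_basket_results \
    --algorithm=PhasedSAC \
    --num-samples=3 \
    --trial-gpus=1 \
    --trial-cpus=2 \
    --universe=gym \
    --domain=SawyerDhandInHandDodecahedron \
    --task=BasketPhased-v0 \
    --task-evaluation=BasketPhasedEval-v0 \
    --video-save-frequency=0 \
    --save-training-video-frequency=5 \
    --vision=False \
    --preprocessor-type="None" \
    --checkpoint-frequency=50 \
    --checkpoint-replay-pool=False

# With these values, results for each seed land under:
#   ~/ray_results/gym/SawyerDhandInHandDodecahedron/BasketPhased-v0/replicate_basket_results
```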
## Citation

```
@article{guptaYuZhaoKumar2021reset,
  title={Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention},
  author={Gupta, Abhishek* and Yu, Justin* and Zhao, Tony Z* and Kumar, Vikash* and Rovinsky, Aaron and Xu, Kelvin and Devlin, Thomas and Levine, Sergey},
  journal={International Conference on Robotics and Automation (ICRA)},
  year={2021}
}
```