Giter VIP home page Giter VIP logo

StarCraft

Pytorch implementations of the multi-agent reinforcement learning algorithms, including IQL, QMIX, VDN, COMA, QTRAN(both QTRAN-base and QTRAN-alt), MAVEN, CommNet, DyMA-CL, and G2ANet, which are the state of the art MARL algorithms. In addition, because CommNet and G2ANet need an external training algorithm, we provide Central-V and REINFORCE for them to training, you can also combine them with COMA. We trained these algorithms on SMAC, the decentralised micromanagement scenario of StarCraft II.

Corresponding Papers

Requirements

Use pip install -r requirements.txt to install the following requirements:

Acknowledgement

TODO List

  • Add CUDA option
  • DyMA-CL
  • G2ANet
  • MAVEN
  • VBC
  • Other SOTA MARL algorithms
  • Update results on other maps

Quick Start

$ python main.py --map=3m --alg=qmix

Directly run the main.py, then the algorithm will start training on map 3m. Note CommNet and G2ANet need an external training algorithm, so the name of them are like reinforce+commnet or central_v+g2anet, all the algorithms we provide are written in ./common/arguments.py.

If you just want to use this project for demonstration, you should set --evaluate=True --load_model=True.

The running of DyMA-CL is independent from others because it requires different environment settings, so we put it on another project. For more details, please read DyMA-CL documentation.

Result

We independently train these algorithms for 8 times and take the mean of the 8 independent results, and we evaluate them for 20 episodes every 100 training steps. All of the results are saved in ./result. Results on other maps are still in training, we will update them later.

1. Mean Win Rate of 8 Independent Runs on 3m --difficulty=7(VeryHard)

2. Mean Win Rate of 8 Independent Runs on 8m --difficulty=7(VeryHard)

3. Mean Win Rate of 8 Independent Runs on 2s3z --difficulty=7(VeryHard)

Replay

If you want to see the replay, make sure the replay_dir is an absolute path, which can be set in ./common/arguments.py. Then the replays of each evaluation will be saved, you can find them in your path.

huili_phd's Projects

aerial-swarm-robotics icon aerial-swarm-robotics

UE4 Cesium UAV AI, simulation of an intelligent drone pathfinding and large-scale surveillance in a 3D realistic earth-like environment

airsim icon airsim

Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

cdmft icon cdmft

Implementation of the cluster dynamical mean-field theory

closeaircombat icon closeaircombat

An environment based on JSBSIM aimed at one-to-one close air combat.

drqn-tensorflow icon drqn-tensorflow

Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro

exp-mean-field-annealing-iiith icon exp-mean-field-annealing-iiith

This experiment belongs to Artificial Neural Networks lab                                                 Full Name: Weighted matching problem: Deterministic, stochastic and mean-field annealing of an Hopfield model

gmfg-learning icon gmfg-learning

This repository is the official implementation of Learning Graphon Mean Field Games and Approximate Nash Equilibria.

graph_refinement icon graph_refinement

This is official Pytorch implementation of "Graph Refinement based Airway Extraction using Mean-Field Networks and Graph Neural Networks", Raghavendra Selvan et al. 2020

machin icon machin

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

maddpg_mpe icon maddpg_mpe

无需配置MPE环境可直接运行,方便MADDPG学习

magent icon magent

A Platform for Many-agent Reinforcement Learning

magent2 icon magent2

An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments

malib icon malib

A parallel framework for population-based multi-agent reinforcement learning.

marl-sc icon marl-sc

Config files for my GitHub profile.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.