Light

huili_phd photo

1229685850 Goto Github PK

followers: 0.0 following: 2.0 repos: 74.0 gists: 0.0

Name: huili_phd

Type: User

StarCraft

Pytorch implementations of the multi-agent reinforcement learning algorithms, including IQL, QMIX, VDN, COMA, QTRAN(both QTRAN-base and QTRAN-alt), MAVEN, CommNet, DyMA-CL, and G2ANet, which are the state of the art MARL algorithms. In addition, because CommNet and G2ANet need an external training algorithm, we provide Central-V and REINFORCE for them to training, you can also combine them with COMA. We trained these algorithms on SMAC, the decentralised micromanagement scenario of StarCraft II.

Corresponding Papers

Requirements

Use pip install -r requirements.txt to install the following requirements:

matplotlib
torch
SMAC
pysc2

Acknowledgement

SMAC
pymarl

TODO List

Quick Start

$ python main.py --map=3m --alg=qmix

Directly run the main.py, then the algorithm will start training on map 3m. Note CommNet and G2ANet need an external training algorithm, so the name of them are like reinforce+commnet or central_v+g2anet, all the algorithms we provide are written in ./common/arguments.py.

If you just want to use this project for demonstration, you should set --evaluate=True --load_model=True.

The running of DyMA-CL is independent from others because it requires different environment settings, so we put it on another project. For more details, please read DyMA-CL documentation.

Result

We independently train these algorithms for 8 times and take the mean of the 8 independent results, and we evaluate them for 20 episodes every 100 training steps. All of the results are saved in ./result. Results on other maps are still in training, we will update them later.

1. Mean Win Rate of 8 Independent Runs on `3m --difficulty=7(VeryHard)`

2. Mean Win Rate of 8 Independent Runs on `8m --difficulty=7(VeryHard)`

3. Mean Win Rate of 8 Independent Runs on `2s3z --difficulty=7(VeryHard)`

Replay

If you want to see the replay, make sure the replay_dir is an absolute path, which can be set in ./common/arguments.py. Then the replays of each evaluation will be saved, you can find them in your path.

huili_phd's Projects

aerial-swarm-robotics

UE4 Cesium UAV AI, simulation of an intelligent drone pathfinding and large-scale surveillance in a 3D realistic earth-like environment

airsim

Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

caim-p6_mf

cbusummer-program-mpe-work

Missile evasion-maddpg-mpe-pytorch

cdmft

Implementation of the cluster dynamical mean-field theory

chaos-in-recurrent-neural-networks-rnns

This is a detailed review of the extended mean-field theory for recurrent networks of rate units and a discussion on Perturbation and chaotic networks.

closeaircombat

An environment based on JSBSIM aimed at one-to-one close air combat.

clustermeanfield.jl

d2d

Resource Allocation and Interference Cancellation in D2D Communication

deep-reinforcement-learning-algorithms-with-pytorch

Clean and robust implementations of Reinforcement Learning algorithms by Pytorch

dgn

DGN Code

discrete_mean_field_game

Experiments on a discrete mean field game model of population dynamics with reinforcement learning

dmfg

dqn-based-uav-3d_path_planer

Realization of UAV's Track Planning in 3D Urban Environment Based on Reinforcement Learning Algorithm(DQN)

drone_swarm_gnn

Use GNNs to control drone swarms in a PyBullet simulation

drqn-tensorflow

Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro

exp-mean-field-annealing-iiith

This experiment belongs to Artificial Neural Networks lab Full Name: Weighted matching problem: Deterministic, stochastic and mean-field annealing of an Hopfield model

flightmare

An Open Flexible Quadrotor Simulator

gmfg-learning

This repository is the official implementation of Learning Graphon Mean Field Games and Approximate Nash Equilibria.

graph_refinement

This is official Pytorch implementation of "Graph Refinement based Airway Extraction using Mean-Field Networks and Graph Neural Networks", Raghavendra Selvan et al. 2020

gym-pybullet-drones

PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control

hamiltonian-mean-field-model

highdimnonlocalmfg

infer-policy-feature

DPIQN对手建模

maca

machin

Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...

maddpg-pytorch

PyTorch Implementation of MADDPG (Lowe et. al. 2017)

maddpg_mpe

无需配置MPE环境可直接运行，方便MADDPG学习

magent

A Platform for Many-agent Reinforcement Learning

magent2

An engine for high performance multi-agent environments with very large numbers of agents, along with a set of reference environments

1
2
3

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.