Giter VIP home page Giter VIP logo

1m-agents's Introduction

A preliminary platform for up to 1 million reinforcement learning agents

Our goal is to provide a Multi-Agent Reinforcement Learning plateform for up to 1 million agents.

This plateform is still work in progress, but we have provided two specific settings for demo.

You could see two directories Population Dynamics and Collective Grouping Behaviors, which correspond to the two setting in the paper(when arXiv version is available) respectively.

Dependencies

Population Dynamics setting

Usage

cd Population Dynamics
./train.sh

The log is saved at Population_dynmacis.log.

Collective Grouping Behaviors setting

####Usage cd Collective Grouping Behaviors ./train.sh

The log is saved at Collective_Grouping_Behaviors.log.

You could also open the bash file to change the parameters, the list above is the specific explanations of the parameters.

  • random_seed default=10 The random seed of the random number generator for generating the obstacles.
  • width default=1000 The width of the map.
  • height default=1000 The height of the map.
  • batch_size default=32 The batch size of the process of training.
  • view_args defalut=2500-5-5-0,2500-5-5-1,2500-5-5-2,2500-5-5-3 Define the view size and face direction of the agents. Four number in each item means the number of agents that has these property, the left view size, the front view size and the face direction, where 0 means north in the map, 1 means east, 2 means south and 3 means west.
  • agent_number defalut=10000 The initial number of agents.
  • pig_max_number default=5000 The initial number of prey-pig.
  • rabbit_max_number default=3000 The initial number of prey-rabbit.
  • agent_increase_rate default=0.001 The birth rate of the agent.
  • pig_increaserate default=0.001 The birth rate of the prey-pig.
  • rabbit_increase_rate default=0.001 The birth rate of the prey-rabbit.
  • reward_radius_pig default=7 The reward radius threshold of the prey-pig.
  • reward_radius_rabbit default=2 The reward radius threshold of the prey-rabbit.
  • reward_threshold_pig default=3 The reward threshold of the prey-pig.
  • agent_emb_dim default=5 The dimension of the agent embedding.
  • damage_per_step default=0.01 The decrease health of agent per step.
  • model_name default=DNN The category of the model.
  • model_hidden_size default=32, 32 The units number of each layer.
  • activations default=sigmoid, sigmoid The activation functions of each layer.
  • view_flat_size defalut=32 The input dimension of the neural network, please see the paper for details.
  • num_actions default=9 The number of actions.
  • reward_decay defalut=0.9 The reward decay in the reinforcement learning.
  • savd_dir default=models The model save path.
  • load_dir default=None The model load path.
  • round default=100 The training rounds.
  • time_step defalut=500 The training steps in each round.
  • learning_rate default=0.001 The learning rate of the reinforcement learning.
  • log_file default=log.txt The log file path.

For more details of the parameters, please refer to the papers.

1m-agents's People

Contributors

seawan avatar wnzhang avatar bywbilly avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.