Giter VIP home page Giter VIP logo

rlcard's Introduction

RLCard: A Toolkit for Reinforcement Learning in Card Games

Logo

Build Status Codacy Badge Coverage Status

RLCard is a toolkit for Reinforcement Learning (RL) in card games. It supports multiple card environments with easy-to-use interfaces. The goal of RLCard is to bridge reinforcement learning and imperfect information games, and push forward the research of reinforcement learning in domains with multiple agents, large state and action space, and sparse reward. RLCard is developed by DATA Lab at Texas A&M University.

Installation

Make sure that you have Python 3.5+ and pip installed. We recommend installing rlcard with pip as follow:

git clone https://github.com/datamllab/rlcard.git
cd rlcard
pip install -e .

Or you can directly install the package with

pip install rlcard

Examples

Please refer to examples/. A short example is as below.

import rlcard
from rlcard.agents.random_agent import RandomAgent

env = rlcard.make('blackjack')
env.set_agents([RandomAgent()])

trajectories, payoffs = env.run()

We also recommend the following toy examples.

Demo

Run examples/leduc_holdem_human.py to play with the pre-trained Leduc Hold'em model:

>> Leduc Hold'em pre-trained model

>> Start a new game!
>> Agent 1 chooses raise

=============== Community Card ===============
┌─────────┐
│░░░░░░░░░│
│░░░░░░░░░│
│░░░░░░░░░│
│░░░░░░░░░│
│░░░░░░░░░│
│░░░░░░░░░│
│░░░░░░░░░│
└─────────┘
===============   Your Hand    ===============
┌─────────┐
│J        │
│         │
│         │
│    ♥    │
│         │
│         │
│        J│
└─────────┘
===============     Chips      ===============
Yours:   +
Agent 1: +++
=========== Actions You Can Choose ===========
0: call, 1: raise, 2: fold

>> You choose action (integer):

Documents

Please refer to the Documents for general introductions. API documents are available at our website.

Available Environments

We provide a complexity estimation for the games on several aspects. InfoSet Number: the number of information sets; Avg. InfoSet Size: the average number of states in a single information set; Action Size: the size of the action space. Name: the name that should be passed to env.make to create the game environment.

Game InfoSet Number Avg. InfoSet Size Action Size Name Status
Blackjack (wiki, baike) 10^3 10^1 10^0 blackjack Available
Leduc Hold’em 10^2 10^2 10^0 leduc-holdem Available
Limit Texas Hold'em (wiki, baike) 10^14 10^3 10^0 limit-holdem Available
Dou Dizhu (wiki, baike) 10^53 ~ 10^83 10^23 10^4 doudizhu Available
Mahjong (wiki, baike) 10^121 10^48 10^2 mahjong Available
No-limit Texas Hold'em (wiki, baike) 10^162 10^3 10^4 no-limit-holdem Available
UNO (wiki, baike) 10^163 10^10 10^1 uno Available
Sheng Ji (wiki, baike) 10^157 ~ 10^165 10^61 10^11 - Developing

Evaluation

The perfomance is measured by winning rates through tournaments. Example outputs are as follows: Learning Curves

Cite this work

@misc{zha2019rlcard,
    title={RLCard: A Toolkit for Reinforcement Learning in Card Games},
    author={Daochen Zha and Kwei-Herng Lai and Yuanpu Cao and Songyi Huang and Ruzhe Wei and Junyu Guo and Xia Hu},
    year={2019},
    eprint={1910.04376},
    archivePrefix={arXiv},
    primaryClass={cs.AI}
}

Contributing

Contribution to this project is greatly appreciated! Please create an issue for feedbacks/bugs. If you want to contribute codes, please refer to Contributing Guide.

Acknowledgements

We would like to thank JJ World Network Technology Co.,LTD for the generous support.

rlcard's People

Contributors

andrewnc avatar caoyuanpu avatar daochenzha avatar hsywhu avatar junyuguo avatar lhenry15 avatar ruzhwei avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.