Light

Xihuai Wang photo

xihuai18 Goto Github PK

followers: 62.0 following: 60.0 repos: 35.0 gists: 0.0

Name: Xihuai Wang

Type: User

Company: Shanghai Jiao Tong University

Bio: 😭Youth is paid.

Location: Shanghai, China

Blog: https://xihuai18.github.io/

Hi Here!

I am now focusing on

Reinforcement Learning
Multi-agent System, especially
- Efficiency of Cooperative Multi-agent Reinforcement Learning;
- Zero-shot Generalization Ability in Cooperative Multi-agent Systems.
Language-based(Symbolic) Planning

Xihuai Wang's Projects

2048game

a2po-iclr2023

Codebase for [Order Matters: Agent-by-agent Policy Optimization](https://openreview.net/forum?id=Q-neeWNVv1)

agilerl

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

alphazero-for-othello

(Re)-Implementation of alphazero for othello using pytorch

arxiv-sanity-x

awesome-rl-generalization

A list of papers regarding generalization in (deep) reinforcement learning

c-s-and-p2p-demo

cleanmapg

High-quality single file implementation of Multi-agent Policy Gradient algorithms with research-friendly features (MAPPO, A2PO).

common-cooperative-multi-agent-environments

Commonly-used Cooperative Multi-agent Environments Installation, Convenient Wrappers, and VectorEnv Implementation with PettingZoo (and Gymnasium) Compatibility.

computational-geometry

computer-organization-and-design-review

cpu-single-cycle

data-structure

gfootball-gymnasium-pettingzoo

Google Research Football with gymnasium support.

go-distributed-storage-service

A distributed storage service developed by Golang.

gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

image-processing-in-cuda

Implementation of Image Processing Method

mamujoco-pettingzoo

MaMuJoCo from https://github.com/Farama-Foundation/Gymnasium-Robotics with Convenient Wrappers and Utilities.

marl-comm

Basic MARL algorithms with Communication

multi-cycle-cpu

multi-threaded-queue

openbilibili-go-common

听说这是来自 https://github.com/openbilibili/go-common/ 的 “哔哩哔哩 bilibili 网站后台工程源码”，不过咱也不知道这是啥。

operating-system-project

Project for Operating System Course, Semester 2018 Spring.

overcooked_ai

A benchmark environment for fully cooperative human-AI performance.

pettingzoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

pysc2

StarCraft II Learning Environment

radixsort-cuda

RadixSort using CUDA

recommendation-system-based-on-mpi-openmp

A distributed and parallel recommendation system

reinforcement-learning-notes

Reinforcement Learning Notes for Reinforcement Learning: An Introduction (2nd Edition) and David Silver's Reinforcement Learning Course in UCL

rl-proofs

Some fundamental proofs in Reinforcement Learning.

1
2

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.