erlebnisw
Name: Mingzhi Wang
Type: User
Second-place submission in the 2021 MineRL BASALT competition: training Minecraft agents on hard-to-specify tasks from demonstration, using Inverse soft-Q Learning for Imitation.
behavior cloning from observation
PyTorch Behavioral Cloning from Observation for MuJoCo Fetch Environments
Using Clash as a proxy tool on Linux
An environment based on JSBSIM aimed at one-to-one close air combat.
A best practice for deep learning project template architecture.
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
OpenDILab Decision AI Engine
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, and SAC.
Massively Parallel Deep Reinforcement Learning. 🔥
@CVPR2018: Efficient unrolling iterative matrix square-root normalized ConvNets, implemented in PyTorch (with code for B-CNN, compact bilinear pooling, etc.) for training from scratch and fine-tuning.
Fictitious Cross-Play
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
We perform functional grounding of LLMs' knowledge in BabyAI-Text
Official implementation of HARL algorithms based on PyTorch.
Imitation learning algorithms
PyTorch implementation of GAIL, VAIL, AIRL, VAIRL, EAIRL, and SQIL
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
ISP-reID
Notebooks for the O'Reilly book "Learning Ray"
Concise PyTorch implementations of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX, and VDN.
Lightweight version of MAPPO to help you quickly migrate to your local environment.
Fine-tune LLM agents with online reinforcement learning
This is a repository for the course Machine Learning delivered by Andrew Ng.
A parallel framework for population-based multi-agent reinforcement learning.
This is the official implementation of Multi-Agent PPO (MAPPO).
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)