apocalypsex Goto Github PK
Name: Ruiqi Xue
Type: User
Company: NJU
Location: Nanjing
Name: Ruiqi Xue
Type: User
Company: NJU
Location: Nanjing
Author's PyTorch implementation of BCQ for continuous and discrete actions
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC
Code for AAMAS 2024 "Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation"
Constrained Policy Optimization
Code for conservative Q-learning
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Mastering Diverse Domains through World Models
DYNAIL: Dynamics Adapted Imitation Learning
An extension of the PyMARL codebase that includes additional algorithms and environment support
OpenAI Gym Style Tic-Tac-Toe Environment
[NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
A pytorch reprelication of the model-based reinforcement learning algorithm MBPO
Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents"
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
mujoco-circle environment for safe RL
A collection of offline reinforcement learning algorithms. This is a mirror repo from https://agit.ai/Polixir/OfflineRL
An elegant PyTorch offline reinforcement learning library for researchers.
For the experiment of offline safe algorithms
Benchmarked implementations of Offline Multi-Agent RL Algorithms based on PyMARL codebase.
OmniSafe is an infrastructural framework for accelerating SafeRL research.
Reinforcement learning and planning for Minecraft.
Implementation of Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones.
Safety-Gymnaisum is a highly scalable and customizable safe reinforcement learning environment library.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.