Xiaoqian Liu's Projects
50 must-know code implementations of data structures and algorithms
Avalanche: an End-to-End Library for Continual Learning.
A comprehensive list of PAPERS, CODEBASES, and DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
A ranked list of awesome machine learning Python libraries. Updated weekly.
Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
PyTorch implementation of various methods for continual learning (XdG, EWC, online EWC, SI, LwF, GR, GR+distill, RtF, ER, A-GEM, iCaRL).
Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Python package built to ease deep learning on graphs, on top of existing DL frameworks.
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotic environments.
Config files for my GitHub profile.
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
PyCIL: A Python Toolbox for Class-Incremental Learning
Python Multi-Agent Reinforcement Learning framework
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, from Google DeepMind
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, TensorFlow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Collection of reinforcement learning algorithms
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021 (spotlight)