lxqpku Goto Github PK

followers: 8.0 following: 14.0 repos: 25.0 gists: 0.0

Name: Xiaoqian Liu

Type: User

Company: Institute of Automation, Chinese Academy of Sciences

Bio: PH.D. Candidate of Computer Science at the University of Chinese Academy of Sciences.

Twitter: aislinn_liu

Location: Beijing, China

👋 Hi, I’m @Xiaoqian Liu.
👀 I’m interested in foundation models and foundation agents.
🌱 I'm currently a PH.D. candidate of computer science at the University of Chinese Academy of Sciences, and received master degreee at Peking Unveristy.
💞️ I’m looking to collaborate on developing foundation agents that are able to continuously learn and generalize well to various decision-making tasks.
📫 Reach me through email or github！

Xiaoqian Liu's Projects

algo

数据结构和算法必知必会的50个代码实现

avalanche

Avalanche: an End-to-End Library for Continual Learning.

awesome-lm-rl

A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.

best-of-ml-python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

cheatsheets-ai

Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5

continual-learning

PyTorch implementation of various methods for continual learning (XdG, EWC, online EWC, SI, LwF, GR, GR+distill, RtF, ER, A-GEM, iCaRL).

continual_rl

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

dgl

Python package built to ease deep learning on graph, on top of existing DL frameworks.

diffuser

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

diffusion-rl

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

hindsight-experience-replay

This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.

lxqpku

Config files for my GitHub profile.

maml_rl

Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"

nlp-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

paper_writing_tips

pycil

PyCIL: A Python Toolbox for Class-Incremental Learning

pymarl

Python Multi-Agent Reinforcement Learning framework

q-transformer

Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

rlkit

Collection of reinforcement learning algorithms

trajectory-transformer

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

updet

Official Implementation of 'UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers' ICLR 2021(spotlight)

lxqpku Goto Github PK

Xiaoqian Liu's Projects

Recommend Projects

Recommend Topics

Recommend Org