Zhi Wang's Projects
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Attention based model for learning to solve different routing problems
CORRO code
This may be the simplest implementation of DDPM. You can run Main.py directly to train the UNet on the CIFAR-10 dataset and watch the denoising process.
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
PyTorch implementation of "Continual Learning with Deep Generative Replay", NIPS 2017
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
PyTorch implementations of Generative Adversarial Networks.
Personal Homepage
Code for "Incremental reinforcement learning"
Code for "Incremental Reinforcement Learning in Continuous Spaces"
Code for "Instance Weighted Incremental Evolution Strategies (IW-IES)"
Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"
MTDiff
Online Portfolio Selection toolbox
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
PyTorch implementation of Soft Actor-Critic
Random Network Distillation in PyTorch
Code accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548
Code for "Scalable lifelong reinforcement learning"
Online portfolio selection is a fundamental problem in computational finance that has been studied extensively across several research communities, including finance, statistics, artificial intelligence, machine learning, and data mining. This article aims to provide a comprehensive survey and a structural understanding of published online portfolio selection techniques.
Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)
PyTorch implementation of Deep Variational Information Bottleneck