heyuanmingong Goto Github PK

followers: 35 following: 13 repos: 28 gists: 0

Name: Zhi Wang

Type: User

Company: Nanjing University

Bio: An alchemist on reinforcement learning, an academic laborer in Heyuan.

Location: China

Blog: github.com/NJU-RL

heyuanmingong.github.io

Personal Homepage https://heyuanmingong.github.io

Zhi Wang's Projects

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

attention-learn-to-route

Attention based model for learning to solve different routing problems

ddpm2

This may be the simplest implementation of DDPM. You can run Main.py directly to train the UNet on the CIFAR-10 dataset and watch the denoising process.

decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in PyTorch

dgr

PyTorch implementation of "Continual Learning with Deep Generative Replay", NIPS 2017

diayn-pytorch

Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.

diffuser

Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"

gan

PyTorch implementations of Generative Adversarial Networks.

irl

Code for "Incremental reinforcement learning"

irl_cs

Code for "Incremental Reinforcement Learning in Continuous Spaces"

iwies

Code for "Instance Weighted Incremental Evolution Strategies (IW-IES)"

llirl

Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"

pearl

Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)

ppo-pytorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

pytorch-soft-actor-critic

PyTorch implementation of soft actor critic

random-network-distillation-pytorch

Random Network Distillation in PyTorch

sfbc

Code accompanying the paper "Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling" (ICLR 2023) https://arxiv.org/abs/2209.14548

sllrl

Code for "Scalable lifelong reinforcement learning"

Online portfolio selection is a fundamental problem in computational finance that has been studied extensively across several research communities, including finance, statistics, artificial intelligence, machine learning, and data mining. This article aims to provide a comprehensive survey and a structural understanding of published online portfolio selection techniques.

varibad

Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)

vib-pytorch

PyTorch implementation of Deep Variational Information Bottleneck
