Comments (1)
Each game needs to train an agent separately.
The dynamics function takes in the actions, and the policy function outputs the actions. Since different environments have different actions and the current agent interacts with one environment at a time, the trained Breakout model cannot transfer to another environment.
from efficientzero.
Related Issues (20)
- EfficientZero high memory consumption / keeps increasing after replay buffer is full HOT 7
- The first selfplay worker uses the same seed for all parallel environments HOT 2
- reproduce results for other environment
- Clarification on the atari environment?
- Envs seem not to work in parallel
- Cannot reproduce Breakout results
- Question about the transform between true reward and value prefix HOT 1
- Question about the index of pad_child_visits_lst in selfplay_worker.py HOT 2
- Question about the effect of state encoding indentity connection in dynamics network HOT 1
- Question about the effect of state encoding indentity connection in dynamics network
- Question about getting zero test score when I try to run EfficientZero on BabyAI grid environment HOT 2
- In reanalyze_worker GPU worker, why prepare policy targets and value targets separately? HOT 2
- WSL2 NVIDIA 3090 or M1 MBP correct environment
- How to use with SLURM
- Question about the effect of discount factor and done mask when calculating the target value?
- Question about the test phase not always running fully HOT 1
- ray warning HOT 1
- Code for continuous action space HOT 1
- EfficientZero V2 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from efficientzero.