Learning to Play in a Day: F

Hi guys, I have released the code at <a href="https://github.com/ShibiHe/Q-Optimal

Implement optimality tightening about atari HOT 8 OPEN

kaixhin commented on August 21, 2024

Implement optimality tightening

from atari.

Comments (8)

ShibiHe commented on August 21, 2024 2

Hi guys,
I have released the code at https://github.com/ShibiHe/Q-Optimality-Tightening. Please have a look.

Best,
Shibi

from atari.

petrosgk commented on August 21, 2024

I gave it a shot, however I am not sure how the discounted reward R is supposed to be used and I also need to check if future and past k-transitions are valid

https://github.com/petrosgk/Atari/tree/opt-tightening

from atari.

Kaixhin commented on August 21, 2024

Awesome - I'll try and have a look soon or next week! Would you be able to test it to try and replicate one of the results from the paper?

I started on this myself as well, so will see how our implementations compare.

from atari.

Aeroone commented on August 21, 2024

Hi, have you reproduced that optimality tightening results? I have tried some games based on tensorflow and openai gym but the results seem much worse than the papers' results. I am not sure whether I misunderstand something or miss some tricks in the paper. It seems that the paper doesn't include everything about their works.

from atari.

DanielTea commented on August 21, 2024

Does anyone know wether they have published the source code for optimal tightening, from the paper?

from atari.

Aeroone commented on August 21, 2024

No, they haven't published their code as far as I know. The tricks they use are not hard to implement but I can not still achieve their performance.

from atari.

petrosgk commented on August 21, 2024

I have tried implementing optimality tightening (see earlier post) but the results I get are also much worse than the paper's.

from atari.

Kaixhin commented on August 21, 2024

In my experience the smallest details in a paper can be key to reproducing results - and these may be missing or ambiguous. If anyone is reasonably confident in their implementation, you should try contacting one of the authors with specific questions.

from atari.

Recommend Projects

Implement optimality tightening about atari HOT 8 OPEN

Comments (8)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent