The imitation_learning from cherrypiesexy

imitation_learning's Issues

MiniHack

I'd like to use your algorithms on simples MiniHack environments (such as the followings: Room, Corridor, River ).

env = gym.make("MiniHack-Room-5x5-v0", observation_keys=("glyphs", "chars", "colors"))
print("########## OBSERVATION SPACE LOOK LIKE THIS ##########")
print(env.observation_space)
print(type(env.observation_space))
print()
print("########## ACTION SPACE LOOK LIKE THIS ##########")
print(env.action_space)
print(type(env.action_space))

OUTPUT:
########## OBSERVATION SPACE LOOK LIKE THIS ##########
Dict(chars:Box([[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]
...
[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]], [[255 255 255 ... 255 255 255]
[255 255 255 ... 255 255 255]
[255 255 255 ... 255 255 255]
...
[255 255 255 ... 255 255 255]
[255 255 255 ... 255 255 255]
[255 255 255 ... 255 255 255]], (21, 79), uint8), colors:Box([[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]
...
[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]], [[15 15 15 ... 15 15 15]
[15 15 15 ... 15 15 15]
[15 15 15 ... 15 15 15]
...
[15 15 15 ... 15 15 15]
[15 15 15 ... 15 15 15]
[15 15 15 ... 15 15 15]], (21, 79), uint8), glyphs:Box([[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]
...
[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]
[0 0 0 ... 0 0 0]], [[5976 5976 5976 ... 5976 5976 5976]
[5976 5976 5976 ... 5976 5976 5976]
[5976 5976 5976 ... 5976 5976 5976]
...
[5976 5976 5976 ... 5976 5976 5976]
[5976 5976 5976 ... 5976 5976 5976]
[5976 5976 5976 ... 5976 5976 5976]], (21, 79), int16))
<class 'gym.spaces.dict.Dict'>

########## ACTION SPACE LOOK LIKE THIS ##########
Discrete(8)
<class 'gym.spaces.discrete.Discrete'>

Process finished with exit code 0

Is this possible considering this obs/action space or your algorithms are not suitable?

[Question] Why should I set action*2 if I want to get the correct output action in GAIL?

Hi, Thank you for the implementation of the RL and IL algorithms, It is especially helpful!
During realizing some of the models, I had a confusion on the size of output_action settings:
For example, If I'm using GAIL, and for generator to output 2 (x,y)value, why should I set the real [ "out_put" *** 2** ] ?

Looking forward to your reply! Thank you!

Image and dict observation spaces support by BCO and GAIL implementations

You mentioned in the README

as: Behavioral Cloning from Observation (BCO) - technique to clone expert behavior into agent using only expert states, BCO (works bad for me, not supported now)

What do you mean by "works bad for me"? Does that specific to your experience OR Does the current BCO library (functions, data structures, etc) not fully functional?
Do the current BCO and GAIL support image and dict observation spaces?

cherrypiesexy / imitation_learning Goto Github PK

imitation_learning's People

Contributors

Stargazers

Watchers

Forkers

imitation_learning's Issues

MiniHack

[Question] Why should I set action*2 if I want to get the correct output action in GAIL?

Image and dict observation spaces support by BCO and GAIL implementations

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent