Is there a way you could expose the weight initialization for the MLP in policies use

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

specifying initial weights to mlp policy about rllab HOT 6 CLOSED

rll commented on July 25, 2024

specifying initial weights to mlp policy

from rllab.

Comments (6)

dementrock commented on July 25, 2024

Hi @alexbeloi,

Can you describe some more details? Are you pretraining the entire set of weights or only specific layers?

from rllab.

alexbeloi commented on July 25, 2024

Hi @dementrock,

Thank you for your help. I'm pretraining an entire set of weights (entire MLP network) in a related setting and I would like to initialize the mean network of the GaussianMLPPolicy as using all the weights from all the layers in the pretrained network.

from rllab.

dementrock commented on July 25, 2024

Do you have a proposed interface for doing this? My current feeling is that this would be a bit application-specific, e.g. a different way to do this would be to only pretrain the first few layers, and randomly initialize the rest.

from rllab.

dementrock commented on July 25, 2024

In principle you should be able to write a few lines of code to wrap this functionality, since the policy constructor takes a mean_network parameter, which could be your custom-initialized MLP.

from rllab.

alexbeloi commented on July 25, 2024

~~Unless I'm misunderstanding, it looks to me as though only CategoricalMLPPolicy takes a parameter (prob_network) to initialize its MLP network.~~

~~For example the constructor for GaussianMLPPolicy doesn't have a mean_network or std_network parameter or the like. It does have a std_share_network parameter which doesn't appear to do anything.~~ edit: It looks like I was looking at an older version of the code, I see now that GaussianMLPPolicy takes mean_network and std_network as parameters.

I could write a few lines to as you described, I just didn't want to get stuck behind the upstream master branch. If it's agreeable, I can try to write a minimal refactoring (edit: of MLP class) and make a pull request, then you can see if it is sufficiently clean for your future plans with this project.

from rllab.

alexbeloi commented on July 25, 2024

Actually, I just noticed that you have a GaussianMLPRegressor. I'll just use that to do my supervised training, then transfer its mean/std networks to the GaussianMLPPolicy.

Thanks for your active interaction. Apologies for the sudden inundation of requests from me ;-)

from rllab.

Recommend Projects

specifying initial weights to mlp policy about rllab HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent