Comments (5)
Hello,
This is normal. If you do hyperparameter tuning, you should set policy='MlpPolicy', otherwise you will get the mentioned error: CustomSACPolicy is already custom in terms of the number of layers. It would be nice to change CustomSACPolicy to MlpPolicy but with policy_kwargs="dict(layers=[256, 256])".
from rl-baselines-zoo.
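To illustrate why the combination fails, here is a schematic sketch (plain Python stand-ins, not the real stable-baselines classes): the custom policy already hard-codes layers=[256, 256], and the tuner tries to set layers again through policy_kwargs, so Python sees the same keyword twice. The function names below are hypothetical.

```python
def feed_forward_policy(layers=None):
    """Stand-in for a base policy constructor that accepts a `layers` kwarg."""
    return {"layers": layers if layers is not None else [64, 64]}

def custom_sac_policy(**kwargs):
    """Stand-in for CustomSACPolicy: forwards kwargs but fixes the layers."""
    return feed_forward_policy(layers=[256, 256], **kwargs)

# Plain training works fine:
policy = custom_sac_policy()
print(policy)  # {'layers': [256, 256]}

# Hyperparameter tuning injects its own layers -> duplicate keyword error:
try:
    custom_sac_policy(layers=[64, 64])
except TypeError as err:
    print("tuning fails:", err)
```

With MlpPolicy the layers are not pre-set, so the tuner's policy_kwargs go through cleanly.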
Ok thanks for your very quick reply.
Just a doubt: is it OK then to tune the hyperparameters with policy='MlpPolicy' and then to train the model with CustomSACPolicy? Does it not defeat the purpose of tuning in the first place? I.e., would hyperparameters optimised with one policy also be optimal for another policy?
Does it not defeat the purpose of tuning in the first place? i.e. would hyperparameters optimised with one policy be also optimal for another policy?
If your hyperparameter optimization allows architecture search:
rl-baselines-zoo/utils/hyperparams_opt.py
Lines 245 to 250 in 645ea17
then it does make sense to use policy='MlpPolicy'.
However, if you fix the architecture (by commenting out the lines above), then you can use CustomSACPolicy (or, equivalently, MlpPolicy + policy_kwargs="dict(layers=[256, 256])").
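The architecture-search step referenced above can be sketched roughly as follows: the tuner samples a categorical net_arch choice and maps it to concrete layer sizes that reach MlpPolicy through policy_kwargs. The category names and layer sizes here are assumptions for illustration, not a copy of the zoo's code.

```python
import random

# Hypothetical mapping from a sampled architecture name to layer sizes.
# A real Optuna search would use trial.suggest_categorical instead of
# random.choice; the effect is the same for this sketch.
NET_ARCH = {
    "small": [64, 64],
    "medium": [256, 256],
    "big": [400, 300],
}

def sample_policy_kwargs(rng):
    """Sample an architecture and return it as policy_kwargs for MlpPolicy."""
    choice = rng.choice(sorted(NET_ARCH))
    return {"layers": NET_ARCH[choice]}

rng = random.Random(0)
print(sample_policy_kwargs(rng))
```

If you comment those lines out and instead fix layers=[256, 256], MlpPolicy + policy_kwargs reproduces the fixed architecture of CustomSACPolicy, so hyperparameters tuned that way carry over directly.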
Ok thanks a lot for your help, I'm closing this issue now.