Issue I would like to know, how can I avoid zero coefficients in t

It's good to know it <a class="user-mention notranslate" data-hovercard-type="user" da

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Media excluded in optimiser because their coeffients are 0,about facebookexperimental/robyn

Comments (9)

gufengzhou commented on May 21, 2024

Hey, thanks for trying out Robyn. Well we don't directly force the model to not produce 0 coef at all, because we did this before and got issues when there were really weak variables inside that "deserved 0". As you know, you get a set of Pareto output. If all results are not satisfying, I would recommend to run a higher set_trials, e.g. 80, to get more results. Maybe there will be some results that are not reducing your search to 0. Does it make sense?

from robyn.

gibrano commented on May 21, 2024

It's good to know it @gufengzhou, and it make sense, but there are still cases like google brand where it is necessary to protect the brand and assign a budget, in the same way thanks for the feedback.

from robyn.

gufengzhou commented on May 21, 2024

What you can also try is increase the range of theta for brand. You're using 0-.3 now, you can try 0.3-0.9 to see if you get better result there. If that still doesn't work, try further splitting brand into more granular variables. Hope it helps.

from robyn.

Leonelsentana commented on May 21, 2024

Hi @gibrano as Gufeng said, thanks so much for using our code :)
The steps I usually take are:

Increase iterations/trials. If that does not work then:
Increase thetas and/or gammas which will increase the delayed effect in time of the media and increase the spend saturation inflection point. If that does not work then:
Try different train/test split ratios. You may analyze the correlation of the brand channel with the response (e.g sales) for both the train and test split periods. Ideally both periods should have a similar correlation and representation of the brand channel in question.
If none of the above work, try to see if different channel aggregations may help for this case?
Hope it helps and thanks for trying out Robyn!

from robyn.

gufengzhou commented on May 21, 2024

one more thing. I just saw that in the budget allocator, you have the upper contraints as 1.0 --> this means you don't allow any increase of spend. That's why you see unreasonable allocation results. upper constraint should be above 1

from robyn.

gibrano commented on May 21, 2024

Hi @gufengzhou and @Leonelsentana, thank you so much for the follow up you have given to the case. Following your advises, I have increased iterations to 800 and trials to 80, also I have splited the campaigns into more granular variables with hyperparameters:

My results with the Budget Allocator keeps giving me zero in some coefficients:

With max_historical_response:

With max_response_expected_spend:

Something I would expect with lowers and uppers bounds is that the budgets don't go out of bounds. On the other hand, I notice that in the plot of Response curve and mean spend by channel, there are positive values or positive budgets for each of the channels, or I don't know if I am misinterpreting it.

I appreciate your help.

from robyn.

gufengzhou commented on May 21, 2024

Generally speaking, Robyn does not guarantee all coefficients to be non-zero. We use ridge regression, a shrinkage method that will also reduce coefs to 0 when the correlation between that feature and dependent variable is "too weak". So there's no guarantee that all media will be non-zero.
Having said that, also with 800 * 80 total iterations, not one of the pareto optimal models is non-zero? If this is the case, one thing you could do is to increase the amount of pareto outputs and "hope" that you can get one non-zero. As you know, I'm outputting 3 pareto fronts by default, which can be increased (although quite hidden). In the .func script line 1439, I define the number of pareto fronts as paretoFronts <- c(1,2,3). You can try to increase to 1-5 or even higher, so that you can get more results. Of course the trade-off is clear: the higher the front number, the worse the model performance in terms of NRMSE and DECOMP.RSSE. But with 800*80 iterations, I can imagine higher fronts might still be quite good. Please try this and let me know if it works.
Regarding the positive budget in response curve plot, well for those 0 coef channels, you will still have a mean spend, but the responses are zero. You can also see that these channels response curves are just an horizontal line.

from robyn.

gufengzhou commented on May 21, 2024

Another thing worth trying is what Leo mentioned as second point: increase the bounds of hyperparameters for the 0 coef channels, esp. theta to 0 - 0.9 and gamma to 0.1 - 1. I see that you changed the thetas to 0.5-1. By increasing the bounds, we don't mean make it higher, but rather give the model more freedom. For gamma, usually we don't recommend to set it too low (allowing inflexion point to happen early). But in your case it might be worth trying. Let us know how it works.

from robyn.

gufengzhou commented on May 21, 2024

Feel free to reopen this if more issues occur

from robyn.

Media excluded in optimiser because their coeffients are 0 about robyn HOT 9 CLOSED

Comments (9)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent