Comments (5)

drasmuss commented on June 16, 2024

Yeah, that definitely seems like something that'd be good to have. One challenge I can foresee with the config approach is that I don't think there's an easy way to distinguish between different regularization targets within an Ensemble. E.g., if I do net.config[nengo.Ensemble].l2_regularization, am I regularizing the encoders? biases? output activities?

One possibility is that nengo.Ensemble targets encoders, and nengo.ensemble.Neurons targets output activities. Biases wouldn't be targetable, but they aren't targetable with the Probe approach either (since biases aren't probeable), so we're not any worse off.
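
To make that concrete, the split might look like the following (a hypothetical sketch; the l2_regularization parameter doesn't exist, and configuring it on these classes is an assumption):

net.config[nengo.Ensemble].l2_regularization = 0.001          # regularizes encoders
net.config[nengo.ensemble.Neurons].l2_regularization = 0.001  # regularizes output activities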

Another option would be to use a helper-function approach, instead of the config system. Something like:

with nengo_dl.Simulator(net) as sim:
    # standard loss on the probe we care about
    my_loss = {my_probe: "mse"}
    # helper adds {probe: objective} entries implementing the regularization
    my_loss.update(nengo_dl.utils.l2_regularization(nengo.Ensemble, 0.001))
    sim.train(..., objective=my_loss)

Under the hood this would just be using the Probe approach, but it would automate the creation of the Probes and objective functions. The advantage is that we wouldn't have to add any new logic to nengo_dl; everything would be encapsulated within that helper function.
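
For illustration, a minimal sketch of what such a helper might do internally (the function name, signature, and the objective-function signature here are assumptions, not the actual nengo_dl API):

import nengo
import tensorflow as tf

def l2_regularization(net, weight=0.001):
    # build a {probe: objective} dict that penalizes the probed values
    losses = {}
    with net:
        for ens in net.all_ensembles:
            # probe the quantity to regularize (here, the decoded output)
            p = nengo.Probe(ens)
            # objective applied to the probed values during training;
            # `targets` is ignored since regularization needs no target signal
            losses[p] = lambda outputs, targets: weight * tf.nn.l2_loss(outputs)
    return losses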

The third option would be to go more in-depth, adding functionality directly to the nengo_dl Simulator to support regularization. Something like:

with nengo_dl.Simulator(net) as sim:
    sim.add_l2_regularization(nengo.Ensemble, "bias", 0.001)

The advantage of this approach is that it's more flexible, and would let us do things like target the biases. But I'm definitely a bit reluctant to add new top-level functions to the Simulator like that, as it adds complexity that is directly exposed to new users.

The fourth option would be the most general, allowing users to pass arbitrary Tensors for the objective. For example, you could do something like:

with nengo_dl.Simulator(net) as sim:
    # sum an L2 penalty over every trainable variable in the graph
    reg_loss = tf.reduce_sum([tf.nn.l2_loss(v) for v in tf.trainable_variables()])
    sim.train(..., objective=reg_loss)

That is, rather than us trying to add support for these things directly in nengo_dl, we just let users do whatever they want through TensorFlow, and make it easier to insert that TensorFlow logic into a nengo_dl model. The advantage of this approach is that it is the most flexible (users could do all kinds of things, not just regularization). But it requires users to be more familiar with TensorFlow.
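
For instance, a user could regularize only a subset of the variables, which the other options wouldn't easily support (the name filter below is just an assumption about how the variables happen to be named):

# collect only the variables whose names mention encoders
enc_vars = [v for v in tf.trainable_variables() if "encoders" in v.name]
reg_loss = 0.001 * tf.reduce_sum([tf.nn.l2_loss(v) for v in enc_vars])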

Another consideration: rather than directly specifying the regularization type through the parameter, we could allow users to pass the regularization function they want. E.g.:

net.config[nengo.Connection].regularization = tf.nn.l2_loss

I like this because it means that users can now use the same method for whatever regularization type they want. The disadvantage is that it's a bit more awkward to specify scaling weights:

net.config[nengo.Connection].regularization = lambda x: 0.001 * tf.nn.l2_loss(x)

That doesn't seem toooo bad though?
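
If the inline lambda does start to feel clunky, one tidier alternative would be something like functools.partial (just a sketch of the idea, not part of the proposal):

import functools
import tensorflow as tf

def scaled_l2(x, weight):
    # weighted L2 penalty
    return weight * tf.nn.l2_loss(x)

net.config[nengo.Connection].regularization = functools.partial(scaled_l2, weight=0.001)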

I think my overall inclination would be the net.config[nengo.Connection].regularization = tf.nn.l2_loss approach, to start. I'd keep an eye on it though, and if it isn't meeting our needs, move to one of the more general approaches.

hunse commented on June 16, 2024

arvoelke commented on June 16, 2024

I'm finding that nengo_dl is over-fitting to my training data. Wondering what approach is currently recommended? I'm thinking the easiest solution right now would be to add noise to my training input data?

drasmuss commented on June 16, 2024

There are a lot of different ways to avoid overfitting. Adding more training data is a good approach; perturbing your data with noise is one way to do that, but you can use various data augmentation methods depending on what your data looks like (e.g. shifting, cropping, or just collecting/generating more raw data).

You could also add noise to the activations (firing rates) within your model. The most common way to do this nowadays is through dropout layers. You can set these up so that they are only active during training, so the dropout layers effectively disappear when you're running your network later.

Adding weight regularization can also help avoid overfitting. In practice you'd probably use a combination of all of those techniques, but if I were picking one to start with it would be adding more training data, as that is usually the easiest (it doesn't require any changes to your model).
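
As a starting point, noise-based augmentation can be as simple as the following (a minimal sketch, assuming your training inputs are in a numpy array called train_inputs; the noise scale is just an example value):

import numpy as np

rng = np.random.RandomState(seed=0)
# jitter each input with small Gaussian noise to create extra samples
noisy_inputs = train_inputs + rng.normal(scale=0.1, size=train_inputs.shape)
augmented_inputs = np.concatenate([train_inputs, noisy_inputs], axis=0)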

drasmuss commented on June 16, 2024

Some new tools for regularization were implemented in #73. It's basically the helper-function approach from above, with some modifications. I also added an option to reduce probe memory usage, as @hunse suggested. I'm going to close this for now, but we can definitely re-open it if we find we want to explore one of the other options discussed above.
