Hello, Chelsea. The batch normalization <a href="https://www.tensorf

Batch normalization about maml HOT 8 CLOSED

srpv commented on July 29, 2024

Batch normalization

from maml.

Comments (8)

commented on July 29, 2024

@cbfinn, as you mentioned before,

I compute the test-time statistics using the test batch of data, instead of computing the average training statistics

This seems to be a bit of cheating, especially at test time. In general, we can assume that only one sample is evaluated at a time at test time, in which case there is no way to get proper statistics for batch_norm. This means the test-set performance will partially depend on the batch size.
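To make the batch-size concern concrete, here is a minimal pure-Python sketch (not the repository's code) of normalizing scalar activations with the statistics of the current batch. The function name `batch_norm` and the `eps` value are assumptions for illustration only. With a batch of one, the mean equals the sample and the variance is zero, so the output carries no information about the input.

```python
import math

def batch_norm(batch, eps=1e-5):
    """Normalize scalar activations using the batch's own mean and variance.

    Hypothetical helper for illustration; it omits the learned scale and shift.
    """
    mean = sum(batch) / len(batch)
    var = sum((x - mean) ** 2 for x in batch) / len(batch)
    return [(x - mean) / math.sqrt(var + eps) for x in batch]

# A single sample is normalized against itself, so the result is always 0.
single = batch_norm([3.7])

# The same sample inside a larger batch gets a different, informative value.
in_batch = batch_norm([3.7, 1.0, -2.0, 0.5])

print(single)       # the lone sample collapses to zero
print(in_batch[0])  # positive, because 3.7 is above the batch mean
```

The normalized value of the same input therefore changes with the batch it is evaluated in, which is the dependence on batch size described above.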

cbfinn commented on July 29, 2024

I compute the test-time statistics using the test batch of data, instead of computing the average training statistics. This doesn't require keeping track of batch norm training statistics. [Note that train is always set to True when calling the batch_norm function, which means that tensorflow will compute the statistics using the current batch]

It's possible that it would work better by using training batch statistics, but I haven't tried it.
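The two options mentioned here can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the repository's implementation: the function signature, the `train` flag name, and the default stored statistics (mean 0, variance 1) are all hypothetical stand-ins.

```python
import math

def batch_norm(batch, train, moving_mean=0.0, moving_var=1.0, eps=1e-5):
    """Normalize scalar activations in one of two modes.

    train=True  -> use the mean and variance of the current batch.
    train=False -> use stored (moving-average) training statistics.
    Hypothetical signature for illustration; scale and shift are omitted.
    """
    if train:
        mean = sum(batch) / len(batch)
        var = sum((x - mean) ** 2 for x in batch) / len(batch)
    else:
        mean, var = moving_mean, moving_var
    return [(x - mean) / math.sqrt(var + eps) for x in batch]

test_batch = [2.0, 4.0, 6.0]

# Statistics come from the test batch itself; stored statistics are never read.
out_batch_stats = batch_norm(test_batch, train=True)

# Statistics come from the stored values, so they must have been tracked in training.
out_stored_stats = batch_norm(test_batch, train=False, moving_mean=0.0, moving_var=1.0)

print(out_batch_stats)
print(out_stored_stats)
```

In the first mode nothing has to be tracked during training, which matches the point that the moving statistics are not needed when `train` is always `True`.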

srpv commented on July 29, 2024

That's not the issue I'm talking about. See the note on the tf.contrib.layers.batch_norm page:

Note: when training, the moving_mean and moving_variance need to be updated. By default the update ops are placed in tf.GraphKeys.UPDATE_OPS, so they need to be added as a dependency to the train_op.

In other words, without the steps I described in the issue, moving_mean and moving_variance don't update at all (even during training). Again, maybe I'm missing some other way you're updating them.
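The effect described in the note can be imitated without TensorFlow. The sketch below is a hypothetical pure-Python stand-in: the class `MovingStats`, its `decay` value, and the initial values (mean 0, variance 1) are assumptions for illustration. It only shows that moving averages change when an explicit update step is run, and stay at their initial values otherwise.

```python
class MovingStats:
    """Exponential moving average of batch mean and variance (illustrative only)."""

    def __init__(self, decay=0.9):
        self.decay = decay
        self.mean = 0.0  # assumed initial value
        self.var = 1.0   # assumed initial value

    def update(self, batch):
        """The explicit update step; nothing changes unless this is called."""
        batch_mean = sum(batch) / len(batch)
        batch_var = sum((x - batch_mean) ** 2 for x in batch) / len(batch)
        self.mean = self.decay * self.mean + (1 - self.decay) * batch_mean
        self.var = self.decay * self.var + (1 - self.decay) * batch_var

batch = [4.0, 5.0, 6.0]  # batch mean 5, batch variance 2/3

# Training loop that forgets the update step: statistics never move.
untracked = MovingStats()
for _ in range(50):
    pass  # the forward pass would use batch statistics here; no update is run

# Training loop that runs the update step on every iteration.
tracked = MovingStats()
for _ in range(50):
    tracked.update(batch)

print(untracked.mean, untracked.var)  # still the initial values
print(tracked.mean, tracked.var)      # close to the batch statistics
```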

cbfinn commented on July 29, 2024

You only need to update moving_mean and moving_variance if you use them. In this case, the batch norm statistics are being computed using the batch data rather than a moving average of the statistics (so they don't need to be updated).

srpv commented on July 29, 2024

OK. Indeed, they're needed only during testing.
Thanks.

cbfinn commented on July 29, 2024

If only having access to a single test example is a constraint, you can also use a batch of N-1 training examples with a single test example to compute the statistics. This should perform equivalently, while only using one test example.
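A rough sketch of that suggestion, in plain Python with scalar activations: the statistics are computed over a batch made of N-1 training examples plus the one test example, and only the test example's normalized value is returned. The function name and arguments are hypothetical, and this is not code from the repository.

```python
import math

def normalize_single_test_example(test_x, train_xs, eps=1e-5):
    """Normalize one test activation using a batch padded with training examples.

    Hypothetical helper for illustration; scale and shift are omitted.
    """
    batch = list(train_xs) + [test_x]  # N-1 training examples + 1 test example
    mean = sum(batch) / len(batch)
    var = sum((x - mean) ** 2 for x in batch) / len(batch)
    return (test_x - mean) / math.sqrt(var + eps)

train_xs = [1.0, 2.0, 3.0]  # N-1 = 3 training activations
test_x = 6.0                # the single test activation

out = normalize_single_test_example(test_x, train_xs)
print(out)  # non-zero, unlike normalizing the test example on its own
```

Because the batch now has N elements, the variance is no longer zero and the test example keeps a meaningful normalized value.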

…

On Jan 11, 2018 9:23 PM, "Taesup (TS) Kim" wrote:

@cbfinn, as you mentioned before, I compute the test-time statistics using the test batch of data, instead of computing the average training statistics. This seems to be a bit of cheating especially on test-time. In general, we can assume evaluating only one sample at a time on test-time and then there is no way to get proper statistics for batch_norm.

dragen1860 commented on July 29, 2024

Hi, I see your approach.
If I use a moving average of the statistics by adding the update ops to the train op, do I then need to set train=False when calling the batch_norm function at test time?

cbfinn commented on July 29, 2024

Yes

…

On Thu, Apr 19, 2018, 7:35 PM Jackie Loong wrote:

Hi, I see your approach. If I use moving average of the statistics by adding update_op into train ops, Then *Need I set train=FALSE* when testing use batch_norm function?
