Giter VIP home page Giter VIP logo

Comments (13)

lucidrains avatar lucidrains commented on June 17, 2024 2

@inspirit seems like i forgot a scaling factor too (alpha)

from denoising-diffusion-pytorch.

inspirit avatar inspirit commented on June 17, 2024 1

you are actually doing it different few lines above in training block:

weight, ps = pack_one(self.weight, 'o *')

you pack weight before applying normalisation and then bring it back

from denoising-diffusion-pytorch.

inspirit avatar inspirit commented on June 17, 2024 1

yup looks good to me :)

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on June 17, 2024 1

@inspirit one uncertainty I had was whether the values in the cosine sim attention is also pixel normed, just putting that out there in case you have an opinion

from denoising-diffusion-pytorch.

inspirit avatar inspirit commented on June 17, 2024 1

I'm adopting it to a slightly different use: generator model in adversarial learning
will see ig i can make it learn :)

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on June 17, 2024 1

@inspirit yea for sure.. improvement to a unet should float all applications, from biomedical imaging to diffusion policies, etc. best of luck

from denoising-diffusion-pytorch.

inspirit avatar inspirit commented on June 17, 2024 1

yup, thats exactly my thoughts

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on June 17, 2024
Screen Shot 2024-02-19 at 6 22 06 AM

👋

yes, that's how it is in Algorithm 1. if I am not mistakened?

from denoising-diffusion-pytorch.

inspirit avatar inspirit commented on June 17, 2024

yes, it is different:
you are using 'F.normalize(t, dim = dim, eps = eps)' with default dim = -1

while in example you show above they flatten array starting dim 1 and use this last dimension to compute normalisation

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on June 17, 2024

@inspirit 🤦 oh yes, thank you

could you double check the latest commit?

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on June 17, 2024

@inspirit ok, do let me know how the Karras unet fairs! Tero Karras is in a league of his own in AI research, so highly optimistic on his magnitude preserving layers

from denoising-diffusion-pytorch.

inspirit avatar inspirit commented on June 17, 2024

Hi! getting back to the weight normalisation part :)
as far as i understand in this line we multiply by 'fan_in' scale factor

normed_weight = normed_weight * sqrt(weight.numel() / weight.shape[0])

and then later we divide weights by the same scale factor
weight = normalize_weight(self.weight, eps = self.eps) / sqrt(self.fan_in)

thats basically reverting back so whats the idea?

from denoising-diffusion-pytorch.

inspirit avatar inspirit commented on June 17, 2024

i see from the paper this is intentional for gradient updates in the optimizer

from denoising-diffusion-pytorch.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.