Giter VIP home page Giter VIP logo

Comments (30)

lucidrains avatar lucidrains commented on September 25, 2024 3

@QuantPrincess ok, with this flag, you can do attention across space and time separately (axial attention)

this is my last open source contribution until further notice, good luck!

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024 2

@QuantPrincess yup, similar complexity. can throw that in there too before month's end. i'm planning on just copy pasting my own code and making a few changes lol

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024 2

@QuantPrincess awesome! yea i'll get that done

brings back memories of grad school when i tried to segment kidneys in CT scans with gofai algorithms (our team used watershed segmentation, super sh**ty). oh how far things have come

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024 2

@MaximilienLC unfortunately there is the possibility my open source journey comes to an end soon and i can't get around to that. PR is welcome!

from denoising-diffusion-pytorch.

MaximilienLeClei avatar MaximilienLeClei commented on September 25, 2024 1

I trust you more than I trust myself so I'll gladly take the offer hhh. Working with audio-like signals that cannot be easily converted to spectrograms. Thanks a lot!

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024 1

oh yes audio duh

ok, I'll knock this out tomorrow morning!

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024 1

oh sorry I misread, thought you said you were working with spectrogram. nevermind!

from denoising-diffusion-pytorch.

MaximilienLeClei avatar MaximilienLeClei commented on September 25, 2024 1

btw unrelated to my first question, but while I have you here hhh:

I'm planning to eventually transition to conditional modeling, following the concatenation scheme proposed in Palette & Image Super-Resolution via Iterative Refinement (add extra channels with your conditional info). I'm not super familiar with all the changes that need to be made yet but will eventually (is the concatenation all you need?) Do you think that your lib is flexible enough for me to add this conditioning feature (btw would be happy to propose a PR if I succeed)?

from denoising-diffusion-pytorch.

QuantPrincess avatar QuantPrincess commented on September 25, 2024 1

While we are on the topic... If I wanted to adjust the original Unet to support a 3d input would the adjustments roughly be similar in terms of changing the conv to 3d + some other minor adjustments?

from denoising-diffusion-pytorch.

QuantPrincess avatar QuantPrincess commented on September 25, 2024 1

Amazing! Thank you.

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024 1

@QuantPrincess i can get out a 3d version too, if you aren't doing video stuff. if you are, i'd recommend this

from denoising-diffusion-pytorch.

QuantPrincess avatar QuantPrincess commented on September 25, 2024 1

Thanks so much, really appreciate the recommendation. Not doing video stuff though, just some 3d imaging. I will also try to adjust your code base for 3d in mean time!

from denoising-diffusion-pytorch.

QuantPrincess avatar QuantPrincess commented on September 25, 2024 1

Haha yes CTs! Thanks so much for your help. Your code base is really a pleasure to work with.

from denoising-diffusion-pytorch.

MaximilienLeClei avatar MaximilienLeClei commented on September 25, 2024 1

@MaximilienLC knocked it out just now - let me know if this runs for you

Thanks @lucidrains, will do in the coming days and report back!

btw, you might have missed my latest question

I'm planning to eventually transition to conditional modeling, following the concatenation scheme proposed in Palette & Image Super-Resolution via Iterative Refinement (add extra channels with your conditional info). I'm not super familiar with all the changes that need to be made yet but will eventually (is the concatenation all you need?) Do you think that your lib is flexible enough for me to add this conditioning feature (btw would be happy to propose a PR if I succeed)?

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024 1

@QuantPrincess let me get those in there this weekend, but do feel free to try it out as it is in the meanwhile

from denoising-diffusion-pytorch.

QuantPrincess avatar QuantPrincess commented on September 25, 2024 1

Thank you! I will look at tonight and get back to you. I really can't express enough how awesome your code base is to work with!

from denoising-diffusion-pytorch.

Parskatt avatar Parskatt commented on September 25, 2024 1

@lucidrains see #296

from denoising-diffusion-pytorch.

QuantPrincess avatar QuantPrincess commented on September 25, 2024 1

Thanks so much @lucidrains ! Cant wait to test out your work on the axial attention. Best of luck on your next endeavors!

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024

@MaximilienLC yup, it is quite simple; just replace the mp conv2d with mp conv1d https://github.com/lucidrains/lumiere-pytorch/blob/main/lumiere_pytorch/mp_lumiere.py#L120 + some other adjustments

would you like me to just knock this out for you tomorrow morning?

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024

@MaximilienLC what type of data are you working with?

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024

@MaximilienLC gave this more thought while walking doggo, and i think you need 2d still (to mix adjacent frequency information). i'll make it so the unet2d, you can finely control whether to downsample only the height or the width (and customize the conv2d kernel height/width dimensions too). that should generalize the 1d case

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024

@QuantPrincess what are you using it for? the latest video unets are all pretrained with images on conv2d with conv1d slipped in for time dimension during video training stage (unless if Sora changed all that, haven't read the technical paper yet)

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024

@MaximilienLC knocked it out just now - let me know if this runs for you

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024

@QuantPrincess ohh got it, you working with CT / MRI segmentation? yea i can add 3d for you then tomorrow morning

from denoising-diffusion-pytorch.

QuantPrincess avatar QuantPrincess commented on September 25, 2024

Thank you for this! I will check out today and report back. Really appreciate your help. :)

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024

@QuantPrincess hey! so i realized it still lacks a few features to be usable (factorized attention, and being able to specify downsamples in space vs time separately) CT slices will be a much smaller than the spatial dimensions

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024

@QuantPrincess let me know if this is intuitive

with downsample_types, you can control at each stage whether to downsample image (spatial) or frame (slices) or all (both). you can also control how many MP resnet blocks are at each stage by passing in a tuple of integer into num_blocks_per_stage

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024

@QuantPrincess i can get to the factorized attention mid next week and finish off this issue

from denoising-diffusion-pytorch.

Parskatt avatar Parskatt commented on September 25, 2024

I think there might be some implementation mistake in the 3DUnet, I'm getting exploding activations. I'll see if I can make a simple repro.

from denoising-diffusion-pytorch.

lucidrains avatar lucidrains commented on September 25, 2024

@Parskatt thank you Johan! you beat me to it

from denoising-diffusion-pytorch.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.