Giter VIP home page Giter VIP logo

Comments (6)

jantic avatar jantic commented on August 15, 2024

I actually tried the original wgan, wgan-gp and even tried the consistency penalty from "improving the improved training of Wasserstein gans". And I mean...really really tried. Desperately. For 6 weeks. Because I kept getting "almost good" results that kept diverging after initially looking promising. It drove me nuts, because I kept thinking "this should work!!!" I clearly am missing something on that. Anyway- I just casually plugged in the Self-Attention GAN stuff after all that frustration, and it just worked. The first time. It was amazing.

As far as it being "unsupervised"- I guess technically you'd call it both unsupervised and supervised? Not really sure how to define it honestly. Probably sounds stupid but...yeah....

from deoldify.

zhangrong1722 avatar zhangrong1722 commented on August 15, 2024

Oh,I'm sorry.Regarding as being unsupervised,I want to know whether groundtruth is used during training.

from deoldify.

jantic avatar jantic commented on August 15, 2024

So what it does is strictly supervised learning when it comes to training the generator on perceptual loss- that is, the loss that encourages the generator to replicate the black and white input image (the target is the color version of the same image). Now the reason why I say I'm fuzzy on definitions here is because the critic portion is looking at real and fake versions and being asked to assign a score to each for "realism", but it's not actually being told that the fake image and real image should match per se. So all the critic can tell the generator then is that "that's not realistic" and by how much according to a numerical score. But the end result is that taken together this combination drives the generator to create vividly colorful transformations of the input black & white images.

So mostly yes on "ground truth is used" but it's complicated I guess.... I might just be confused myself on definitions though honestly...

from deoldify.

zhangrong1722 avatar zhangrong1722 commented on August 15, 2024

Taking classification problem as an example,given an image of cat,its groundtruth or label is cat.In your work,what I say groundtruth is the color version of the same image given an input black image.So my question is that whether both the black images and their correspondingly colorful version exist in your dataset.

I'm sorry for confusing you for so long.

from deoldify.

jantic avatar jantic commented on August 15, 2024

Yes that's what I meant when I said "that is, the loss that encourages the generator to replicate the black and white input image (the target is the color version of the same image)."

It's just Imagenet photos being converted to gray scale, then the neural net's job is to convert it back to color. But I say it's complicated because that's not the complete picture (see above).

from deoldify.

zhangrong1722 avatar zhangrong1722 commented on August 15, 2024

And how many images are there in your dataset?

from deoldify.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.