Giter VIP home page Giter VIP logo

image_classification_sota's People

Contributors

hunto avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

image_classification_sota's Issues

训练效果

您好,想问下提供的resnet或mobilnetv2结构,您有训练测试达到很高的acc吗?还是里面的参数需要自己继续调整。我直接用mbv2训练自己的分类任务,效果不是很好

about the dimension

Hi, thanks for opening the source code. I read the paper, I find you use logits and features before pooling to perform diffusion. but for the logits, I guess the dimension is [B, C] B is the batch size, and C is the class number. This will cause a dimension mismatch in autoencoder, how to solve it. Thanks for your reply.

split global token and image token

Hello,
When you split global token and image token from the input x, shouldn't it be split into [B, :NT, C] and [B, NT: , C]?
But the code in the forward_feature function, it is split from the channel dim for x_glb.

So, assuming x has the shape of [1,3134,64], then global token shape will be [1,8,64] and image token shape will be [1,3136,64].
Please let me know if I am wrong.

[Question] Could I use DIST in RetinaFace?

Could I use DIST in RetinaFace?

RetinaFace have only 2class(face, not face). so Pearson's correlation coefficient seems to be inefficient.

In summary, if the class is small, the dist is inefficient. Especially in the case of binary, it looks more inefficient.
I wonder if the above opinion is correct.

replace the dbb block

Can I replace the DBB module and model with my own reparameterization module and model?

lack edgenn model

Hi, sir, the classification lack edgenn model, can you upload it.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.