Giter VIP home page Giter VIP logo

fod's Introduction

Hi there 👋

I am pursuing my Ph.D. degree (2023.04 - Present) in Shanghai Jiao Tong University, Shanghai, China. My research direction focuses on how to improve the performance of image and video anomaly detection.

  • 💬 Feel free to send me emails if you have interests.

Xincheng Yao's github stats

fod's People

Contributors

xcyao00 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

fod's Issues

Validation Set.

Hi, thanks for your great work!
It seems that you directly use the test set as the validation set in code:

FOD/trainer.py

Line 207 in d97b44b

img_auc, pix_auc = self.test(vis=False)
. However, I find there is indeed no explicit validation set in the MvTec dataset. As I am not familiar with the field of anomaly detection, I would like to know if using the test set as the validation set is a common practice for this task.

精度

您好,我跑您的代码精度没有论文里高,大概是这样的

Mean Image-AUC: 0.974334492926727
Mean Pixel-AUC: 0.9817032875158126

想请教下是什么原因呢,因为 backbone 不同吗?

Abnormal training results

Hello, after I replaced the dataset, there were abnormal results. For example, the Pixel AUC increased from 0.72 to 0.8, but the Image AUC value decreased from 0.9 to 0.8. Can you help me answer my questions? thanks

显存不足

我只有6G的显存,我使用resnet18替换wd_resnet50,并把batch_size调为4,输入是256,但是内存占用仍然过高,有什么办法可以训练吗

whether this mod is unsupervised or self-supervised?

Hello, I would like to ask why I don't find the decoder section inside the mod of the code, is there no decoder? And I see that the paper is written with self-supervision inside, may I ask if this paper is based on reconstructed transformer or non-reconstructed transformer? Looking forward to your answer, thank you!

A bug in the project.

I found a bug in the project when I wanted to run the project in cuda:2. “RuntimeError :Expected all tensors to be on the same device, but found at least two devices, cuda:2 and cuda:0!” occurred. On closer inspection, I found that the problem is at pos_embed.cuda() of PositionEmbedding.

询问关于论文中的“patch-wise”

您好,我在阅读您的论文中,对于这句话没法准确翻译:In this paper, we propose a novel AD framework: FOcusthe-Discrepancy (FOD), which can simultaneously spot the patch-wise, intra- and inter-discrepancies of anomalies.尤其是patch-wise这个名词,我想了很久,也没明白什么意思,您能解释下它的正确含义吗?非常感谢

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.