zhenzhao / imas
[CVPR'23] Instance-specific and Model-adaptive Supervision for Semi-supervised Semantic Segmentation
Home Page: https://arxiv.org/abs/2211.11335
Hi! I see you conduct ablation experiments on intensity-based and CutMix-based augmentations. But I wonder whether there is still an obvious improvement of the designed CutMix-based augmentation strategy over the vanilla CutMix strategy, including in your other CVPR 2023 work, AugSeg. In my own experiments, I find that simply applying vanilla CutMix already yields satisfying performance. Thanks!
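For reference, "vanilla CutMix" as discussed above amounts to sampling a rectangular box and pasting it from one image into another. Below is a minimal, dependency-free sketch; the helper names are hypothetical and this is not this repo's implementation:

```python
import random

def rand_bbox(height, width, lam):
    """Sample a CutMix box whose area is roughly (1 - lam) of the image.

    In the original CutMix paper, `lam` is drawn from a Beta distribution;
    here it is simply passed in.
    """
    cut_ratio = (1.0 - lam) ** 0.5
    cut_h, cut_w = int(height * cut_ratio), int(width * cut_ratio)
    # Sample the box centre uniformly, then clamp the box to the image bounds.
    cy, cx = random.randrange(height), random.randrange(width)
    y1, y2 = max(cy - cut_h // 2, 0), min(cy + cut_h // 2, height)
    x1, x2 = max(cx - cut_w // 2, 0), min(cx + cut_w // 2, width)
    return y1, y2, x1, x2

def cutmix(img_a, img_b, lam):
    """Paste the sampled box from img_b into a copy of img_a.

    Images are nested lists (H x W) to keep the sketch self-contained;
    the same box would also be applied to the (pseudo-)labels.
    """
    mixed = [row[:] for row in img_a]
    y1, y2, x1, x2 = rand_bbox(len(img_a), len(img_a[0]), lam)
    for y in range(y1, y2):
        mixed[y][x1:x2] = img_b[y][x1:x2]
    return mixed
```

Applying the identical box to both mixed images and their pseudo-labels is what keeps the supervision consistent.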
Hello, and thank you for providing this nice method.
I noticed you mentioned having studied the UniMatch framework. iMAS itself is applied in a teacher-student setup, but UniMatch does not introduce a teacher model.
I am a bit confused: can iMAS be applied on top of the UniMatch framework?
I hope to hear back from you, thanks!
Hello! While implementing my own code, I ran into the same problem as U2PL (see this link): if the teacher model is set to eval mode, performance drops severely, so I had to keep it in train mode. However, I see that your code sets the teacher model to eval mode and the performance is still very good. May I ask what you think causes the performance drop when the teacher is in eval mode?
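For context, a frequently cited culprit in such eval-mode drops is the teacher's BatchNorm running statistics: in eval mode the teacher normalises with its stored buffers, which can go stale if only the weights are EMA-updated. A minimal pure-Python sketch of an EMA teacher update follows; the names and the buffer-sync remark are illustrative assumptions, not this repo's code:

```python
def ema_update(teacher_params, student_params, momentum=0.999):
    """Exponential-moving-average update of the teacher weights.

    If the teacher is used in eval mode, its BatchNorm running statistics
    (buffers) must also be kept in sync, e.g. EMA-updated or copied from
    the student; otherwise the eval-mode teacher normalises with stale
    stats and its predictions degrade. Parameters are plain dicts of
    floats to keep the sketch self-contained.
    """
    return {
        name: momentum * teacher_params[name]
        + (1.0 - momentum) * student_params[name]
        for name in teacher_params
    }
```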
Hi! Sorry to bother you here, but I just sent an email to you yesterday and I'm not sure if you have received it. If it's not too much trouble and you happen to see it, could you kindly drop me a reply? Thank you very much!
Hello, I am running the program on a supercomputing platform and requested multiple GPUs, but only one GPU is actually used while the others stay idle. What could be the cause, and how can I fix it?
Hi! In Table 4 of your paper there are two rows of results. The bottom row corresponds to an output stride of 8 (mIoU of 75.2, 78.0, 78.2, and 80.2 for the 1/16, 1/8, 1/4, and 1/2 partitions, respectively). What is the output stride in the other experiment (mIoU of 74.3, 77.4, 78.1, and 79.3 for 1/16, 1/8, 1/4, and 1/2, respectively)?
In the hardness quantification analysis, why is the computed wIoU further multiplied by the high-confidence proportion?
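For readers unfamiliar with the term, a generic class-weighted IoU can be sketched as below. This is a plain illustration of wIoU under assumed per-class weighting, not iMAS's exact hardness formula, and the high-confidence-proportion factor asked about above is deliberately not reproduced here:

```python
def weighted_iou(pred, target, class_weights):
    """Class-weighted IoU between two flat label lists.

    `class_weights` maps class id -> weight (e.g. inverse class frequency).
    A generic sketch only; not the exact formula used in the paper.
    """
    classes = set(target) | set(pred)
    total_w, score = 0.0, 0.0
    for c in classes:
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        w = class_weights.get(c, 1.0)
        if union:
            score += w * inter / union
            total_w += w
    return score / total_w if total_w else 0.0
```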
When I trained using config_semi.yaml of citys_semi372, the accuracy was Best-STU: 70.40/239, Best-EMA: 70.33/239, while the author's log shows Best-STU: 76.94/236, Best-EMA: 77.43/188, which is a large discrepancy.
Due to GPU limitations, the training batch size was set to 8 on a single GPU, so the training setup differs, but should that really make such a large difference?
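One commonly tried remedy when reproducing results at a reduced total batch size is the linear learning-rate scaling rule (learning rate proportional to batch size). This is a general heuristic, not something the iMAS authors prescribe, and it is not guaranteed to close the gap reported above:

```python
def scale_lr(base_lr, base_batch, actual_batch):
    """Linear learning-rate scaling: lr is scaled in proportion to the
    total batch size relative to the original training configuration."""
    return base_lr * actual_batch / base_batch
```

For example, halving the batch from 16 to 8 would halve the base learning rate under this rule.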
Hello! When you use dist.all_gather in your code, do you ever run into the following problems? (1) GPU utilization gets stuck at 100% and the training process hangs; (2) training is unstable, with NCCL-related errors occurring mid-run. Thanks!
Hello, could you provide the config file for VOC? I modified one myself but am not sure whether it is correct.
Hello, from the code it looks like, for the unlabeled data, you only apply weak and strong augmentations and then compute the loss with the original labels, right? But wouldn't the random crop operation break the correspondence between the data and the labels, and thus affect the experimental results?
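For reference, the usual way random cropping preserves the image-label correspondence is to sample the crop offsets once and apply the same offsets to both tensors. A minimal dependency-free sketch with a hypothetical helper, not this repo's transform:

```python
import random

def paired_random_crop(image, label, crop_h, crop_w):
    """Crop image and label with the SAME random offsets.

    Sampling the offsets once and applying them to both preserves the
    pixel-to-label correspondence; sampling separately would break it.
    Inputs are nested lists (H x W) to keep the sketch self-contained.
    """
    h, w = len(image), len(image[0])
    top = random.randrange(h - crop_h + 1)
    left = random.randrange(w - crop_w + 1)
    img_crop = [row[left:left + crop_w] for row in image[top:top + crop_h]]
    lbl_crop = [row[left:left + crop_w] for row in label[top:top + crop_h]]
    return img_crop, lbl_crop
```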
Hello! How many GPUs, and of what type, did you use for this work? Also, I see that you use sliding-window inference on the Cityscapes dataset. My own sliding-window inference (code identical to UniMatch) takes about 3 minutes on 4 V100s, but your log shows only about half a minute. Where might the problem be? Thanks!
Hello, I would like to ask whether the wIoU formula in the paper is one you proposed, or an existing formula you borrowed?
Hello! In this line of code I see you use dist.all_reduce, but generally, once DDP is used, there is no need to aggregate values across GPUs again. What is the purpose of this line?
Hello,
I'm trying to download the ResNet50 pretrained network listed in the ReadMe, but I am receiving this error when clicking the link:
"Sorry, the file you have requested does not exist."
Is there anywhere else I can find this file? I tried downloading ResNet encoder weights from the PyTorch repository, but there seem to be differences between your code and that model file.
Thanks!
Hello, thank you for your excellent work. I have a question about the distributed training: the paper uses a batch size of 16, so with 8 GPUs each card only gets 2 unlabeled images. Does that mean the augmentation can only be performed within those two images, since each GPU only processes its own data?
Hi! I just read the iMAS paper and came across Eq. 8. Theoretically speaking, the harder a sample is, the larger its gamma is. So for hard samples it seems inappropriate to assign a large proportion of strong augmentation, while for easy samples strong augmentation can be added without much concern. I therefore think the correct formula should be A_s <- (1 - gamma) * A_s + gamma * A_w, but the paper writes A_s <- (1 - gamma) * A_w + gamma * A_s. I am confused by this. Could you please explain it? Thanks a lot!
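For concreteness, the quoted Eq. 8 is just a convex combination of the weak and strong augmentation intensities. A minimal sketch under that reading (symbols follow the question above, not the repo's code):

```python
def adaptive_intensity(weak, strong, gamma):
    """Interpolate augmentation intensity as in the quoted form of Eq. 8:
    A_s <- (1 - gamma) * A_w + gamma * A_s.

    With this form, a larger gamma yields a stronger augmentation, so the
    question hinges entirely on whether gamma encodes ease or hardness;
    the interpolation itself is just a convex combination.
    """
    return (1.0 - gamma) * weak + gamma * strong
```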
Hello, when running your code I found that this file has been deleted. Did you move the related implementation into another file? Does the absence of this file affect running the code?
Hi! In iMAS the batch size is set to 16 + 16 = 32, while in most of the baseline works it is set to 8 + 8 = 16. I wonder whether this is a fair comparison, since segmentation is sensitive to the batch size?
I went to the pre-trained ResNet-50 and ResNet-101 links, and it says the files do not exist.
Since we are interested in this paper and would like to verify it, would it be possible to upload the pre-trained models again?
Have you tried not using easy-hard pairs, i.e. just using the labeled images? How does it perform? I don't understand why the easy-hard approach works better; could you explain further? Thanks.