Giter VIP home page Giter VIP logo

ccvc's Introduction

CCVC

This is the official PyTorch implementation of our paper:

Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation In Conference on Computer Vision and Pattern Recognition (CVPR), 2023

Abstract. Semi-supervised semantic segmentation (SSS) has recently gained increasing research interest as it can reduce the requirement for large-scale fully-annotated training data. The current methods often suffer from the confirmation bias from the pseudo-labelling process, which can be alleviated by the co-training framework. The current co-training-based SSS methods rely on hand-crafted perturbations to prevent the different sub-nets from collapsing into each other, but these artificial perturbations cannot lead to the optimal solution. In this work, we propose a new conflict-based cross-view consistency (CCVC) method based on a two-branch co-training framework which aims at enforcing the two sub-nets to learn informative features from irrelevant views. In particular, we first propose a new cross-view consistency (CVC) strategy that encourages the two sub-nets to learn distinct features from the same input by introducing a feature discrepancy loss, while these distinct features are expected to generate consistent prediction scores of the input. The CVC strategy helps to prevent the two sub-nets from stepping into the collapse. In addition, we further propose a conflict-based pseudo-labelling (CPL) method to guarantee the model will learn more useful information from conflicting predictions, which will lead to a stable training process. We validate our new CCVC approach on the SSS benchmark datasets where our method achieves new state-of-the-art performance.

Getting Started

Installation

cd CCVC
conda create -n CCVC python=3.6
conda activate CCVC
pip install -r requirements.txt
conda install pytorch==1.10.1 torchvision==0.11.2 torchaudio==0.10.1 cudatoolkit=11.3 -c pytorch -c conda-forge

Please refer to UniMatch for more implement details

Pretrained Backbone:

ResNet-50 | ResNet-101

├── ./pretrained
    ├── resnet50.pth
    └── resnet101.pth

Dataset:

Please modify the dataset path in configuration files.

The groundtruth mask ids have already been pre-processed. You may use them directly.

├── [Your Pascal Path]
    ├── JPEGImages
    └── SegmentationClass
    
├── [Your Cityscapes Path]
    ├── leftImg8bit
    └── gtFine

Usage

CCVC (without data augmentation)

python CCVC_no_aug.py \
    --config 'configs/pascal.yaml' \
    --backbone 'resnet101' \
    --labeled_id_path 'partitions/pascal/366/labeled.txt' \
    --unlabeled_id_path 'partitions/pascal/366/unlabeled.txt' \
    --save_path 'exp/pascal/366/test/' \
    --load_path 'test_DDP' \
    --nodes 1 \
    --port 4434 \
    --gpus 4 \
    --epochs 40 \
    --batch_size 2 \
    --crop_size 512 \
    --mode_mapping 'else' \
    --mode_confident 'vote_threshold' \
    --conf_threshold 0.9 \
    --use_SPL False \
    --use_con True \
    --use_dis True \
    --use_MLP True \
    --use_norm True \
    --use_dropout True \
    --w_CE 5.0 \
    --w_con 2.0 \
    --w_dis 1.0 \
    --lr_network 10.0 \
    --lr_backbone 1.0

or

CCVC (with data augmentation)

python CCVC_aug.py \
    --config 'configs/pascal.yaml' \
    --backbone 'resnet101' \
    --labeled_id_path 'partitions/pascal/366/labeled.txt' \
    --unlabeled_id_path 'partitions/pascal/366/unlabeled.txt' \
    --save_path 'exp/pascal/366/test/' \
    --load_path 'test_DDP' \
    --nodes 1 \
    --port 4434 \
    --gpus 4 \
    --epochs 40 \
    --batch_size 4 \
    --crop_size 512 \
    --mode_mapping 'else' \
    --mode_confident 'vote_threshold' \
    --conf_threshold 0.9 \
    --use_SPL False \
    --use_MLP True \
    --use_norm True \
    --use_dropout True \
    --w_CE 5.0 \
    --w_con 2.0 \
    --w_dis 1.0 \
    --lr_network 10.0 \
    --lr_backbone 1.0

or

you can directly run the sh files like:

bash ./tools/train.sh

To run with different settings, please modify the above mentioned settings.

If you run the code for more epochs, you may get a better result.

Note that all of our experiments are tested on 4 A6000 GPUs.

Citation

If you find these projects useful, please consider citing:

@article{wang2023conflict,
  title={Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation},
  author={Wang, Zicheng and Zhao, Zhen and Zhou, Luping and Xu, Dong and Xing, Xiaoxia and Kong, Xiangyu},
  journal={arXiv preprint arXiv:2303.01276},
  year={2023}
}

Acknowledgement

We thank AEL, CPS, CutMix-Seg, DeepLabv3Plus, PseudoSeg, PS-MT, SimpleBaseline, U2PL, UniMatch and other relevant works for their amazing open-sourced projects!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.