Giter VIP home page Giter VIP logo

xrosliang / minet Goto Github PK

View Code? Open in Web Editor NEW

This project forked from lartpang/minet

0.0 1.0 0.0 1001 KB

CVPR2020, Multi-scale Interactive Network for Salient Object Detection, https://lartpang.github.io/MINet

Home Page: http://openaccess.thecvf.com/content_CVPR_2020/papers/Pang_Multi-Scale_Interactive_Network_for_Salient_Object_Detection_CVPR_2020_paper.pdf

License: MIT License

Python 100.00%

minet's Introduction

Multi-scale Interactive Network for Salient Object Detection

License: MIT

CVPR 2020.

Changelog

The code and experimental results have be released now ๐Ÿ˜„.

  • 2020/6/21: Some useful modifications.
    • I optimized the testing process. For details, you can see code/readme.md.
    • Added some backbone models using ws+gn which can further improve the generalization performance of the model. If there is a need, I can provide the pretrained parameter file of the corresponding improved version of MINet. :)
    • The directory structure has been adjusted.
  • 2020/5/10: Some meaningful changes.
    • Some methods have been tried to improve the performance of the model on specific operating environments and devices. The specific experimental results can be found in the section "More Experiments" of ./code/readme.md.
    • Use black as the repository formatting tool, and set the configuration file ./code/pyproject.toml.
  • 2020/5/6: Add some new attempts and improvements.
    • Modified the method of importing model from the directly setting in config.py, and change it to the automatic selection and instantiation of the specific model class according to the model alias (exp_name in config.py).
    • Added a lighter setting for AIM and MInet.
    • Added an attempt to checkpoint features of PyTorch.
    • Added learning rate decay strategy with learning rate warm-up. However, the parameter setting is not flexible at present, and needs to be improved.
    • A new optimization strategy has been added to imitate the settings from F3Net.
    • Upload the MIT LICENSE file.
  • 2020/4/16: Modify some misleading descriptions in the readme.md file.
  • 2020/4/7: Simplify the structure of the repository.
  • 2020/3/29: Update the method of evaluating results. (See the readme.md for more details.)
  • 2020/3/28: Update our code, results, pretrained parameters and some documents.

Repository Details

  • code: Complete training and testing code about our method. The readme.md file describes how to use the code.
  • docs: Github page about out paper. Here are some paper details.

Related Links

Paper Details

Abstract

Deep-learning based salient object detection methods achieve great progress. However, the variable scale and unknown category of salient objects are great challenges all the time. These are closely related to the utilization of multi-level and multi-scale features. In this paper, we propose the aggregate interaction modules to integrate the features from adjacent levels, in which less noise is introduced because of only using small up-/down-sampling rates. To obtain more efficient multi-scale features from the integrated features, the self-interaction modules are embedded in each decoder unit. Besides, the class imbalance issue caused by the scale variation weakens the effect of the binary cross entropy loss and results in the spatial inconsistency of the predictions. Therefore, we exploit the consistency-enhanced loss to highlight the fore-/back-ground difference and preserve the intra-class consistency. Experimental results on five benchmark datasets demonstrate that the proposed method without any post-processing performs favorably against 23 state-of-the-art approaches. The source code will be publicly available at https://github.com/lartpang/MINet.

Architecture

Comparison

BibTeX

@inproceedings{MINet-CVPR2020,
    author = {Youwei Pang and Xiaoqi Zhao and Lihe Zhang and Huchuan Lu},
    title = {Multi-scale Interactive Network for Salient Object Detection},
    booktitle = CVPR,
    year = {2020}
}

minet's People

Contributors

lartpang avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.