Giter VIP home page Giter VIP logo

crd-net's Introduction

Cross-modal attention network for retinal disease classification based on multi-modal images (CRD-Net)

Example Images from the Dataset

Introduction

This repository contains the implementation of the paper "Cross-modal attention network for retinal disease classification based on multi-modal images" by Zirong Liu, Yan Hu, Zhongxi Qiu, Yanyan Niu, et al.

Abstract:

Multi-modal eye disease screening improves diagnostic accuracy by providing lesion information from different sources. However, existing multi-modal automatic diagnosis methods tend to focus on the specificity of modalities and ignore the spatial correlation of images. This paper proposes a novel cross-modal retinal disease diagnosis network (CRD-Net) that digs out the relevant features from modal images aided for multiple retinal disease diagnosis. Specifically, our model introduces a cross-modal attention (CMA) module to query and adaptively pay attention to the relevant features of the lesion in the different modal images. In addition, we also propose multiple loss functions to fuse features with modality correlation and train a multi-modal retinal image classification network to achieve a more accurate diagnosis. Experimental evaluation on three publicly available datasets shows that our CRD-Net outperforms existing single-modal and multi-modal methods, demonstrating its superior performance.

Features

  • Cross-Modal Attention (CMA) Module
  • Multiple Loss Functions
  • Evaluation on three publicly available datasets: MMC-AMD, APTOS-2021, and GAMMA

Installation

Clone this repository:

git clone https://github.com/ZirongLiu/CRD-Net.git
cd CRD-Net

Install the required dependencies:

pip install -r xxx.txt

Usage

Training

To train the CRD-Net model, use the following command:

python xxx.py --dataset [MMC-AMD/APTOS-2021/GAMMA] --epochs 150 --batch_size 8

Evaluation

To evaluate the model on the test set, use the following command:

python xxx.py --dataset [MMC-AMD/APTOS-2021/GAMMA] --checkpoint_path path/to/checkpoint

Datasets

The datasets used in this study are publicly available:

  1. MMC-AMD: [Dataset Link]
  2. APTOS-2021: [Dataset Link]
  3. GAMMA: [Dataset Link]

Ensure you have the datasets downloaded and organized as described in the documentation before training or evaluating the model.

Results

The CRD-Net model achieves state-of-the-art performance on the MMC-AMD, APTOS-2021, and GAMMA datasets. Detailed performance metrics and comparison with other methods are provided in the paper.

Class Activation Map Visualization

Citation

If you find this repository useful in your research, please cite the following paper:

@article{liu2024crossmodal,
  title={Cross-modal attention network for retinal disease classification based on multi-modal images},
  author={Liu, Zirong and Hu, Yan and Qiu, Zhongxi and Niu, Yanyan and Zhou, Dan and Li, Xiaoling and Shen, Junyong and Jiang, Hongyang and Li, Heng and Liu, Jiang},
  journal={Biomedical Optics Express},
  volume={15},
  number={6},
  pages={3699--3713},
  year={2024},
  publisher={Optica Publishing Group}
}

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (82102189 and 82272086) and the Shenzhen Stable Support Plan Program (20220815111736001).

Contact

For any questions or issues, please contact me.

crd-net's People

Contributors

zirongliu avatar

Stargazers

yyNiu avatar

Watchers

Kostas Georgiou avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.