Machine Learning Applications on Breast Cancer Diagnosis -- ECE 228 2022 Spring, Team 10

This is the GitHub project for machine learning applications in breast cancer diagnosis, created by Team 10 of ECE 228, Spring 2022.

Group members:
Haodong Qin
Mingchen Yao
Min Zhong

Introduction

This project contains four folders: Diagnostic_data, Self_constructed_CNN, Pre_trained_model and SHAP, which correspond to the four parts of the project.

Diagnostic_data -- ML methods applied to the diagnostic (tabular) dataset.
Self_constructed_CNN -- VGG-16-like CNN applied to the image dataset.
Pre_trained_model -- Several pretrained CNN models applied to the image dataset.
SHAP -- SHAP analysis of both the diagnostic-data NN and the image-data NN.

Run the Code

The code is provided mainly as Jupyter notebooks, which should be straightforward to run. Make sure PyTorch and scikit-learn are installed and the corresponding dataset is downloaded.

A GPU is recommended for CNN training on the image dataset.
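
Before opening the notebooks, a quick check along the following lines may help confirm the setup. This is a minimal sketch, not part of the project: the helper names `check_environment` and `gpu_available` are made up for illustration, and the package list simply mirrors the libraries this README mentions (PyTorch as `torch`, scikit-learn as `sklearn`, and `shap`).

```python
import importlib.util

# Import names of the packages this README mentions (assumed, not taken
# from the notebooks themselves).
REQUIRED = ("torch", "sklearn", "shap")


def check_environment(packages=REQUIRED):
    """Return {import_name: bool} telling whether each package can be found."""
    return {name: importlib.util.find_spec(name) is not None for name in packages}


def gpu_available():
    """Return True only if PyTorch is installed and reports a CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return False
    import torch  # imported lazily so the check also works without PyTorch

    return torch.cuda.is_available()


if __name__ == "__main__":
    for name, found in check_environment().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
    print("CUDA GPU available:", gpu_available())
```

If `gpu_available()` reports False, the notebooks should still run on CPU, but CNN training on the image dataset will likely be much slower.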

Diagnostic_data

The dataset needed for the analysis is already included in the same directory as DiagnosticData.ipynb. It is part of the Wisconsin Breast Cancer (Diagnostic) Dataset: https://archive.ics.uci.edu/ml/datasets/breast+cancer+wisconsin+(diagnostic)
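
For readers who want to inspect the data outside the notebook, the sketch below parses rows in the layout of the UCI `wdbc.data` file: a sample ID, a diagnosis label (`M` for malignant, `B` for benign), then 30 real-valued features, with no header row. Whether the file bundled in this folder follows exactly that layout is an assumption, and `parse_wdbc_rows` is a hypothetical helper, not a function from the project.

```python
import csv

N_FEATURES = 30  # real-valued features per sample in the UCI wdbc.data layout


def parse_wdbc_rows(lines):
    """Parse rows of `id, diagnosis, f1..f30` into (ids, features, labels).

    Labels are 1 for malignant ("M") and 0 for benign ("B").
    `lines` can be any iterable of text lines, such as an open file.
    """
    ids, features, labels = [], [], []
    for row in csv.reader(lines):
        if not row:  # skip blank lines
            continue
        if len(row) != 2 + N_FEATURES:
            raise ValueError(f"expected {2 + N_FEATURES} columns, got {len(row)}")
        sample_id, diagnosis = row[0].strip(), row[1].strip()
        if diagnosis not in ("M", "B"):
            raise ValueError(f"unexpected diagnosis label: {diagnosis!r}")
        ids.append(sample_id)
        labels.append(1 if diagnosis == "M" else 0)
        features.append([float(value) for value in row[2:]])
    return ids, features, labels
```

Passing an open file handle to `parse_wdbc_rows` returns three parallel lists. If the bundled file has a header row or a different column order, the column check raises a `ValueError` rather than silently misreading the data.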

Self_constructed_CNN

This folder contains the self-constructed CNN. Make sure the directory contains the image data, which can be downloaded from https://www.kaggle.com/code/mohitgora/cancer-detection-cnn/data. The code runs in a Jupyter notebook.
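
When adapting a VGG-style network to a different image size, the input width of the first fully connected layer changes with it. The sketch below works out that number for the standard VGG-16 convolutional layout (3x3 convolutions with padding 1, and a 2x2 max pool after each block). The project's "VGG-16-like" network may use different block counts or channel widths, so treat `VGG16_BLOCKS` as an assumption to adjust, and the function names as illustrative.

```python
# Standard VGG-16 convolutional layout: (number of 3x3 convs, output channels)
# per block, each block followed by a 2x2 max pool with stride 2.
VGG16_BLOCKS = [(2, 64), (2, 128), (3, 256), (3, 512), (3, 512)]


def conv_out(size, kernel=3, stride=1, padding=1):
    """Spatial size after a convolution (floor division)."""
    return (size + 2 * padding - kernel) // stride + 1


def pool_out(size, kernel=2, stride=2):
    """Spatial size after max pooling without padding."""
    return (size - kernel) // stride + 1


def flattened_features(input_size, blocks=VGG16_BLOCKS):
    """Number of values entering the first fully connected layer
    for a square input of side `input_size`."""
    size, channels = input_size, None
    for n_convs, out_channels in blocks:
        for _ in range(n_convs):
            size = conv_out(size)
        size = pool_out(size)
        if size < 1:
            raise ValueError("input too small for this many pooling stages")
        channels = out_channels
    return channels * size * size
```

For a 224x224 input this gives 512 * 7 * 7 = 25088, the familiar VGG-16 figure. Smaller inputs shrink the final feature map accordingly, and the function raises an error once pooling would reduce it below 1x1.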

SHAP

Make sure the SHAP package (shap) is installed. The SHAP analysis itself may take several minutes to complete. In addition, the network needs to be trained before each analysis, which takes minutes for the diagnostic data and about 1 hour for the image data. A GPU is recommended for CNN training on the image dataset.
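
To give a sense of what the analysis estimates, the sketch below computes exact Shapley values by brute force for a tiny model. It does not use the `shap` library's API and is not the project's code. `toy_model` is a hypothetical stand-in for a trained network, and the all-zeros baseline is an arbitrary choice. The cost grows as 2^n in the number of features, which is why practical tools rely on approximations and the analysis can still take minutes.

```python
from itertools import combinations
from math import factorial


def exact_shapley(predict, x, baseline):
    """Exact Shapley values of `predict` at `x`, relative to `baseline`.

    Features outside a coalition are replaced by their baseline value.
    """
    n = len(x)

    def value(coalition):
        mixed = [x[i] if i in coalition else baseline[i] for i in range(n)]
        return predict(mixed)

    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Weight of a coalition of this size in the Shapley average.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                members = set(subset)
                phi[i] += weight * (value(members | {i}) - value(members))
    return phi


def toy_model(features):
    """Hypothetical model: two linear terms plus one interaction term."""
    a, b, c = features
    return 2.0 * a - 1.0 * b + 0.5 * a * c


if __name__ == "__main__":
    x, baseline = [1.0, 2.0, 3.0], [0.0, 0.0, 0.0]
    phi = exact_shapley(toy_model, x, baseline)
    print("attributions:", phi)
    print("sum:", sum(phi), "model gap:", toy_model(x) - toy_model(baseline))
```

The attributions sum to the difference between the model output at `x` and at the baseline. The interaction term's contribution is split evenly between the two features involved.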
