Refactor of FMImaging, v1

Overview

This is the v1 refactor of the FMImaging codebase, which contains basic utilities for single-task segmentation, classification, and image enhancement with DDP. The purpose of the refactor was to:

Reduce the amount of new code needed for new projects, with zero-code solutions for basic applications and lightweight customizations for others.
Reduce the amount of rewritten code per project.
Make organization clearer (e.g., by consolidating configs, metrics, optimizers, losses, etc.) to keep codebase clean as we continue to add complexity.
Prepare consolidated codebase for FM experiments, including:
- Build in pre/post/backbone structure.
- Include utils for segmentation, classification, and enhancement tasks.

Organization

The codebase organizes directories by utility. The run.py file shows how the codebase progresses:

In the setup dir, args are parsed into a config and initial setup functions are run.
In the data dir, torch datasets are created.
In the loss dir, the loss function is defined.
In the model dir, the Model Manager is defined, which contains the pre/backbone/post modules and utils for saving and loading the model.
In the optim dir, the Optim Manager is defined, which contains the optimizer and scheduler.
In the metrics dir, the Metric Manager is defined, which tracks all performance metrics during training.
In the trainer dir, the Train Manager is defined, which controls DDP and the train/eval loops.

Each project can be stored in the projects dir.

These dirs are intended to be modular, so one utility can be customized without needing to rewrite code in other dirs. We can use this codebase with no customizations, just specifying hyperparameters and variables via the command line and using the default functions. Alternatively, we can supply customizations to each of the above components when needed. Both solutions are described below.

Warnings

A final few notes on the refactor:

There will be bugs. I can help debug, just let me know.
I tested the omnivore model most extensively; STCNNT runs but I have not trained it to completion.
There are files and functions I took out when I couldn't tell if/how they were used or how to make them general solutions (e.g., running_inference.py, utilities.py, wrapper around ddp cmd). We can put old functions back in, but try to keep the organization of the refactor.
There are additional utils we can build out (e.g., adding more args and optimizers, adding more augmentation functions, adding more losses). I put basic utils in. New utils should be relatively easy to add in the config + in the current organization.
I have not tested DDP on multiple nodes.

Using the codebase

With no customizations

The codebase can be directly used with no customizations to train simple segmentation, classification, and image enhancement tasks. Examples of this are included in the projects folder, including abct_segment (3D multiclass segmentation), ct_denoise (3D image denoising), mrnet_classify (3D binary classification), ptx_classify (2D binary classification), and tissue_segment (2D binary segmentation). Each of these trains a model for their respective tasks simply by supplying args via the command line. All available args can be found in setup/parsers

To use the default codebase, you need to format your data according to the following structure:

├── task_name
│   ├── subject_1
│   │   ├── subject_1_input.npy
│   │   ├── subject_1_output.npy (if training a segmentation or enhancement task)
│   ├── subject_2
│   │   ├── subject_2_input.npy
│   │   ├── subject_2_output.npy (if training a segmentation or enhancement task)
│   ├── subject_3
│   │   ├── subject_3_input.npy
│   │   ├── subject_3_output.npy (if training a segmentation or enhancement task)
│   ├── task_name_metadata.csv (if training a classification task)

The task_name and subject_IDs can be chosen by the user. However, each subject's folder needs to have a file named <subject_ID>_input.npy, and if training a segmentation or enhancement task, a file named <subject_ID>_output.npy. Each numpy file should be formatted as an array of shape X Y Z C, where Z and C are optional dimensions (i.e., they can be squeezed for 2D or single-channel tasks). For segmentation, the output file's channel dimension should either be squeezed or C=1. If training a classification task, the <task_name> directory also needs to have a csv of labels called <task_name>_metadata.csv, formatted as follows:

SubjectID	Label
subject_1	0
subject_2	1

where subject_1 matches the naming convention of the data directories.

With customizations

If you need additional customizations, you can customize each component in the run.py file. Namely, you can add a custom parser, replace the dataset or loss functions, or modify the Model Manager, Optim Manager, Metric Manager, or Train Manager. To customize the codebase, you will need to:

Create a custom run.py file. This should mirror the default run.py file with the same sequence of steps.
Replace each step in run.py modularly with your custom code, as needed.

Example projects with customizations include cifar_classify, which has a custom dataset, and example_custom_project, which has a custom parser, dataset, loss, and Model Manager.

For futher details on how to customize the codebase, see projects/example_custom_project/custom_run.py. This file will explain how to customize each component and includes an example.

xueh2 / imagingtransformer Goto Github PK

imagingtransformer's Introduction

Refactor of FMImaging, v1

Overview

Organization

Warnings

Using the codebase

With no customizations

With customizations

imagingtransformer's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent