choosehappy / histoqc Goto Github PK

View Code? Open in Web Editor NEW

255.0 9.0 100.0 20.27 MB

HistoQC is an open-source quality control tool for digital pathology slides

License: BSD 3-Clause Clear License

Python 27.40% CSS 4.27% HTML 1.42% JavaScript 66.72% Dockerfile 0.19%

histoqc's Introduction

HistoQC

HistoQC is an open-source quality control tool for digital pathology slides

Requirements

Tested with Python 3.7 and 3.8 Note: the DockerFile installs Python 3.8, so if your goal is reproducibility you may want to take this into account

Requires:

openslide

And the following additional python package:

python-openslide
matplotlib
numpy
scipy
skimage
sklearn
pytest (optional)

You can likely install the python requirements using something like (note python 3+ requirement):

pip3 install -r requirements.txt

The library versions have been pegged to the current validated ones. Later versions are likely to work but may not allow for cross-site/version reproducibility (typically a bad thing in quality control).

Openslide binaries will have to be installed separately as per individual o/s instructions

The most basic docker image can be created with the included (7-line) Dockerfile.

Installation

Using docker

Docker is now the recommended method for installing and running HistoQC. Containerized runtimes like docker are more portable and avoid issues with python environment management, and ensure reproducible application behavior. Docker is available for Windows, MacOS, and Linux.

Note: These instructions assume you have docker engine installed on your system. If you do not have docker installed, please see the docker installation instructions.

Begin by pulling the official HistoQC docker image from docker hub. This repository contains the latest stable version of HistoQC and is guaranteed up-to-date.
```
docker pull histotools/histoqc:master
```

Next, run the docker image with a few options to mount your data directory and expose the web interface on your host machine.

docker run -v <local-path>:/data --name <container-name> -p <local-port>:5000 -it histotools/histoqc:master /bin/bash
# Example:
# docker run -v /local/datadir:/data --name my_container -p 5000:5000 -it histotools/histoqc:master /bin/bash

A terminal session will open inside the docker container. You can now run HistoQC as you would on a local machine.
If you exit the shell, the container will stop running but no data/configuration will be lost. You can restart the container and resume your work with the following command:
```
docker start -i <container-name>
# Example:
# docker start -i my_container
```

Using pip

You can install HistoQC into your system by using

git clone https://github.com/choosehappy/HistoQC.git
cd HistoQC
python -m pip install --upgrade pip  # (optional) upgrade pip to newest version
pip install -r requirements.txt      # (required) install pinned versions of packages
pip install .                        # (recommended) install HistoQC as a package

Note that pip install . will install HistoQC as a python package in your environment. If you do not want to install HistoQC as a package, you will only be able to run HistoQC from the HistoQC directory.

Basic Usage

histoqc CLI

Running the pipeline is now done via a python module:

C:\Research\code\HistoQC>python -m histoqc --help
usage: __main__.py [-h] [-o OUTDIR] [-p BASEPATH] [-c CONFIG] [-f] [-b BATCH]
                   [-n NPROCESSES] [--symlink TARGET_DIR]
                   input_pattern [input_pattern ...]

positional arguments:
  input_pattern         input filename pattern (try: *.svs or
                        target_path/*.svs ), or tsv file containing list of
                        files to analyze

optional arguments:
  -h, --help            show this help message and exit
  -o OUTDIR, --outdir OUTDIR
                        outputdir, default ./histoqc_output_YYMMDD-hhmmss
  -p BASEPATH, --basepath BASEPATH
                        base path to add to file names, helps when producing
                        data using existing output file as input
  -c CONFIG, --config CONFIG
                        config file to use
  -f, --force           force overwriting of existing files
  -b BATCH, --batch BATCH
                        break results file into subsets of this size
  -s SEED, --seed SEED,
                        set a seed used to produce a random number in all modules                    
  -n NPROCESSES, --nprocesses NPROCESSES
                        number of processes to launch
  --symlink TARGET_DIR  create symlink to outdir in TARGET_DIR

Installed or simply git-cloned, a typical command line for running the tool thus looks like:

python -m histoqc -c v2.1 -n 3 "*.svs"

which will use 3 process to operate on all svs files using the named configuration file config_v2.1.ini from the config directory.

In case of errors, HistoQC can be run with the same output directory and will begin where it left off, identifying completed images by the presence of an existing directory.

histoqc.config CLI

Supplied configuration files can be viewed and modified like so:


C:\Research\code\HistoQC>python -m histoqc.config --help
usage: __main__.py [-h] [--list] [--show NAME]

show example config

optional arguments:
  -h, --help   show this help message and exit
  --list       list available configs
  --show NAME  show named example config

Alternatively one can specify their own modified config file using an absolute or relative filename:

python -m histoqc.config --show light > mylight.ini
python -m histoqc -c ./mylight.ini -n 3 "*.svs"

histoqc.ui CLI

HistoQC now has a httpd server which allows for improved result viewing, it can be accessed like so:

C:\Research\code\HistoQC>python -m histoqc.ui --help
usage: histoqc.ui [-h] [--port PORT] resultsfilepath

launch server for result viewing in user interface

positional arguments:
  resultsfilepath       Specify the full path to the results file. The user must specify this path.

optional arguments:
  -h, --help            show this help message and exit
  --port PORT, -p PORT  Specify the port [default:5000]

After completion of slide processing, view results in your web-browser by running the following command:

python -m histoqc.ui <results-file-path>
# Example:
# python -m histoqc.ui ./histoqc_output_YYMMDD-hhmmss/results.tsv

Note: The results file is a tab-separated file generated by HistoQC containing the quality control metrics for each slide. HistoQC generates the results file in the output directory specified by the -o flag, or formatted as histoqc_output_YYMMDD-hhmmss by default.

You may then navigate to http://<hostname>:5000 in your web browser to view the results.

Configuration modifications

HistoQC's performance is significantly improved if you select an appropriate configuration file as a starting point and modify it to suit your specific use case.

If you would like to see a list of provided config files to start you off, you can type

python -m histoqc.config --list

and then you can select one and write it to file like so for your modification and tuning:

python -m histoqc.config --show ihc > myconfig_ihc.ini

Advanced Usage

See wiki

Notes

Information from HistoQC users appears below:

the new Pannoramic 1000 scanner, objective-magnification is given as 20, when a 20x objective lense and a 2x aperture boost is used, i.e. image magnification is actually 40x. While their own CaseViewer somehow determines that a boost exists and ends up with 40x when objective-magnification in Slidedat.ini is at 20, openslide and bioformats give 20x.

1.1. When converted to svs by CaseViewer, the MPP entry in ImageDescription meta-parameter give the average of the x and y mpp. Both values are slightly different for the new P1000 and can be found in meta-parameters of svs as tiff.XResolution and YResolution inverse values, so have to be converted, also respecting ResolutionUnit as centimeter or inch

Citation

If you find this software useful, please drop me a line and/or consider citing it:

"HistoQC: An Open-Source Quality Control Tool for Digital Pathology Slides", Janowczyk A., Zuo R., Gilmore H., Feldman M., Madabhushi A., JCO Clinical Cancer Informatics, 2019

Manuscript available here

“Assessment of a computerized quantitative quality control tool for kidney whole slide image biopsies”, Chen Y., Zee J., Smith A., Jayapandian C., Hodgin J., Howell D., Palmer M., Thomas D., Cassol C., Farris A., Perkinson K., Madabhushi A., Barisoni L., Janowczyk A., Journal of Pathology, 2020

Manuscript available here

histoqc's People

Contributors

Stargazers

Watchers

Forkers

foggydae pjl54 qvduoduo1997 ilknuricke yijiangchen 3dimaging qigongfda huangch 13324077 vallurumk matthieurouland birm skrackow kheffah zsj0577 christinaliang aaronponceuv monjoybme mohanapriyanarapareddy petrovm3 histopathology skyclub3 jiayunli amir-reza-sadri ejri ylch trasse pacific89 fernandorome maberyick jjhbw rexmiao satishev frmdstryr jesperkers jmnyman rraiff3 bayer-science-for-a-better-life nki-ai tasvora inoscirculation bswhite skoc dsouzavijeth kaczmarj samkaranth min-sheng r-j96 aditya964 bytehexler volodymyrchapman tossman korjmorj willpower057 adamjtaylor elaheh-alizadeh mohsenhariri lshzhang lavlabinfrastructure rajeevyadav lavlabinfrastructure maduc7 ap-- sarthakpati guillermojp nunocalaim ant0nsc henry111383 julienmassonnet nanli-emory andreped fangliu117 ntomita brucexu0222 curlup ngtesig cardenm ajinkya-kulkarni cielal yuchaejung audiowiz yanfang-research jacksonjacobs1 veyselolgun petroslk cgdogan suhasthegame mc2-center 1059444127 zhouhui521 gitmdeen zhihaowan parvathanenibio aacai999 kaiko-ai nengwp danielaschacherer sarderlab

histoqc's Issues

UI: add editable comment field (or checkbox) for user input

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 88, http://hawking.case.edu:3000/issues/88
Original Date: 2018-01-03

then offer to save to file/export

this would allow real-time note takin

Add error log file

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 113, http://hawking.case.edu:3000/issues/113
Original Date: 2018-01-15

None

Module: Slide thickness prediction

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 83, http://hawking.case.edu:3000/issues/83
Original Date: 2018-01-03

are we sure all slides are cut to the same thickness?

if not, determining approximate thickness would be useful

UI: on clicking on graph also show row

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 108, http://hawking.case.edu:3000/issues/108
Original Date: 2018-01-10

None

Stain prediction

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 77, http://hawking.case.edu:3000/issues/77
Original Date: 2018-01-03

Often time images in the TCGA may not be H&E, but may also be DAB and others.

Need to identify these images

UI: Move overlay image to side

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 85, http://hawking.case.edu:3000/issues/85
Original Date: 2018-01-03

Comment from Mario:

!screenshot_2_1515001688.png!

for figure 5, this placement was a result of my javascript skills. i don't think its so terrible because either clicking the image again or pressing the "escape" button closes the image so that you can again see the graph, so its perhaps not as bad as you thought.

Yes, it also didn't bother me too much when you gave the live demo the other day. But wanted to point it out anyway.

Module: Detect Compression quality

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 124, http://hawking.case.edu:3000/issues/124
Original Date: 2018-02-02

None

Module: Calcification detection

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 109, http://hawking.case.edu:3000/issues/109
Original Date: 2018-01-13

None

UI: display filename

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 106, http://hawking.case.edu:3000/issues/106
Original Date: 2018-01-10

after loading, should have filename with whatever path information available visible

UI: Add save button for change

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 104, http://hawking.case.edu:3000/issues/104
Original Date: 2018-01-10

check to make sure reloading actually works

UI: Limit values to reasonable number of digis

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 87, http://hawking.case.edu:3000/issues/87
Original Date: 2018-01-03

None

Inline Magnification detection

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 81, http://hawking.case.edu:3000/issues/81
Original Date: 2018-01-03

right now the magnification comes from the metadata, which if blank is reported as blank

should auto compute on the fly if needed

Additional Meta data columns

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 91, http://hawking.case.edu:3000/issues/91
Original Date: 2018-01-03

z-stack?
bit depth
number of color channels

Module: Frozen vs FFPE

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 79, http://hawking.case.edu:3000/issues/79
Original Date: 2018-01-03

need to be able to asses if the image contains a flash frozen or an FFPE module

migrate to python 3.x

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 95, http://hawking.case.edu:3000/issues/95
Original Date: 2018-01-08

None

Add "post processing" module

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 121, http://hawking.case.edu:3000/issues/121
Original Date: 2018-01-24

to do final things on use mask

e.g., size threshold, smoothing, etc

UI: Add left/right arrows to overlay

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 93, http://hawking.case.edu:3000/issues/93
Original Date: 2018-01-05

should add visual guidance that the overlay image can be scrolled through

maybe gray out background so that the layout is clearly shown as being "in focus"

Module: knife chatter

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 120, http://hawking.case.edu:3000/issues/120
Original Date: 2018-01-24

None

UI: Scroll to row for image/graph/overlay

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 99, http://hawking.case.edu:3000/issues/99
Original Date: 2018-01-08

None

Continue after fail of backend

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 112, http://hawking.case.edu:3000/issues/112
Original Date: 2018-01-15

add a "force" requirement to front end to force overwriting of existing files, otherwise skip it if it exists

UI: add error log view (overlay?)

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 116, http://hawking.case.edu:3000/issues/116
Original Date: 2018-01-15
Original Assignee: Ren Zuo

None

Module: Marker detection

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 80, http://hawking.case.edu:3000/issues/80
Original Date: 2018-01-03

would like to ignore all pen markings on slides

UI: add tag functionality

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 115, http://hawking.case.edu:3000/issues/115
Original Date: 2018-01-15
Original Assignee: Ren Zuo

can be comma separated in a separate field (to be provided by back end, likely named "tag")

click "add tag", and any tag added is appended to the already existing list of tags for that image

this comes into play when multi selecting - > add tag - > "too blurry"

then sorting differently, multiselecting- > add tag - > "folded tissue"

any of the images which were in both selected lists should have both tags

bonus functionality: provide list of suggested tags (e.g., when starting to type, show tags already existing) to limit tag "wander"

Address python warnings (currently all commented out)

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 111, http://hawking.case.edu:3000/issues/111
Original Date: 2018-01-15

None

Refactor baseImage to be inherited from userDict

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 110, http://hawking.case.edu:3000/issues/110
Original Date: 2018-01-15

None

UI: change multiple comments at once using multiselect

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 114, http://hawking.case.edu:3000/issues/114
Original Date: 2018-01-15

e.g., select 10 or 20 images, and only be required to type the comment once and then apply it to all of them

Parallel Processing of Images

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 76, http://hawking.case.edu:3000/issues/76
Original Date: 2018-01-03
Original Assignee: Andrew Janowczyk

to improve runtime, can process images in parallel

note, need to use multiprocessing and not threading, as python will block

this makes it a bit more tricky

UI: Filename paths all relative or absolute

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 84, http://hawking.case.edu:3000/issues/84
Original Date: 2018-01-03

Currently in the UI there are a few places where the filenames appear as both relative, aboslute, or "hidden" path names

should be homogenized, preferably to something which allows the user to rapidly find the files via copy->paste (e.g., absolute)

absolute has the downside wherein its a bit tricky if the directory is moved, without special attention, the files won't be found (e.g., can't save abosltue filenames in csv file)

Detect CoverSlip Module

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 97, http://hawking.case.edu:3000/issues/97
Original Date: 2018-01-08

None

UI: multiselect, use "select" from datatables

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 101, http://hawking.case.edu:3000/issues/101
Original Date: 2018-01-08

to provide better o/s style selection

UI: On overlay add up/down arrows to go to next image in table

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 102, http://hawking.case.edu:3000/issues/102
Original Date: 2018-01-08

None

UI: Graphs not displayed correctly when large number of images are present

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 117, http://hawking.case.edu:3000/issues/117
Original Date: 2018-01-17

can expect final datasets to have 1000 images, need a way of display them nicely

!screenshot_1_1516197106.png!

UI: Save State

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 105, http://hawking.case.edu:3000/issues/105
Original Date: 2018-01-10

so that data isn't locally lost on reload etc

should be an option in datatables

https://datatables.net/examples/basic_init/state_save.html

UI: If warning field is not empty highlight row (or add button to highlight rows with errors)

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 125, http://hawking.case.edu:3000/issues/125
Original Date: 2018-02-14
Original Assignee: Ren Zuo

None

Parse out all of Slide Metadata

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 89, http://hawking.case.edu:3000/issues/89
Original Date: 2018-01-03

and present in individual columns

this may become tricky because each manufacturer has different metadata information

also, has the downside of crowding the image, would need to put better "sorting" in the column orders, so that the important values aren't hidden all the way to the right

UI: Full keyboard operation on grid

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 107, http://hawking.case.edu:3000/issues/107
Original Date: 2018-01-10

have "excel" keys, so that the arrow keys allow for movement around the table

pressing "i": shows the overly
pressing : selects the item
pressing in editable item lets you edit, then rejects changes or accepts

Move logging over to logging module instead of "print"

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 126, http://hawking.case.edu:3000/issues/126
Original Date: 2018-02-16

None

UI: Limit cell edits to comment field

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 100, http://hawking.case.edu:3000/issues/100
Original Date: 2018-01-08

None

Stain deconvolution module

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 123, http://hawking.case.edu:3000/issues/123
Original Date: 2018-01-31

None

UI: put images into rows, not a single row

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 96, http://hawking.case.edu:3000/issues/96
Original Date: 2018-01-08

its impossible to scroll horizontally through all these images

!screenshot_1_1515428485.png!

ClassificationModule.byExampleWithFeatures:pen_markings
ClassificationModule.byExampleWithFeatures:coverslip_edge

so that the same module can be re-used with different options

Break sets up

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 118, http://hawking.case.edu:3000/issues/118
Original Date: 2018-01-17

command line parameter specifying the maximum number of samples per output file

this will help the front end display data and the user review data in coherent chunks

this is ralted to

UI: tooltip pop-ups

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 92, http://hawking.case.edu:3000/issues/92
Original Date: 2018-01-03

for buttons and as well for the actual columns

this would help inexperienced users understand quickly what the particular metric is for, how it was generated, etc

If UI configuration becomes complex, consider having a default config file present/loaded

Author Name: Andrew Janowczyk (@choosehappy)
Original Redmine Issue: 119, http://hawking.case.edu:3000/issues/119
Original Date: 2018-01-24

None

choosehappy / histoqc Goto Github PK

histoqc's Introduction

HistoQC

Requirements

Installation

Using docker

Using pip

Basic Usage

histoqc CLI

histoqc.config CLI

histoqc.ui CLI

Configuration modifications

Advanced Usage

Notes

Citation

histoqc's People

Contributors

Stargazers

Watchers

Forkers

histoqc's Issues

Recommend Projects

Recommend Topics

Recommend Org