Giter VIP home page Giter VIP logo

digitalslidearchive / histomicstk Goto Github PK

View Code? Open in Web Editor NEW
381.0 381.0 113.0 131.14 MB

A Python toolkit for pathology image analysis algorithms.

Home Page: https://digitalslidearchive.github.io/HistomicsTK/

License: Apache License 2.0

Python 96.79% CMake 0.20% Shell 0.14% Dockerfile 0.45% Cython 2.42%
bioimage-informatics computer-vision digital-slide-archive histology machine-learning medical-image-processing python

histomicstk's People

Contributors

abnsy avatar brianhelba avatar cdeepakroy avatar cooperlab avatar danlamanna avatar dependabot[bot] avatar dgutman avatar estelle900 avatar idc9 avatar jbeezley avatar jcfr avatar jellywang-3 avatar kheffah avatar kotfic avatar leengit avatar m101010 avatar manthey avatar marysalvi avatar naglepuff avatar ngchanway avatar predicative avatar salamb avatar sgratzl avatar slee172 avatar subinkitware avatar sumanthratna avatar thewtex avatar zachmullen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

histomicstk's Issues

Poisson mixture model cuts

Implement poisson mixture modeling and binary pygco graph cuts for foreground/background segmentation.

Slicer XML based command-line argument parser

Create a slicer xml based command-line argument parser using argparse to enable developers to write slicer execution model CLIs in python.

Once this is done developers will be able to write CLIs in python once which can then be run both on command-line (the in-line python script CLIs that we have now wont allow this) and called over the web via REST end point.

Remove dependency on matplotlib

I don't think we need to require matplotlib for installing histomicstk. This is causing issues installing on our cluster as it tries to pull in all kinds of other packages. People are free to install this independently and there are other options for viewing images.

3rd party nuclear segmentation with pygco

Implement end-to-end nuclear segmentation pipeline using the following components:
-Whole-slide Reinhard normalization
-Poisson mixture models / binary graph cuts
-Constrained log (cLoG) splitting to produce label image
-Multilabel graph cuts refinement to refine label image

Address memory limits in Sample.py

Sample attempts to load a low-res version of the entire slide to generate a tissue mask. If a 1.25X magnification is not provided by the WSI then this could result in loading an entire slide at 5X or high magnification.

Fill values distort model fitting procedures

The fill values used when requesting image tiles from OpenSlide can produce errors in building models to distinguish tissue from glass. Need to modify code to exclude pixels from filled areas that are added to create an integral-sized tiling.

Develop front-end for DSA

The front end should have ways to:

  • select data from girder
  • viewer for visualizing whole-slide images and annotations/results
  • select and run analysis CLIs exposed in the server directory of histomicsTK

Cycle in MaxClustering

There is a cycle in trajectory tracking for MaxClustering that causes an endless loop. Need to check the edge values of tracking matrices for correctness.

Fix optimization routine in SimpleMask.py

Optimization routine 'fmin_slsqp' from scipy is stopping early before convergence. Need to examine stopping criteria in documentation and do some more extensive testing for constrained curve fitting of image histograms.

Refactor histomicstk into submodules and correct naming

Issues include moving functions into submodules, converting function names from snake-case to camel-case, correcting function names in "See Also" document strings, and adding underscore prefixes to non-exposed functions.

MaxClustering - fix seeding

Deal with cases where there are multiple connected seed pixels in (Response == Max) - typically encountered where Response is flat.

Create an API for getting tiles of whole-slide images

A good way to do this is by creating an abstract WholeSlideImage class which defines the API and then make derived classes of it such as GirderWholeSlideImage (which serves an image stored on girder) and LocalWholeSlideImage (which serves an image located on the local machine).

We would want all analysis functions in histomicstk to take objects of the abstract WholeSlideImage class as input instead of a numpy array or PIL image.

Sample.py produces empty output

On images:
TCGA-HW-8322-01Z-00-DX1.B6F32F6B-7FA3-42E4-98CB-F2F55208F7C4.svs
TCGA-HT-7479-01Z-00-DX1.E310974D-E52C-4634-8E16-2A41E8C37D45.svs
TCGA-DU-8165-01Z-00-DX1.8d633ff1-6aed-41e5-8518-8856dbe4a718.svs
TCGA-DU-8166-01Z-00-DX1.82951397-8c63-4c0f-8696-f081b170e21f.svs

File "/opt/lib/python2.7/site-packages/histomicstk-0.1.0-py2.7.egg/histomicstk/Sample.py", line 119, in Sample
Pixels = np.concatenate(Pixels, 1)

Remove condensing from label processing functions

Functions for processing label images automatically condense the label image values to fill gaps for simplicity. This breaks the correspondence between the values of objects before and after calling the function. This can be fixed by skipping over 'empty' labels in the input.

Optimization fails in SimpleMask.py

On image TCGA-S9-A6UA-01Z-00-DX1.99AB8786-858B-46A0-94BA-C5AA1CD5351B.svs

File "/opt/lib/python2.7/site-packages/histomicstk-0.1.0-py2.7.egg/histomicstk/SimpleMask.py", line 55, in SimpleMask
TissuePeak = Peaks[yHist[Peaks[1:]].argmax()+1] #take highest peak among remaining peaks as background
ValueError: attempt to get argmax of an empty sequence

Rendering of annotations in HistomicsTK front-end

This issue is about adding the functionality necessary for rendering annotations on top of images in the HistomicsTK front-end.

@manthey and @jbeezley. Can one of you please take this up?

You can use some dummy annotations of each type to build the functionality necessary to render them.

Boundary and feature formats

Define the formats to be produced by object segmentation and feature extraction.

Boundaries - will need a function to convert a list of 2 x N arrays (x,y) to some format for consumption into Girder.

Features - how to link the feature names to the rows of a K x N array. Would these be consumed into Girder or kept as arrays on disk?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.