Giter VIP home page Giter VIP logo

cidgoh / virus-mvp Goto Github PK

View Code? Open in Web Editor NEW
5.0 5.0 2.0 80.3 MB

VirusMVP is an interactive heatmap-centric app that integrates viral genomic mutations, lineage information and curated functional impact to study the spread and evolution of viruses in Canada and globally.

Home Page: https://virusmvp.org/

License: MIT License

Python 92.91% JavaScript 6.77% Dockerfile 0.29% Shell 0.04%
bioinformatics covid-19 dash plotly sars-cov-2 visualization

virus-mvp's People

Contributors

anwarmz avatar despean avatar duanjunhyq avatar ivansg44 avatar miseminger avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

virus-mvp's Issues

Improved footer

Footer was made pretty quickly prior to first official release.

Make it look more professional, and also include link to source code (github) and contact email.

Add an option to download Surveillance report

  • Somewhere on the visualization, need to add a button that can be used for downloading surveillance report
  • Surveillance report will be generated during the genomics workflow when the user uploads an input file

Toggling switch on/off -> improper heatmap sizing

Toggling clade defining mutations switch on and off, and then resizing the window, resizes the heatmap cells. The bars should stay the same size at all times, with scrollable overflow.

heatmap_resizing_bug

Good place to start investigating would be the clade defining mutations switch callback. There is something different about how the heatmap is first rendered on application launch, outside the callback, because this bug does not happen without toggling the switch.

Loading indicators

Good to have loading indicators for operations that may take a while

e.g., regenerating the heatmap cells figs when there are a lot of strains or running the pipeline

Simplify hover display

Should round off alt freqs and think of a better way to summarize mutations instead of listing as many as we can fit into the hover box

Limit number of visible rows in heatmap

Heatmaps are pretty good at visualizing a large number of rows and columns, but our cells are pretty big and resource intensive due to marker annotations, hover boxes, heterozygosity spacing considerations, etc.

We might need to limit the number of rows a user can see, while still allowing user to select from a pool of rows exceeding that number from the select lineages modal. Another consideration is whether we want to make the heatmap cells and y-axis scrollable, and then limit the number of rows visible based solely on performance considerations--not screen real estate. Scrolling is easier than having to select lineages.

Display sample size

Encode sample sizes. One idea is to have a column to the right of the heatmap.

Display single genome inputs

Related #56

We will need to display these without colour since frequency is not a factor.

Things to consider for distinguishing single genomes from lineages:

  • All black or white filled squares
  • Different shaped markers?

Improve documentation

Started in #44

Things to do:

  • Instructions for interacting with submodule
  • Instructions for windows users

Display a subset of lineages on start

After #66 and #75 are settled, we may want to display only a subset of lineages to begin with. With all lineages displayed, the heatmap cells callback takes about 1 second to run, which slows things down a bit.

Deploying on website without user upload option

  • We need to have a separate version of COVID-MVP (possibly on website branch) that doesnt have user upload option
  • We will still need surveillance report downloading option for the data we will be hosting - VirusSeq Data portal

Replace n/a in hover with more accurate term

N/A could imply we did not look for a function for a mutation or that we are missing data. We should use something more specific, that more strongly implies a function has not yet been discovered for a mutation.

Improve COVID-MVP performance

Getting slow now with ~700 lineages

Possibilities:

  • Greater parallelization of viz generation. Maybe one diagram per heatmap row?
  • Improve data parsing efficiency when only slight changes are made, or split up data parsing into parallel code for each lineage

"Jump to" fn to supplement scrolling

Scrolling to a position along the x-axis may get annoying as the number of mutations increases. Need a way to jump to a position directly.

One possible solution: search fn that allows you to jump to a specific nucleotide position.

Further improvements to caching

  • Cache does not seem to be re-accessed on page reload--fix this
  • Some of the other intensive operations could be cached (e.g., heatmap generation)

Integrate Nextflow commands into visualization module

Currently, data_parser.py is converting VCF to GVF which is then parsed for visualization. Lets move towards integrating nextflow commands now which will require following updates

  1. Allow user to upload .vcf or .fasta or .tsv file.
  2. Need a check to confirm the file type based on which this nextflow command will be structured

nextflow run nf-ncov-voc/main.nf -profile < conda | singularity | docker > --prefix < unique_user_id > --mode user --input_type < vcf | tsv | fasta > --userfile < user_uploaded_file > --outdir < output directory where files will be generated >

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.