ballad's People

Contributors

envest

ballad's Issues

Missing ALT value

Include a value for ALT when it is missing in `extract_variant_sites_liftover.py`.

Get consensus genotypes from loom

Given a set of SNPs, determine the genotype of a sample at each SNP, taking coverage variability and allele dropout into consideration. Use the result as input when unpooling loom files.
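
One way to approach this is a per-SNP vote across cells, sketched below. The genotype codes (0 = homozygous reference, 1 = heterozygous, 2 = homozygous alternate, 3 = missing), the function name, and all thresholds are assumptions for illustration; none of them come from the ballad code.

```python
from collections import Counter

# Assumed genotype codes (not confirmed by the source):
# 0 = hom ref, 1 = het, 2 = hom alt, 3 = missing.
HOM_REF, HET, HOM_ALT, MISSING = 0, 1, 2, 3


def consensus_genotype(calls, depths, min_depth=10, min_cells=5, het_fraction=0.2):
    """Return a consensus genotype code for one SNP, or MISSING if undetermined.

    calls  -- per-cell genotype codes at this SNP
    depths -- per-cell read depth at this SNP (same length as calls)
    All thresholds are hypothetical defaults.
    """
    # Coverage variability: ignore cells that are uncalled or too shallow.
    usable = [g for g, d in zip(calls, depths) if g != MISSING and d >= min_depth]
    if len(usable) < min_cells:
        return MISSING

    counts = Counter(usable)
    n = len(usable)

    # Allele dropout makes heterozygous cells look homozygous, so a modest
    # share of het calls, or a mix of both homozygous classes, is treated
    # as evidence for a heterozygous sample.
    if counts[HET] / n >= het_fraction:
        return HET
    if counts[HOM_REF] / n >= het_fraction and counts[HOM_ALT] / n >= het_fraction:
        return HET
    return HOM_REF if counts[HOM_REF] >= counts[HOM_ALT] else HOM_ALT
```

The heterozygous checks run first on purpose: a simple majority vote would tend to undercall heterozygous sites when dropout is common.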

`filter_loom.py` output columns in logical order

Currently the output file has its columns in alphabetical order (taken from the list of loom file row attribute keys). A more meaningful order is possible, but it would have to be pre-specified, which makes it harder to adapt to future file formats.
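
A possible compromise, sketched below, is to put a pre-specified list of columns first and append anything unrecognized in alphabetical order, so new attributes in future file formats still appear in the output. The function name and the column names in the usage note are hypothetical.

```python
def order_columns(available, preferred):
    """Order columns: preferred ones first (in the given order), the rest alphabetically.

    available -- column names found in the file
    preferred -- pre-specified order; names absent from the file are skipped
    """
    available_set = set(available)
    head = [name for name in preferred if name in available_set]
    tail = sorted(available_set - set(head))
    return head + tail
```

For example, `order_columns(["ref", "alt", "chrom", "pos", "zscore"], ["chrom", "pos", "ref", "alt", "id"])` keeps the four known columns in the preferred order and places the unknown `zscore` column last.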

Make `filter_loom.py` much faster

Try using `loompy.combine` along a shared axis (GRCh38 variant) to get everything in the same order, rather than looping through each variant.

Output filtered loom file

The flat variants + barcodes file contains so much repeated information that it takes a long time to write. Maybe just output the filtered loom file instead? Possibly even just the indexes of the kept variants and barcodes.

Add filtering parameters to command line

Set sensible defaults, but let the user specify non-default values for each parameter, either one by one or as a set (via a parameter file) that overrides all other given parameters. Then check that each parameter makes sense. Require overwriting to be requested explicitly.
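
A minimal sketch of this layering with `argparse` follows. The parameter names, default values, JSON file format, and the `--overwrite` flag are all hypothetical. It also assumes that values in the parameter file replace only the parameters the file names.

```python
import argparse
import json

# Hypothetical filtering parameters and defaults, for illustration only.
DEFAULTS = {"min_depth": 10, "min_genotype_quality": 30, "min_cell_fraction": 0.5}


def build_parser():
    parser = argparse.ArgumentParser(description="Filter a loom file (sketch).")
    parser.add_argument("--min-depth", type=int, default=DEFAULTS["min_depth"])
    parser.add_argument("--min-genotype-quality", type=int,
                        default=DEFAULTS["min_genotype_quality"])
    parser.add_argument("--min-cell-fraction", type=float,
                        default=DEFAULTS["min_cell_fraction"])
    parser.add_argument("--parameter-file",
                        help="JSON file whose values override the other parameters")
    parser.add_argument("--overwrite", action="store_true",
                        help="explicitly allow replacing existing output")
    return parser


def validate(params):
    """Raise ValueError if any parameter is outside its sensible range."""
    if params["min_depth"] < 0:
        raise ValueError("min_depth must be >= 0")
    if params["min_genotype_quality"] < 0:
        raise ValueError("min_genotype_quality must be >= 0")
    if not 0.0 <= params["min_cell_fraction"] <= 1.0:
        raise ValueError("min_cell_fraction must be between 0 and 1")


def resolve_parameters(argv):
    """Return (params, overwrite) after applying defaults, flags, then the file."""
    args = build_parser().parse_args(argv)
    params = {name: getattr(args, name) for name in DEFAULTS}
    if args.parameter_file:
        with open(args.parameter_file) as handle:
            from_file = json.load(handle)
        unknown = set(from_file) - set(DEFAULTS)
        if unknown:
            raise ValueError(f"unknown parameters: {sorted(unknown)}")
        params.update(from_file)  # the file wins over command-line values
    validate(params)
    return params, args.overwrite
```

Validation runs after the file is applied, so values from either source are checked the same way.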

Unpool loom file based on known genotypes

  • Need to know consensus genotypes for each patient at a set of SNPs
  • For each cell, measure how similar the available genotypes are to known consensus profiles
  • If one is clearly more similar, assign barcode to that patient
  • Output unpooled loom files
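
The assignment step in the list above could be sketched as follows. The genotype coding (3 = missing), the similarity measure (fraction of matching calls at SNPs called in both), and the `min_margin` threshold are assumptions, not the project's method.

```python
MISSING = 3  # assumed code for an uncalled genotype


def similarity(cell, profile):
    """Fraction of SNPs called in both where the genotypes agree; None if no overlap."""
    pairs = [(c, p) for c, p in zip(cell, profile) if c != MISSING and p != MISSING]
    if not pairs:
        return None
    return sum(c == p for c, p in pairs) / len(pairs)


def assign_barcode(cell, profiles, min_margin=0.2):
    """Return the patient whose consensus profile is clearly most similar, else None.

    cell     -- genotype codes for one barcode, one per SNP
    profiles -- dict mapping patient name to consensus genotype codes
    """
    scores = {}
    for patient, profile in profiles.items():
        score = similarity(cell, profile)
        if score is not None:
            scores[patient] = score
    if not scores:
        return None

    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    if len(ranked) == 1:
        return ranked[0][0]
    # "Clearly more similar": the best score must beat the runner-up by a margin.
    if ranked[0][1] - ranked[1][1] >= min_margin:
        return ranked[0][0]
    return None
```

Barcodes that return `None` stay unassigned, which keeps ambiguous cells out of every unpooled file.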

Annotate variants in a loom file

Use ensembl-vep to predict the effect of each variant in a loom file. Given a loom file, extract the variant sites to use as input to VEP.

Add directions for downloading VEP resources to the README.
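
As a rough sketch of the extraction step, the helper below formats one site in VEP's default whitespace-separated input format (chromosome, start, end, slash-separated alleles with the reference first, strand, optional identifier). It only handles single-nucleotide variants, and the function name is hypothetical. How sites are read from the loom file is left out because the source does not specify it.

```python
def vep_input_line(chrom, pos, ref, alt, identifier=None):
    """Format one single-nucleotide variant as a line of VEP default input."""
    if len(ref) != 1 or len(alt) != 1:
        # Insertions and deletions use different coordinates and are not covered here.
        raise ValueError("this sketch only handles single-nucleotide variants")
    # For an SNV, start and end are the same position; "+" is the forward strand.
    fields = [str(chrom), str(pos), str(pos), f"{ref}/{alt}", "+"]
    if identifier:
        fields.append(identifier)
    return "\t".join(fields)
```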

Speed up filter_loom.py

Run timing tests to see which steps are slowest, and improve them. Try to make everything run in under 1 minute.
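
A lightweight way to run such timing tests is to wrap each step in a timer and rank the results. In this sketch, the step names in the usage note are placeholders for whatever stages `filter_loom.py` actually has.

```python
import time
from contextlib import contextmanager


@contextmanager
def timed(step, store):
    """Add the wall-clock time spent inside the block to store[step]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        store[step] = store.get(step, 0.0) + time.perf_counter() - start


def slowest_first(store):
    """Return (step, seconds) pairs sorted from slowest to fastest."""
    return sorted(store.items(), key=lambda item: item[1], reverse=True)
```

Usage would look like `with timed("read loom", timings): ...` around each stage, followed by printing `slowest_first(timings)` at the end of a run. For a finer breakdown within a step, the standard library's `cProfile` module is an alternative.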
