Giter VIP home page Giter VIP logo

vcf-filtering's People

Contributors

berntpopp avatar

Stargazers

 avatar  avatar

Watchers

 avatar

vcf-filtering's Issues

Implement filter_phenotypes.sh in filter_variants.sh

Use our final script "filter_phenotypes.sh" in "filter_variants.sh" to output the phenotypes of all filtered samples. If the xlsx flag is set, the output of filter_phenotypes.sh should be in a separate sheet. If the output is TSV (the default), it will be two files with the same basename and ".phenoytpes" appended before the "filter_phenotypes.sh" output. If the output is stdout, the phenotypes should not be printed. In general, there should be an optional flag to request phenotypes. Also, the location of the "filter_phenotypes.sh script and the input-file argument to it should be added as arguments to this script.

TODO:

  • Add new arguments: Add arguments to specify the location of the "filter_phenotypes.sh" script, the phenotype input file, and a flag to request phenotype output.
  • Handle new arguments and conditions: Modify the "filter_variants.sh" script to parse and handle these new arguments. Include logic to call "filter_phenotypes.sh" under the appropriate conditions.
  • Manage output formats: Handle different output scenarios (XLSX, TSV, STDOUT) as specified.
  • Requires #10

Make tsv_to_excel.R more generic

For conversion, the script "tsv_to_excel.R" should accept various separated file formats (including csv).

TODO:

  • add argument to explicitly set the separator
  • Include logic to determine the separator based on the file extension.
  • Include validation logic
  • rename script accoringly (maybe convert_to_excel.R)

Add a script to generate screenshots for visual validation

Add a script that will generate screenshots of the variants' alignment in the IGV browser so that it can be validated. This should be done for all the variants passing filtering. The screenshot should include at least one control.

TODO:

  • check for existing implementations and tools
  • check if IGV is suitable for this task
  • check what scriping language to use
  • decide for a format of the screenshots
  • decide for a file naming convention
  • decide for a folder structure
  • Implement the script

Implement sheet handling in tsv_to_excel.R script

  • Include an argument to set the sheet name in the excel file. This should have a default value.
  • Instead of overwriting, add an argument to add the sheet to an existing excel file.
  • Add logic to not overwrite existing sheets with the same name if they already exist, but instead add a number as an iterator.

Needed for #4

Statistics for the filtered table

Add basic statistics for the filtered table in echo command.
This could include for example:

  • number of variants
  • number of samples
  • number of genes
  • het / hom counts
  • type of variants (SNV, indel, missense, lof, etc)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.