Giter VIP home page Giter VIP logo

rig's Introduction

RIG

This respository is an implementation of the RIG workflow as a series of Bash scripts. Users should seriously consider a Scala-based implementation using GATK's Queue before resorting to Bash. We resorted to Bash due to Queue and our compute cluster not playing nicely together. We provide this code as is, without warranty or guarantee. Users interested in porting this to their architecture would likely need to change the way resources are requested via qsub, and change how logging files are parsed by the utility functions. See the publication for additional information on the workflow: http://g3journal.org/content/early/2015/02/13/g3.115.017012.full.pdf+html

The current implementation consists of two configuration files, a group of pipelines, the jobs called by the pipelines, and some utility scripts. The pipelines are:

  1. Naive Reduced Representation qsub (NRRB)
  2. Informed Reduced Representation qsub (IRRB)
  3. Initial Informed Whole Genome qsub (IIWGB)
  4. Informed Whole Genome qsub (IWGB)

The current version for each pipeline can be found in its respective directory, and the job scripts those pipelines launch are in the jobScripts directory.

Those interested in the annotations and software versions corresponding to the original RIG manuscript that were used for hard filtering and VQSR at the time of writing will find those in the 2015_manuscriptParameters directory:

  1. Naive sorghum RR population and family hard filtering: https://github.com/Frogee/RIG/tree/master/2015_manuscriptParameters/NaiveSorghumPopulationRR
  2. Naive Arabidopsis WGS hard filtering:
  3. https://github.com/Frogee/RIG/tree/master/2015_manuscriptParameters/NaiveArabidopsisWGS
  4. Informed sorghum WGS recalibration: https://github.com/Frogee/RIG/tree/master/2015_manuscriptParameters/InformedSorghumWGS

How to run a pipeline

Choose and modify one of the two configuration files:

  1. If you are starting with .fastq files, choose the fastq configuration file.
  2. If, as part of the WGS pipelines, you have already converted .fastq files to .bams using the fastq configuration file, choose the bam configuration file.

Modify the configuration file to specify a number of paths and resources, as well as the pipeline that will be used.

The configuration file itself is a bash script that is executed, which in turn launches the designated pipeline. The pipeline submits jobs, tracks job status, and manages job submission.

Contact

Contact Ryan McCormick at [email protected] with questions or comments regarding the Bash implementation or RIG design. The GATK forums are also a great place for finding answers: http://gatkforums.broadinstitute.org/

rig's People

Contributors

frogee avatar mulletlab avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.