Giter VIP home page Giter VIP logo

aws-batch-genomics's Introduction

Genomics Research on AWS

This project demonstrates how to run a large-scale, genomics secondary-analysis pipeline on AWS using AWS Step Functions and AWS Batch. You can learn more about how we used the new AWS Step Functions integration with AWS Batch in our blog post, Building Simpler Genomics Workflows on AWS Step Functions.

Standard Genomic Secondary-Analysis Workflow

PREREQUISITES

  • AWS CLI
  • Admin permissions for deployment

DEPLOY

Will build and deploy everything needed to run a secondary-analysis pipeline on AWS. This includes:

  1. Custom AMI with 1TB scratch volume using the ECS AMI for your region (10 min)
  2. Isaac, Strelka, Samtools-Stats and SnpEff as Docker images in ECR (40 min)
  3. AWS Batch queues, compute environments, job definitions and an AWS Step Functions state machine (10 min)
  4. Roles and buckets needed.
$ setup.sh 

USAGE

Run the Workflow

Copy the workflow input that was output to the terminal window after the deployment completed. Use that input to run the pipeline in the StepFunctions console. You can also find example workflow inputs in the outputs section of the batch-genomics-pipeline Cloudformation stack.

Update Pipeline Stack

Deploy changes made to local Cloudformation templates.

$ update.sh

Teardown Pipeline Stack

Deletes everything except the custom AMI and ECR images.

$ teardown.sh

aws-batch-genomics's People

Contributors

ajfriedman18 avatar delagoya avatar dougalb avatar gf-atebbe avatar stephen-terrell avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.