Giter VIP home page Giter VIP logo

bam2cram's Introduction

BAM file compression benchmark pipeline

Benchmarks of CRAM 2.1 and 3.0 are using the faster CRAM codecs; primarily deflate and rANS.

Also included is the performance of the proposed CRAM v3.1 standard. This is not yet a ratified GA4GH standard, but these figures give indicative results.

Rationale

The following Samtools (version 1.14) profiles are tested:

Profile CRAM versions options
default 3.0, 3.1
fast 3.0, 3.1 seqs_per_slice=1000, level=1
normal 3.0, 3.1 seqs_per_slice=10000
small 3.0 seqs_per_slice=25000, level=6,use_bzip2
small 3.1 seqs_per_slice=25000, level=6,use_bzip2,use_fqz
archive 3.0 seqs_per_slice=100000,level=7,use_bzip2
archive 3.1 seqs_per_slice=100000,level=7,use_bzip2,use_fqz,use_arith
archive lzma 3.0 seqs_per_slice=100000,level=7,use_bzip2, use_lzma
archive lzma 3.1 seqs_per_slice=100000,level=7,use_bzip2,use_fqz,use_arith, use_lzma

Requirements This workflow requires at least 2 CPUs and 4GB of memory.

Usage

Usage:
nextflow run main.nf --input input.csv --reference reference.fasta [Options]

Inputs Options:
--input         Input csv file with sample_id, bam and bai paths
--reference     Reference fasta file

Resource Options:
--cpus          Number of CPUs (int)
                (default: 2)  
--max_cpus      Maximum number of CPUs (int)
                (default: 2)
--memory        Memory (memory unit)
                (default: 4 GB)   
--max_memory    Maximum memory (memory unit)
                (default: 4 GB)
--time          Time limit (time unit)
                (default: 8h)
--max_time      Maximum time (time unit)
                (default: 8h)

bam2cram's People

Contributors

cgpu avatar dependabot[bot] avatar imendes93 avatar

Watchers

 avatar  avatar

Forkers

hsyed91

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.