Giter VIP home page Giter VIP logo

nf_cracpipeline's Introduction

nf_CRACpipeline

Introduction

nf_CRACpipeline is a nextflow workflow for processing and analyzing CRAC data. This workflow was rewritten in Nextflow DSL2 based on an earlier version of python script, CRAC_pipeline_SE_demult_dedup.py by Edward Wallace which was used in the paper:

Rosemary A Bayne, Uma Jayachandran, Aleksandra Kasprowicz, Stefan Bresson, David Tollervey, Edward W J Wallace, Atlanta G Cook, Yeast Ssd1 is a non-enzymatic member of the RNase II family with an alternative RNA recognition site, Nucleic Acids Research, 2021;, gkab615, doi: 10.1093/nar/gkab615

Pipeline Summary

This workflow make use of various 3rd party tools especially tools from pyCRAC software by Sander Granneman. A brief summary of all the tools in this pipeline and their function is listed below:

  1. Flexbar: Trimmed forward reads
  2. pyBarcodeFilter.py: Demultiplexed the trimmed reads
  3. FastQC: Evaluate the quality of the demultiplexed trimmed reads
  4. pyFastqDuplicateRemover.py: Collapsed duplicated reads
  5. Novoalign: Maps the colllapsed reads to a reference genome with reference to an index file
  6. samtools: convert sam files to bam files, sort and index bam files
  7. BamQC: Evaluate the quality of bam/sam files
  8. bedtools genomecov: Generate bedgraph files from sorted indexed bam files
  9. bedtools multicov: Counts reads to transcript features
  10. pyReadcounter.py: Quantify the number of reads that are overlapped to the genomic features
  11. pyGTF2bedGraph.py: Generate bedgraphs from gtf files
  12. pyPileup.py: Make pileup tables of reads and deletions/mutations for a given genelist
  13. pyCalculateFDR.py: Find peaks with false discovery rates on protein-coding genes

Installation

Before running this pipeline, Nextflow first needs to be installed. Please follow the instruction here for installation. Please make sure you have all the required dependencies listed above installed before running the pipeline.

Quick start

To run this pipeline, use the command below:

nextflow run main.nf \
    --reads  reads.fastq \ 
    --adapterfile adapter.fasta \ 
    --barcode Barcodes.txt \ 
    --novoindex myindexfile.novoindex \ 
    --transcriptgff myfile.gff \
    --gtf mygtffile.gtf \
    --chromosome mychromosome.lengths \ 
    --genelist mygenelist.txt \
    --genometab  mygenome.tab 

For more information on the arguments of this pipeline, simply run the command: nextflow run main.nf --help

nf_cracpipeline's People

Contributors

jingqichong avatar

Stargazers

 avatar

Watchers

 avatar

Forkers

ewallace

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.