Giter VIP home page Giter VIP logo

meripseqpipe's Introduction

MeRIPseqPipe

MeRIP-seq analysis pipeline arranged multiple alignment tools, peakCalling tools, Merge Peaks' methods and methylation analysis methods..

Build Status Nextflow

install with bioconda Docker

Introduction

The pipeline is built using Nextflow, a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It comes with docker containers making installation trivial and results highly reproducible.N6-methyladenosine (m6A) is the most prevalent modification in the mRNA of many eukaryotic species, including yeast, plants, flies, and mammals. In order to analyze m6A-seq data, we developed a user-friendly, integrated analysis pipeline called MeRIPseqPipe based on Nextflow. It integrated ten main functional modules including data preprocessing, quality control, read mapping, peak calling, peak merging, motif searching, peak annotation,differential methylation analysis, differential expression analysis, and data visualization. 

Quick Start

i. Install nextflow

ii. Install one of docker or conda

iii. Download the pipeline and test it on a minimal dataset with a single command

nextflow run path/to/meripseqpipe -profile test,<docker/singularity/conda>

Please check nf-core/configs to see if a custom config file to run nf-core pipelines already exists for your Institute. If so, you can simply use -profile <institute> in your command. This will enable either docker or conda and set the appropriate execution settings for your local compute environment.

iv. Start running your own analysis!

nextflow run path/to/meripseqpipe -profile <docker/singularity/conda> --designfile 'designfile.tsv' --comparefile 'comparefile.txt'  --fasta path/to/genome_fasta.fa --gtf path/to/genome_annotation.gtf

See usage docs for all of the available options when running the pipeline.

Documentation

The nf-core/meripseqpipe pipeline comes with documentation about the pipeline, found in the docs/ directory:

  1. Installation
  2. Pipeline configuration
  3. Running the pipeline
  4. Output and how to interpret the results
  5. Troubleshooting

Credits

MeRIPseqPipe was originally written by Kaiyu Zhu, Yu Sun, Xiaoqiong Bao.

Contributions and Support

If you would like to contribute to this pipeline, please see the contributing guidelines.

For further information or help, don't hesitate to get in touch on Slack (you can join with this invite).

Citation

You can cite the nf-core publication as follows:

The nf-core framework for community-curated bioinformatics pipelines.

Philip Ewels, Alexander Peltzer, Sven Fillinger, Harshil Patel, Johannes Alneberg, Andreas Wilm, Maxime Ulysse Garcia, Paolo Di Tommaso & Sven Nahnsen.

Nat Biotechnol. 2020 Feb 13. doi: 10.1038/s41587-020-0439-x.
ReadCube: Full Access Link

meripseqpipe's People

Contributors

lazyky avatar juneb4869 avatar

Stargazers

 avatar BioWalker avatar Seoyeon Kim avatar Colin Davenport avatar Gabriel Lencioni Lovate avatar  avatar pd321 avatar shirley avatar Songqi Duan avatar Perry Evans avatar HarukaM avatar liqiming avatar  avatar Jeremy Leipzig avatar  avatar hdchen avatar Yihai Luo avatar  avatar  avatar Qi ZHAO avatar Regina avatar

Watchers

 avatar

meripseqpipe's Issues

reads vs readPaths

Hi I am pretty confused by the reads vs readPaths arguments. It looks almost from your code that you expect readPaths to be a two-column table of sample_name and path. I don't see how passing reads alone will pass enough to raw_data to fulfill set sample_name, file(reads) from raw_data

merge_peaks_by_bedtools.sh: ***** ERROR: Requested column 4, but database file - only has fields 1 - 0.

Hi,

I am trying to recreate Fig 4 from Dominissi et al (https://www.nature.com/articles/nature11112) using meripseqpipe, and keep getting the same error in the PeakMerge process:

***** ERROR: Requested column 4, but database file - only has fields 1 - 0.
There is no peaks in one of the merged peaks files`
Merge Peaks by macs2 may not be suitable for your data.`

I have checked all of the .bed files used in this process, and all have 5 columns of data. By breaking up the mergebedByBedtools() function I've deduced this line results in an empty tmp._others_allPeaks.bed file.

ls *${prefix_id}*normalized.bed | grep -v ${peakCalling_tools_main} | xargs -i cat {} | sortBed -i - | mergeBed -i - -c 4,5 -o count,mean > tmp.${prefix_id}_others_allPeaks.bed

When, for example, macs2 is used, only macs2*normalized.bed are available so grep -v returns nothing and mergeBed terminates with the error ***** ERROR: Requested column 4, but database file - only has fields 1 - 0. (See screenshots).

image
image

You can see my temp3.bed file is producing the error from mergeBed, since its empty.

  • What is the purpose of tmp._others_allPeaks.bed?
  • Can I simply comment out the offending line?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.