Giter VIP home page Giter VIP logo

cutntag-benchmarking-analysis's Introduction

CUT&Tag benchmarking and analysis

This repository holds code used to carry out processing, benchmarking and analysis of CUT&Tag against ENCODE ChIP-seq, as shown in our preprint ("CUT&Tag recovers up to half of ENCODE ChIP-seq peaks"; doi: https://doi.org/10.1101/2022.03.30.486382).

The analysis consists of two main parts:

  1. Pre-processing (adapter trimming, alignment, peak calling, etc.) and a few additional analyses (e.g. heatmap generation) using Linux command line scripts
  2. Downstream processing in R

This demo was carried out on two real samples, Diagenode C15410196 (1:50) and CST9733, starting from fastq files. These files are too large to be hosted here but can be downloaded from the European Nucleotide Archive (sample accessions SAMN27024067 and SAMN27024069, respectively). All code required for processing and analysis is enclosed, and presented in a defined order where appropriate. Other outputs and smaller intermediate files have also been included where possible. The scripts assume a particular directory structure but note that base paths will need to be amended. A complete analysis may take up to 20 hours, depending on available resources and file sizes.

The following command line tools were used:

  • Cutadapt (version 3.3)
  • TrimGalore (version 0.6.6)
  • Bowtie (version 2.2.9)
  • Picard (version 2.6.0)
  • bedtools (version 2.30.0)
  • deeptools (version 3.5.1)
  • samtools (version 1.3.1)
  • SEACR (version 1.3)
  • MACS2 (version 2.1.4)
  • HOMER (version 4.10)

The following R packages were used:

  • GenomicRanges (version 1.38.0)
  • IRanges (version 2.30.0)
  • genomation (version 1.18.0)
  • clusterProfiler (version 3.13.4)
  • rtracklayer (version 1.56.0)
  • chromVAR (version 1.8.0)
  • ChIPseeker (version 1.22.1)
  • DiffBind (version 3.0.15)
  • BRGenomics (version 1.1.3)
  • marge (version 0.0.4.9999)

Required R packages can also be found in the session information provided in the html file with downstream analysis. Note that this pipeline is now available in the form of the EpiCompare R package (https://github.com/neurogenomics/EpiCompare).

References

CUT&Tag recovers up to half of ENCODE ChIP-seq peaks
Di Hu, Leyla Abbasova, Brian M Schilder, Alexi Nott, Nathan G Skene, Sarah J Marzi
bioRxiv 2022.03.30.486382; doi: https://doi.org/10.1101/2022.03.30.486382

EpiCompare: R package for the comparison and quality control of epigenomic peak files (2022)
Sera Choi, Brian M. Schilder, Leyla Abbasova, Alan E. Murphy, Nathan G. Skene
bioRxiv 2022.07.22.501149; doi: https://doi.org/10.1101/2022.07.22.501149

cutntag-benchmarking-analysis's People

Contributors

leylaabbasova avatar

Watchers

 avatar

Forkers

paulinaurbana

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.