Giter VIP home page Giter VIP logo

beast's Introduction

BEAST

Bacterial Epigenomics Analysis SuiTe

Description

With only a few thousand bacterial methylomes published to date, it is becoming increasingly evident that epigenetic regulation of gene expression is highly prevalent across bacterial species. Despite the exciting prospects for studying epigenetic regulation, our ability to comprehensively analyze bacterial epigenomes is limited by a bottleneck in integratively characterizing methylation events, methylation motifs, transcriptomic data, and functional genomics data. In this regard, we provided the first comprehensive comparative analysis of a large collection of epigenomes in a single bacterial species, as well as a detailed roadmap that can be used by the scientific community to leverage the current status quo of epigenetic analyses (Oliveira et al., 2019, Nat. Microbiology).

PLEASE NOTE: BEAST includes a set of R and wrapper shell scripts that allow to reconstitute the major analyses steps of this publication. However, this pipeline requires multiple dependencies, which have to be installed prior the use of BEAST. The user has two possibilities:

  • For those having a basic programming knowledge, they are ore more than welcome to install these dependencies and run BEAST (a detailed step-by-step tutorial is nevertheless provided in the Documentation).
  • For newcomers or those less familiar with installing some of the dependencies, we have provided links to Docker containers with pre-installed dependencies here.

Sections

Details on the requirements and usage of all shell wrapper scripts can be accessed by scriptname -h (or -help).

1) Motif refining tool for the SMRT-seq pipeline

  • Motif_Refine.R performs the refining of methylation motifs estimated by the SMRT portal pipeline. Takes a Parameter file as input (provided as example). Each genome to be analysed should be placed in a separate folder containing the genome fasta file, and the output files of the SMRT pipeline (Modifications and Motif files).

2) Analysis of methylation motif enrichment / depletion using Markov models and a multiscale representation framework

  • GO_Abundance.sh maps methylation motifs in a FASTA file and computes their scores of exceptionality using Markov models.

  • GO_MSR.sh highlights chromosomal regions with enrichment / depletion of a given signal (methylation motifs in this case) using the Multiscale Signal Representation (MSR) method. An example parameters.txt is also given.

  • Plot_MSR.R is used to plot the pruned MSR output.

3) Analysis of orthologous conserved / variable methylation motifs

  • GO_ConsVar.sh performs multiple whole-genome alignment and looks at the conservation of methylation motifs across genomes.

4) Analysis of Transcription Factor Binding Sites (TFBS) and Transcription Start Sites (TSS) in bacterial genomes

  • GO_TFBS.sh takes a TFBS multifasta file, and computes the corresponding PSSM and TFBS hits in a given genome. An example of a TFBS multifasta file (XylR.fasta) is provided.

  • GO_PSSM.R is used to build a PSSM.

  • GO_TSS.sh estimates transcription start sites (TSS) through the reconstruction of a transcriptional landscape from RNA-Seq data. An example *.chrom.sizes file in IGV format is provided.

5) Pipeline to compute differential expression genes from RNA-seq data

  • GO_GetCounts.sh performs RNA-seq paired-end read cleaning, mapping, and counting for differential expression (DE) analysis.
  • GO_DE.R performs the DE analysis. A CDIF.count.txt and a colData.csv files are provided as examples.

6) Pipeline to compute homologous recombination and horizontal gene transfer

  • GO_HR.sh computes homologous recombination events given an ordered core genome alignment and corresponding phylogenetic tree in Newick format.

  • GO_TTR-MBL.R is used to compute transition / transversion ratios (TTR) and mean branch lengths (MBL).

  • GO_HGT.sh uses a pan-genome matrix and phylogenetic tree to perform ancestral reconstruction and infer family and lineage specific characteristic along the species' tree. An example pan-genome matrix and phylogenetic tree are provided.

7) Detection of mobile genetic elements and defense systems

  • Restriction modification systems were detected using dedicated HMM profiles and scripts previously published in Oliveira et al, 2014 and available here.
  • GO_CRISPRs.sh computes CRISPRs from a FASTA file and parses the output into a TAB format.
  • GO_Prophages_Integrons.sh computes prophages using Phage Finder and integrons using Integron Finder.

Documentation

For a comprehensive guide on each section and corresponding scripts, please see the documentation available here.

beast's People

Contributors

oliveira-lab avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

beast's Issues

About demo data

Hi,

Do you have a demo data for analysis and step-by-step anaysis practice.

Thank you !

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.