Giter VIP home page Giter VIP logo

its_workflow's Introduction

Amplicon data preprocessing and analysis workflow for Schneider et al.

This repository contains code to analyse the ITS amplicon sequencing data with DADA2 for further comparisons with RNA-seq data from the same samples. It uses data collected for Haas et al (2018), but using dada2 and Swarm clustering instead of OTU clustering.

Repository contents

The repository consists of two units that are run separately, and an additional folder with R scripts for analysis and plotting of the results:

1 demultiplex_wf -> download and demultiplex raw data

The folder contains instructions on how to run the workflow as a docker container. It will download the raw data from the ENA and demultiplex the sequences into files per sample, as well as concatenate technical replicates.

2 workflow -> ITS amplicon sequencing data preprocessing workflow

The folder contains a snakemake workflow that will reproduce the preprocessing of the demultiplexed ITS amplicon sequencing data as used in the study.

How to use it

The workflow is run through snakemake, from the root folder of the repository (where this readme sits). To continue with the demultiplexed data, we need to move it from the demultiplex_wf subfolder.

mkdir $(pwd)/data
mv $(pwd)/demultiplex_wf/data/ $(pwd)/

Next we will create a conda environment ("its_wf") needed to execute the snakemake workflow, and then activate it:

conda env create -n its_wf -f $(pwd)/environment.yml
conda activate its_wf

Once the conda environment has been created successfully, we can execute the workflow with the following command:

snakemake -s $(pwd)/workflow/Snakefile -pr -j 4 --use-conda

This will output the final count matrix and other results (such as sequences for every Swarm OTU and taxonomic assignments) into the results/ folder.

analysis -> R scripts to analyse ITS and RNA data

The folder contains scripts to reproduce the figures in the publication.

its_workflow's People

Contributors

andnischneider avatar johnne avatar

Stargazers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.