Giter VIP home page Giter VIP logo

rnaseq_lsinapis's Introduction

Pipeline used in:

Gene expression profiling across ontogenetic stages in the wood white (Leptidea sinapis) reveals pathways linked to butterfly diapause regulation

Luis Leal, Venkat Talla, Thomas Källman, Magne Friberg, Christer Wiklund, Vlad Dincă, Roger Vila, and Niclas Backström. Molecular Ecology 27 (4), 935–48 (2018). https://doi.org/10.1111/mec.14501.


Luis Leal, Uppsala University, Uppsala, Sweden, 2017


1. QUALITY CONTROL

  • Filter libraries using TrimGalore
  • Mask libraries using Fastq Masker
  • Filter libraries using prinseq
  • Filter libraries using condetri
  • Screen libraries using fastQ Screen
  • Reconcile paired-end libraries after using fastQ Screen (removes non-paired reads)

(Note: Index fasta files as required.)

01_quality_control.sh

2. DE NOVO TRANSCRIPTOME ASSEMBLY USING TRINITY

(Note: Index fasta files as required.)

02_trinity_transcriptome_assembly.sh

3. FUNCTIONAL ANNOTATION BASED ON TRINOTATE PROTOCOL

  • a. Annotate each trinity transcript using modified Trinotate protocol
03_trinotate_functional_annotation.sh
  • b. Build local UniProt databases
04_build_local_UniProt_db.sh
  • c. Find GO terms associated to each Trinity gene
05_associate_GO_terms_to_genes.sh

4. TRANSCRIPT QUANTIFICATION USING KALLISTO

(Note: Index files as required.)

06_kallisto_transcript_quantification.sh

5. GENE COUNT CONSOLIDATION USING TXIMPORT

  • a. Convert counts-per-transcript to counts-per-gene using tximport

    • a1. Associate Trinity transcript name to its gene name
    07_associate_transcript_to_gene.py
    • a2. Convert Kallisto output files from count-per-transcript to count-per-gene
    08_tximport_aggregate_transcript_counts.R
  • b. Consolidate annotation file & gene counts

    09_tximport_consolidateGenes.py

6. DIFFERENTIAL GENE EXPRESSION ANALYSIS USING DESEQ2

10_DESeq2.R

7. GENE ONTOLOGY ENRICHMENT ANALYSIS USING TOPGO

  • a. Create two-column file that lists each Trinity gene and its associated GO terms
11_gene_to_GO.py
  • b. Perform gene ontology enrichment analysis
12_TopGO.R

rnaseq_lsinapis's People

Contributors

lln273 avatar

Stargazers

 avatar

Watchers

 avatar

Forkers

jun-jian-li

rnaseq_lsinapis's Issues

Confirm that use of BLAST's `-max_target_seqs` is intentional

Hi there,

This is a semi-automated message from a fellow bioinformatician. Through a GitHub search, I found that the following source files make use of BLAST's -max_target_seqs parameter:

Based on the recently published report, Misunderstood parameter of NCBI BLAST impacts the correctness of bioinformatics workflows, there is a strong chance that this parameter is misused in your repository.

If the use of this parameter was intentional, please feel free to ignore and close this issue but I would highly recommend to add a comment to your source code to notify others about this use case. If this is a duplicate issue, please accept my apologies for the redundancy as this simple automation is not smart enough to identify such issues.

Thank you!
-- Arman (armish/blast-patrol)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.