Giter VIP home page Giter VIP logo

training's Introduction

CIDGOH training materials

This repository collects the CIDGOH training materials about genomics data analysis.

Index of the Material

Genomics data analysis tutorials

Jupyter notebooks with tutorials for bioinformatics training in the Center for Infectious Diseases Genomics and One Health. Each notebook is executable in the institution cluster and reproduces common steps in most bacterial genomic analysis pipelines. It includes links to further references and training material.

Common processes in the group for microbial genomics isolate analysis include:

  1. Quality control and filtering of raw reads
    • Appraisal of quality in sequencing results
    • Filtering of low quality regions
    • Trimming adapter sessions
    • Visualization of quality control
  2. Assembly of draft genome
    • De-novo assembly (polishing of final fasta files)
    • Appraisal of quality in the resulting assembly
  3. Genome annotation and downstream analysis
    • Annotation of genomic features using comprehensive databases
    • Define a matrix of genomic distances (SNP)
    • Produce a phylogenetic tree

Set up for tutorials

They can be executed directly by accessing the web interface of the seagull cluster at https://seagull.cidgoh.ca/.

Open up a terminal inside Jupyter and run the following command to copy all the necessary files into your home directory. Everything else, including tools and data, should be easily accessible once you start to execute the tutorial notebooks.

# change into your home directory at seagull
cd ~

# download the three tutorial notebooks there
wget https://github.com/cidgoh/training/blob/main/genomics_data_analysis/annotation_genomes.ipynb \
    https://github.com/cidgoh/training/blob/main/genomics_data_analysis/genome_assembly.ipynb \
    https://github.com/cidgoh/training/blob/main/genomics_data_analysis/raw_reads_processing.ipynb

Troubleshooting - common issues

If by any chance, you cannot find a bash kernel (interpreter for shell command line in jupyter), you can quickly create one for your own use.

First, open a terminal inside Seagull and run the following commands to create a virtual environment and install the bash_kernel inside

conda create -n bash_kernel
conda activate bash_kernel
pip3 install -m bash_kernel
python3 -m bash_kernel.install
conda deactivate

Then, in the Jupyter notebook interface, you should be able to choose bash as your kernel and run your notebook tutorial.

training's People

Contributors

duanjunhyq avatar

Watchers

William Hsiao avatar

Forkers

azmigueldario

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.