Giter VIP home page Giter VIP logo

zhejiang2020's Introduction

DOI .github/workflows/basic_checks.yaml Docker

Introduction to bulk and single-cell RNA sequencing analyses

zhejiang2020

Instructor names and contact information

  • Xueyi Dong <dong.x at wehi.edu.au>
  • Luyi Tian <tian.l at wehi.edu.au>
  • Hongke Peng <peng.h at wehi.edu.au>
  • Stefano Mangiola <mangiola.s at wehi.edu.au>

Syllabus

Material web page.

This material was created for the Zhejiang 2020 workshop workshop but it can also be used for self-learning.

More details on the workshop are below.

Workshop package installation

This is necessary in order to reproduce the code shown in the workshop. The workshop is designed for R 4.0 and can be installed using one of the two ways below.

Via GitHub

You can install the workshop using the commands below in R 4.0.

# Install dependency manually

# Open R
Open R 4.0 or newer

# Install devtools if you do not have it
install.packages("devtools")

# Install the workshop package from Github
devtools::install_github("stemangiola/zhejiang2020", build_vignettes = TRUE)

# Load the workshop
library(zhejiang2020)

# List the vignettes, present in the vignettes directory
browseVignettes("zhejiang2020")

Workshop Description

This workshop will present how to perform analysis of bulk and single-cell RNA sequencing count data following base R paradigm. Example of the use of tidy paradigm is given at the end of each section.

The bulk analyses were based on the Bioconductor workflow package RNAseq123 and the workshop for tidy transcriptomics BioC Asia 2020

Pre-requisites

  • Basic knowledge of RStudio
  • Familiarity with R base and tidyverse syntax

Recommended Background Reading Introduction to R for Biologists

Workshop Participation

The workshop format is 2 days, 2 hours sessions each day consisting of hands-on demos with Q&A.

R / Bioconductor packages used

  • dittoSeq
  • dplyr
  • edgeR
  • ggplot2
  • ggrepel
  • Glimma
  • gplots
  • igraph
  • limma
  • Mus.musculus
  • purrr
  • R.utils
  • RColorBrewer
  • readr
  • RNAseq123
  • scater
  • scran
  • SingleCellExperiment
  • SingleR
  • stats
  • stringr
  • SummarizedExperiment
  • tibble
  • tidybulk
  • tidyr
  • tidySingleCellExperiment
  • utils

Time outline

First day

Activity Time
Bulk RNA sequencing analyses 1h 20m
Questions 20m
Break 30m
Tidy bulk RNA sequencing analyses 30m
Questions 20m

Second day

Activity Time
Single-cell RNA sequencing analyses 1h 20m
Questions 20m
Break 30m
Tidy single-cell RNA sequencing analyses 30m
Questions 20m

Workshop goals and objectives

In exploring and analysing RNA sequencing count data, there are a number of key concepts, such as filtering, scaling, dimensionality reduction, hypothesis testing, clustering and visualisation, that need to be understood.

Learning goals

  • To understand the key concepts and steps of RNA sequencing count data analysis
  • Apply the concepts to publicly available data
  • Create plots that summarise the information content of the data and analysis results
  • To approach critical thinking

zhejiang2020's People

Contributors

stemangiola avatar you-k avatar xueyidong avatar

Stargazers

Pengfei Xu avatar Connie Li-Wai-Suen avatar

Watchers

James Cloos avatar Luyi Tian avatar  avatar  avatar  avatar  avatar

zhejiang2020's Issues

issue with extdata/pbmc4k_raw_gene_bc_matrices.tar.gz

Hello @YOU-k ,

I know you dropped from the workshop, I will invite Hongke. Could you please assist him into solving the problem of installing our workshop on windows.

The problem in about the file extdata/pbmc4k_raw_gene_bc_matrices.tar.gz

> devtools::install_github("stemangiola/zhejiang2020", build_vignettes = TRUE)
Downloading GitHub repo stemangiola/zhejiang2020@HEAD
stemangiola-zhejiang2020-d24f938/inst/extdata/pbmc4k_raw_gene_bc_matrices.tar.gz: truncated gzip input
tar.exe: Error exit delayed from previous errors.

Header

Hello @XueyiDong ,

I would suggest to replace the header "Unsupervised clustering of samples" with "Dimensionality reduction". As I we don't show any clustering operation performed there.

Pull requests

Thanks @XueyiDong and @YOU-k for creating your branch. When you are ready create a pull request and put myself as reviewer.

You can perform the check locally on your Rstudio and monitor the github action to see if you matrial passes the tests in terms of dependencies and data. For example,

image

You can go into details on what has failed. Once we are happy and all checks are passed I will merge and create a branch to add the tidy part.

We are almost there.

Make data self contained

Hello @XueyiDong ,

we should keep the repository self contained. Could you please put this data into the Data directory ("xz" compression). It should be smaller than 10MB if possible, and surely smaller than 100Mb for github limitations.

url <- "https://www.ncbi.nlm.nih.gov/geo/download/?acc=GSE63310&format=file"
utils::download.file(url, destfile="GSE63310_RAW.tar", mode="wb") 
utils::untar("GSE63310_RAW.tar", exdir = ".")
files <- c("GSM1545535_10_6_5_11.txt", "GSM1545536_9_6_5_11.txt", "GSM1545538_purep53.txt",
  "GSM1545539_JMS8-2.txt", "GSM1545540_JMS8-3.txt", "GSM1545541_JMS8-4.txt",
  "GSM1545542_JMS8-5.txt", "GSM1545544_JMS9-P7c.txt", "GSM1545545_JMS9-P8c.txt")
for(i in paste(files, ".gz", sep=""))
  R.utils::gunzip(i, overwrite=TRUE)

Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.