Giter VIP home page Giter VIP logo

dictys's Introduction

Dictys

Dictys reconstructs cell-type specific and dynamic gene regulatory networks (GRN) from scRNA-seq and scATAC-seq datasets. Dictys first infers a Transcription Factor (TF) binding network with TF footprinting from single-cell chromatin accessibility. Then Dictys refines the edges with single-cell transcriptome. Dictys addresses traditional challenges in network inference by orienting causality with TF binding information, modeling transcriptional rate to reconstruct cycle-compatible networks, and using probabilistic programming to capture the scRNA-seq process.

Dictys provides network analysis and visualization at global (across all cell types), pairwise (between two cell types) and single GRN levels. Dictys directly quantifies TF regulatory activity from GRN and enables a series of analyses such as cell-type specific TF discovery as regulation markers, differential regulation analysis alongside differential expression, and TF regulatory program illustration through its subnetwork and top activation/repression targets. These GRN-based analyses can capture unique biological insights not available from mean expression.

Dictys infers and analyzes dynamic GRN from scRNA-seq and scATAC-seq datasets along (inferred) trajectories. This avoids artificial cell subsets and potential biases from population imbalance, and allows (pseudo-)time-resolved discovery and investigation of driver TFs and their individual regulations. Dictys provides an integrative network viewer for dynamic GRN visualization of synchronous panels in animation.

Overview

https://raw.githubusercontent.com/pinellolab/dictys/master/doc/images/logo.png

https://raw.githubusercontent.com/pinellolab/dictys/master/doc/images/Dictys_overview.png

Installation

Dictys has dependencies not in python. The options below automatically install these dependencies. Installation should take ~<10 mins.

Option 1: with Anaconda

First install Anaconda/Miniconda. Then, install Dictys with CPU computation:

conda create -y -n dictys -c conda-forge python=3.9 mamba
. activate dictys
mamba install -y -c lingfeiwang -c bioconda -c conda-forge -c pytorch dictys cpuonly

This will create a conda environment named dictys.

Alternatively, with GPU computation (here CUDA 11.3):

conda create -y -n dictys -c conda-forge python=3.9 mamba
. activate dictys
mamba install -y -c lingfeiwang -c bioconda -c conda-forge -c pytorch dictys cudatoolkit=11.3

Option 2: with bash script

First install Anaconda/Miniconda. Then, install Dictys with CPU computation:

wget https://tinyurl.com/dictys -O - | bash

This will create a conda environment named dictys.

Alternatively, under a different conda environment name:

wget https://tinyurl.com/dictys -O - | CONDAENV_NAME=your_favorite_name bash

Alternatively, with GPU computation (here CUDA 11.3):

wget https://tinyurl.com/dictys -O - | CUDAVERSION_CONDA=11.3 bash

Option 3: with containers

TBA

Additional notes

For more advanced installation, see INSTALL.md and/or edit the install script.

Note: dynamic network inference is computationally intensive and GPU availability is highly recommended. Running time depends on the dataset, but it can take weeks or longer without a GPU.

If you need STREAM, ArchR, or other softwares upstream of Dictys, we recommend to install them in separate environments following their official instructions.

Updating Dictys

If your minor version is the latest (e.g. your installed version is 0.1.0 and the latest release is 0.1.9), you can update Dictys to the latest github version with pip3 install --no-deps git+https://github.com/pinellolab/dictys inside your Dictys conda environment.

If your minor version is not the latest (e.g. your installed version is 0.1.0 but the latest release is 0.2.0), you should reinstall Dictys in a new conda environment with any option above.

Tutorials

We provide several tutorials for different data types. Please download each tutorial folder structure before running. Note that these tutorials are not intended to fully replicate the results in the paper due to differences in software versions, computing platforms, various randomness e.g. in HOMER genome preparsing or Pytorch algorithms, etc.

  1. short-multiome: a single-notebook tutorial for the data preparation, inference, and analysis of context specific networks on 10x multiome data for human blood.
  2. full-multiome: an extended version of the above tutorial with detailed usage.
  3. full-skin: a short tutorial for the inference and analysis of dynamic networks on SHARE-seq data for mouse skin.

The network analysis tutorials below use the same reconstructed networks as in the paper and are designed to fully replicate the results.

  1. analysis-blood: a simple tutorial for context specific and dynamic network analysis on separate scRNA-seq and scATAC-seq quantifications of human blood as in manuscript.
  2. analysis-skin: a simple tutorial for context specific network analysis on SHARE-seq of mouse skin as in manuscript.

Gallery

The figures below are produced with the blood example dataset. You can reproduce them with the analysis-blood example. See Tutorials. Each figure is linked to the jupyter notebook that produces it.

Cell-type specific GRN analyses

Regulation marker TF discovery

https://raw.githubusercontent.com/pinellolab/dictys/master/doc/images/Global_dotplot.png

Top activation target heatmap for select TFs

https://raw.githubusercontent.com/pinellolab/dictys/master/doc/images/Global_heatmap.png

Differential regulation v.s. differential expression scatter plot; integrative TF rank plot

https://raw.githubusercontent.com/pinellolab/dictys/master/doc/images/Diff_analysis.png

Subnetwork for select TF

https://raw.githubusercontent.com/pinellolab/dictys/master/doc/images/Subnet.png

Dynamic GRN analysis

Driver TF discovery based on regulatory activity curve

https://raw.githubusercontent.com/pinellolab/dictys/master/doc/images/Dynamic_discovery.png

Dynamic GRN animation

https://raw.githubusercontent.com/pinellolab/dictys/master/doc/images/animation.gif

FAQ

  • How do I perform network inference faster?

    1. Get a GPU, such as:
      • Google Colaboratory offers free GPU access with zero/minimal setup. You can run Dictys on very small datasets for free, or larger datasets with paid membership. See our tutorial.
      • Major cloud computing service providers offer GPU access that is orders of magnitude cheaper than a scRNA-seq experiment.
      • High-performance computing cluster with GPU access at institution or other levels. Dedicated computing server. Personal computer with high-end consumer level GPU.
      • People or labs with the above access.
    2. Reduce the computational load, such as:
      • For context specific networks, choose only cell clusters of your interest. For this, delete the uninterested cell clusters in data/subsets.txt.
      • For dynamic networks, use fewer windows. This risks reducing time resolution. Details TBA.
      • Reduce the number of training steps. This risks reducing network quality. Details TBA.
    3. Configure properly for a powerful CPU. Details TBA.
  • Why do I see this error: AssertionError: Torch not compiled with CUDA enabled?

    This is because you installed a CPU-only pytorch but tried to run it on GPU. You have several options:

    1. To run pytorch on CPU, run dictys_helper makefile_update.py path/to/config.mk '{"DEVICE": "cpu"}' to configure to CPU mode. See Tutorials to find the right place to run this command.
    2. To run pytorch on GPU, reinstall Dictys with the correct options to enable GPU support at Installation.
  • How do I save figures from jupyter notebooks onto the disk?

    You can use plt.savefig('output.pdf') to save the current figure to disk. See matplotlib.pyplot.savefig.

    Some visualization functions in Dictys return two or more figures, such as figs = net.draw_discover(...). You can save them separately with figs[0].savefig('output1.pdf'); figs[1].savefig('output2.pdf'); .... See matplotlib.figure.savefig and issue 15.

Issues

Please raise an issue on github.

References

Dictys: dynamic gene regulatory network dissects developmental continuum with single-cell multi-omics bioRxiv (2022)

dictys's People

Contributors

lingfeiwang avatar dependabot[bot] avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.