Giter VIP home page Giter VIP logo

multimorbid3d's Introduction

Multimorbid3D

This repository contains scripts used to predict statistically significant co-occurring diseases/conditions (in GWAS catalog) of a disease of interest.

Installation

Clone the repository using the following command:

https://github.com/Genome3d/multimorbid3D.git
cd multimorbid3D

Setup

Ensure that you have conda installed. In this example, environment is created in the multimorbid3D directory.

  1. Create a conda environment from environment.yaml file
conda env create --prefix ./multimorbid3D --file environment.yaml
  1. Activate the environment
conda activate multimorbid3D/
  1. Deactivate the environment after usage
conda deactivate 

Required datasets

  • Input: a list of SNPs or genes or a trait

  • Gene regulatory network (GRN). It is a collection of regulatory interactions between SNPs and genes (can be build using CoDeS3D pipeline)

  • To also include linked SNPs in the analysis: pre-calculated pairwise linkage disequilibrium data for all the SNPs in 1000 genome project

  • To use protein-protein interactions from PROPER, download the database from the website

Basic usage

For details on the expected inputs, run the comorbid.py script with -h argument as shown below

(/m/p/u/s/multimorbid3D/multimorbid3D) :/mnt/projects/multimorbid3D$ python comorbid.py -h
usage: comorbid.py [-h] [-g GENES [GENES ...]] [-s SNPS [SNPS ...]] [--trait TRAIT] [--pmid PMID] --grn-dir GRN_DIR [--gwas GWAS] -o OUTPUT_DIR [-l LEVELS] [-p {string,proper}]
                   [--string-score STRING_SCORE] [--bootstrap] [--bootstraps BOOTSTRAPS] [--keep-bootstraps] [--non-spatial] [--non-spatial-dir NON_SPATIAL_DIR] [--snp-ref-dir SNP_REF_DIR]
                   [--gene-ref-dir GENE_REF_DIR] [--ld] [-c CORRELATION_THRESHOLD] [-w WINDOW] [--population {EUR}] [--ld-dir LD_DIR] [--string-version]

Identify multimorbid traits based on eQTL associations and protein-protein interactions.

optional arguments:
  -h, --help            show this help message and exit
  -g GENES [GENES ...], --genes GENES [GENES ...]
                        A space-separated list of gene symbols or filepath to a file containing gene symbols in the 'gene' column.
  -s SNPS [SNPS ...], --snps SNPS [SNPS ...]
                        A space-separated list of SNP rsIDs or filepath to a file containing SNP rsids in the 'snp' column.
  --trait TRAIT         GWAS trait to query. Note: this flag is mutually exclusive with the --snps and --pmid flags
  --pmid PMID           PubMed ID of the GWAS to query. Note: this flag is mutually exclusive with the --snps and --trait flag
  --grn-dir GRN_DIR     Directory containing tissue gene regulatory network. The subdirectories should contain significant_eqtls.txt for each chromosome.
  --gwas GWAS           Filepath to GWAS associations. Default: Associations from the GWAS Catalog (https://www.ebi.ac.uk/gwas/api/search/downloads/full)
  -o OUTPUT_DIR, --output-dir OUTPUT_DIR
                        Directory to write results.
  -l LEVELS, --levels LEVELS
                        Path length (i.e. number of nodes) to query. Default = 1
  -p {string,proper}, --ppin {string,proper}
                        The protein-protein-interaction database(s) to use. Default: string
  --string-score STRING_SCORE
                        Cut-off score for STRING interactions. Default = 0.7
  --bootstrap           Perform a bootstrap. Default = False
  --bootstraps BOOTSTRAPS
                        Number of bootstrap datasets. Default: 1000
  --keep-bootstraps     Keep bootstrap results. Default: False
  --non-spatial         Include non-spatial eQTLs. Default = False
  --non-spatial-dir NON_SPATIAL_DIR
                        Filepath to non-spatial eQTLs.
  --snp-ref-dir SNP_REF_DIR
                        Filepath to SNP BED databases.
  --gene-ref-dir GENE_REF_DIR
                        Filepath to gene BED.
  --ld                  Include LD SNPs in identifying eQTLs and GWAS traits. Default = False
  -c CORRELATION_THRESHOLD, --correlation-threshold CORRELATION_THRESHOLD
                        The r-squared correlation threshold to use.
  -w WINDOW, --window WINDOW
                        The genomic window (+ or - in bases) within which proxies are searched. Default = 5000
  --population {EUR}    The ancestral population in which the LD is calculated. Default = "EUR"
  --ld-dir LD_DIR       Directory containing LD database.
  --string-version      Use this flag to specify the version of STRING database to use. Available versions: latest, 11.5, 11.0b, 11.0, 10.5, 10.0.

multimorbid3d's People

Contributors

sreemol-gokuladhas avatar tayaza avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.