Giter VIP home page Giter VIP logo

hi-geomvp's Introduction

Hi-GeoMVP: a hierarchical geometry-enhanced deep learning model for drug response prediction

Hi-GeoMVP is a deep-leaning model using graph neural networks to represent the drug and cell line features for drug response prediction.

plot Hi-GeoMVP takes drug geometry, chemical graph, and fingerprints, along with multi-omics data as input. Hi-GeoMVP employs a geometry-enhanced graph neural network (GeoGNN) and an multi-layer perceptron (MLP) for drug representation learning from drug features, and graph attention networks (GATa) and a variational autoencoder (VAE) for cell representation learning from multi-omics data. These representations are then integrated via a multi-view fusion module for drug response prediction (ln(IC50)). The model also utilizes latent representations alongside distinct prediction sub-networks for joint learning of drug and cell tasks, ultimately improving the prediction of drug response. The final output of the Hi-GeoMVP model is the predicted value of ln(IC50).

How to use:

1. Generate dataset:

Run create_cell_feat.py to generate the used cell_line feature.
Run create_drug_feat.py to generate the used drug feature. 
Run create_drp_dict.py to generate the drug_cell_ic50 feature and train_test split for mix,cell_blind and drug_blind sets.
All the used data are saved in .npy format and under the ./Data/DRP_dataset
An example to create the dataset and dataloader in Pytorch is provided in data_preprocess.py

2. Train the model:

We provide the script to train the model with different settings and hyperparameters:
train_type: 'cb', 'db', or 'mix' for cell blind, drug blind, and mix setting.
use_norm_ic50: 'True' or 'False', for usage of normalized IC50 value.
use_regulizer: 'True' or 'False', usage of MTL cancer type.
use_regulizer_drug: 'True' or 'False', usage of MTL drug threshold.
use_drug_path_way: 'True' or 'False', usage of MTL drug targeting pathway.
use_raw_gene: 'True' or 'False', usage of whole gene expression data. regular_weight: float number, the weight of MTL cancer type in the loss function.
regular_weight_drug: float number, the weight of MTL drug threshold in the loss function.
regular_weight_drug_path_way: float number, the weight of drug targeting pathway in the loss function.

You can run the training in the command line with:

python train_GeoMVP_5fold.py --train_type {train_type} 
    --device {device}--use_norm_ic50' {use_norm_ic50}  
    --use_drug_path_way {use_drug_path_way} 
    --use_regulizer {use_regulizer}
    --use_regulizer_drug {use_regulizer_drug} 
    --use_raw_gen {use_raw_gen}

hi-geomvp's People

Contributors

matcyr avatar

Watchers

 avatar

hi-geomvp's Issues

file miss

Hello author, are you missing a folder named "base_line" in your project
1

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.