Giter VIP home page Giter VIP logo

tumortype-wgs's Introduction

TumorType-WGS

Classifying tumor types based on Whole Genome Sequencing (WGS) data. A web version of the model where vcf files can be uploaded and predict cancer types can be found at: DeepTumour

Training RF Models

$ Rscript train_models.R <dataType> <cancerType>

dataType is one of: SNV, SV, CNV, MUT, GEN, PTW, IND cancerType is one of the 24 cancer types.

Training DNN Models

$ python train_models_tumour_classifier.py <fold> <path/to/features>

tumortype-wgs's People

Contributors

waynejiao avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

tumortype-wgs's Issues

Trained Models and Example Input Data

Hi, I was just reading the associated Nature paper, which says that pre-trained models used in the paper can be found at this repository. Can you point me to where I can find these as well as an example of the input data so that I can give the model a try.

1-Mbp Bin Coordinates

Hi,

Have you ordered the ~3000 1-Mbp bins by genomic coordinates and chromosome number (chr1, chr2, etc)?

I want to use the mutational density values that you provided but I am unsure whether the bins are ordered (bin1: chr1:1000000-1999999, bin2: chr1:200000-2999999) or randomly shuffled.

Thank you!

Asking for providing the feature vectors

I read your paper 'A deep learning system accurately classifies primary and metastatic cancers using passenger mutation patterns'. I would like to reproduce the result from this paper by training the random forest and neural network model you provided here.
I would be glad if you could provide me the value of each feature (that you mentioned in Table 2: WGS feature types used in classifier) for PCAWG training and test datasets in a format that can directly be used to train and test the models. The feature vectors for the independent validation dataset would also be very helpful.

Running DNN model

I read your amazing nature paper and would like to try out the DNN method. Could you please provide a more detailed guideline about how to use your code, such as giving some examples to reproduce the results in the paper? Specifically, I have following questions. I wonder what the feature_type could be? Regarding the Keras model in the model folder, I wonder which folder you used? Could you please also provide the codes that how you slipt the training, validation, and testing samples? Thanks in advance.

Input format

Hello, I would also like to run this model. Could you please refer me to the place you mentioned the input file format for predict_cancer.py?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.