Giter VIP home page Giter VIP logo

deepmass's Introduction

DeepMASS


DeepMASS is a known-to-unknown metabolite identification workflow, which includes a deep-learning based model to predict structural similarity between the unknown metabolites and the known ones based on their MS/MS spectra and a rank method for picking out the possible candidate structures of the unknowns.

Motivation

Metabolite identification is a great challenge restricting the metabolomics study. One of the main reasons is the limited number of high-quality MS/MS spectra of metabolites included in database. Using the transformational relationship and structural similarity between metabolites is a promising strategy to extend the number of metabolites can be identified by the existing database. Hence, we present the DeepMASS workflow, which is proved to be an effective way to predict the structural similarities between the unknown metabolite and the metabolites existing in the database.

Cite

Deep MS/MS-Aided Structural-Similarity Scoring for Unknown Metabolite Identification

Depends

Anaconda for python 3.6
# RDKit
conda install libboost=1.65.1
conda install boost=1.65.1
conda install boost-cpp=1.65.1
conda install -c rdkit rdkit
# Keras
conda install keras
# IsoSpec
pip install IsoSpecPy == 1.0.7

Installation

  1. Download the model here, then unzip it into /model/40V directory.
  2. Clone the repo and put spectra files in data/spectra directory

Note

2019/04/29

Under the requirement of the owner of experimental spectra (MetDNA dataset), the dataset has been removed. If you have already download the dataset, please keep it private, and the dataset can be only used to reproduce the results of DeepMASS paper. If you need the dataset for other use, please contact with the owner.

2019/05/13

Since the experimental spectra has been removed, this package cannot be run directly. You can train your own model with your in-house database. Otherwise, you can wait some days. I m trying to include spectra from public databases.

Usage

Take example.py as an example for metabolite identification.
If you want to train a model based on your in-house database, please put your spectra files in to data/spectra directory and run test.py.

Contact

[email protected]

deepmass's People

Contributors

hcji avatar

Stargazers

heliu avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.