Giter VIP home page Giter VIP logo

ram-compbio / cando Goto Github PK

View Code? Open in Web Editor NEW
33.0 5.0 9.0 11.85 MB

Computational Analysis of Novel Drug Opportunities

Home Page: http://compbio.org/protinfo/cando

License: BSD 3-Clause "New" or "Revised" License

Python 89.32% Jupyter Notebook 10.68%
drug-discovery drug-repurposing drug-design multitarget shotgun python bioinformatics biomedical-informatics machine-learning docking

cando's People

Contributors

compbio314 avatar melissavannorden avatar willmangione avatar zfalls avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

cando's Issues

similar_compounds()

  • Index out of bounds
  • variable n to topX for consistency
  • add -1 as default for output

Add anaconda env download link/command

Update README.md with the option to download the anaconda env for CANDO (all packages installed) and a preconfigured CANDO env with all packages and the newest stable version of CANDO.

Add header (column names) to CANDO matrix

An artifact of v1 is lack of header for the matrix file. We need to add a header with either CANDO ids or compound names to enable more flexible use of the matrix without losing track of compound data.

  • Add headers to matrices
  • Add functionality to BANDOCK for column names/ids
  • Modify CANDO init to handle the column name when creating Compound objects

Kindly help to solve the following error

Kindly help to solve the following error:
Generating CANDO matrix...
Traceback (most recent call last):

File "/tmp/ipykernel_3652/1890496437.py", line 10, in
lig_name=False, ncpus=ncpus)

File "/home/mi/anaconda3/lib/python3.7/site-packages/cando/cando.py", line 4224, in generate_matrix
p_dict[p[0]] = list(zip(p[1].split(','),p[2].split(',')))

IndexError: tuple index out of range

Add "source" variable to CANDO compounds mapping

This will be an additional variable that will be present in the mapping file and will be processed on init. Each compound will have an associated source (public db, design protocol, etc.) from which it was extracted.

  • Add "source" column to compound mapping file
  • Add new variable to Compound class that will be populated by "source" during CANDO init
  • Optional: Output "source" in canpredict and other functions

Add return statements to functions

Adding "return" statements to a couple key functions (particularly canpredict) to make them more compatible with external functions, so you don't need to write to file and then read from file to get results if you're not working within the CANDO object

Create canbenchmark_new()

Created a new benchmarking function called canbenchmark_new() which uses a methodology similar to canpredict_compounds() to assess performance using the classic CANDO metrics (i.e., AIA, APA, and IC), which all are prepended with n to denote the new benchmarking protocol (e.g., nAIA, nAPA, nIC).

Remove recursive object references

Removed the references to objects within objects to reduce memory usage and depth.
Check all functions to confirm they work properly.

help needed : UnpicklingError: pickle data was truncated

UnpicklingError: pickle data was truncated

Dear Sir,
I want to run the following code:
(https://github.com/lamawmouk/Tox21_FeatureSelection_SMOTEENN_RF/tree/main/Featuregeneration_using_CANDO)/Feature_generation_trainset.ipynb
.............................
import csv
with open('train_smile_toxnum.csv','r') as csvin, open('train_smile.tsv', 'w') as tsvout:
csvin = csv.reader(csvin)
tsvout = csv.writer(tsvout, delimiter='\t')

for row in csvin:
    tsvout.writerow(row)

import cando as cnd
ncpus=3
cnd.add_cmpds("train_smile.tsv", file_type='smi', fp='rd_ecfp4', vect="int", cmpd_dir=".", v="tox21_train")
cnd.generate_matrix(v="tox21_train", lib_path='.',
fp="rd_ecfp4", vect="int",
dist="dice", org='homo_sapien', bs='coach',
c_cutoff=0.0, p_cutoff=0.0, percentile_cutoff=0.0,
i_score="dCxP", out_file='homo_sapien-tox21_train.tsv', out_path="",
nr_ligs=True, approved_only=False,
lig_name=False, ncpus=ncpus)

But the following error is encountered. How to resolve this?

Error message:
Generating CANDO matrix...
Traceback (most recent call last):

File "", line 10, in
lig_name=False, ncpus=ncpus)

File "/home/mazharul/anaconda3/lib/python3.7/site-packages/cando/cando.py", line 4242, in generate_matrix
l_fps = pickle.load(f)

UnpicklingError: pickle data was truncated

Replace Indication object

Create 2 indication dictionaries (id to compound list and name to id) to replace indication object

confused by input

Dear developer,
I just found out about this package, which is great, well, I am quite new to this. So, I am getting a little bit confused about input?
Can I input like some proteins or uniprot ID which in some certain pathway to predict which compound could activate/deactivate it? How can I do this? And, these output compounds are already in your database or can be selected from another big database like zinc? I am not sure if I make myself clear and hope you can understand. Thank you for your valuable time and sorry for these silly questions.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.