Giter VIP home page Giter VIP logo

autodock-vina-app's Introduction

This application runs AutoDock Vina virtual screening using the user's protein receptor and a selected ligand library on the Texas Advanced Computing Center (TACC) Lonestar6 supercomputer. Utilizing Python's mpi4py library, the application allows the user to submit batch jobs that dock up to thousands of molecules in parallel.

Note: Computer docking and virtual screening are inexact, but potentially very valuable, tools. This site is intended to provide easy access to researchers wishing to perform small numbers of docking or virtual screening experiments, but who do not have the necessary computer resources and/or computational biology backgrounds. This interface can handle most protein-ligand docking experiments, but there will always be systems where this simple interface will fail. For those cases, researchers will need to develop the necessary expertise or collaborations to perform the experiment.

Running the Application:

To run the application, the user must submit a job using the TACC API (TAPIS) UTRC Portal system. The following inputs are allowed:

  • Ligand Library: The user must choose one of the ligand libraries to run this virtual screening on
  • Protein Receptor: The user must upload a .pdb- or .pdbqt-formatted protein receptor file
  • Grid Center: The user must specify X-Y-Z coordinates of the center of the search area
  • Box Size: The user must specify X-Y-Z size limits; default is 20-20-20
  • Run Time: The user must enter maximum allowed runtime. See documentation for assistance choosing an appropriate runtime
  • Processors on Each Node: The user must set this to 32
  • Node Count: The user must set this to a specific number based on the chosen ligand library:
    • Enamine-HTSC: 10
    • Enamine-AC: 3
    • All others: 1

Output Visualization:

The user may also choose to visualize their resulting output file utilizing the ChimeraX application on the Texas Advanced Computing Center (TACC) Lonestar6 supercomputer. To run the application, the user must submit a job using the University of Texas Research Cyberinfrastructure (UTRC) system. A virtual DCV session will be initialized for the user, and a document titled 'ChimeraX_dcvserver.txt' will be created in the user's work folder with connection instructions. Once connected, the University of California San Francisco (UCSF) ChimeraX software will begin running, and a Tapis jobs archive folder will be created on the desktop for the user's convenience. The user may then use the ChimeraX program to open the file containing the top scoring docked ligands.

Docking Software:

AutoDock Vina, written by Drs. O. Trott and A. Olson at the Scripps Institute is used to perform the actual docking. This open-source program has been made freely available by the authors.

AutoDockTools, written by Drs. G. Morris and A. Olson at the Scripps Institute is used to convert proteins and ligands to the format required by AutoDock Vina. This open-source program has been made freely available by the authors.

J. Eberhardt, D. Santos-Martins, A. F. Tillack, and S. Forli. (2021). AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings. Journal of Chemical Information and Modeling.

O. Trott and A. J. Olson. (2010). AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. Journal of computational chemistry, 31(2), 455-461.

ChimeraX Software:

Molecular graphics and analyses performed with UCSF ChimeraX, developed by the Resource for Biocomputing, Visualization, and Informatics at the University of California, San Francisco, with support from National Institutes of Health R01-GM129325 and the Office of Cyber Infrastructure and Computational Biology, National Institute of Allergy and Infectious Diseases.

UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Pettersen EF, Goddard TD, Huang CC, Meng EC, Couch GS, Croll TI, Morris JH, Ferrin TE. Protein Sci. 2021 Jan;30(1):70-82.

UCSF ChimeraX: Meeting modern challenges in visualization and analysis. Goddard TD, Huang CC, Meng EC, Pettersen EF, Couch GS, Morris JH, Ferrin TE. Protein Sci. 2018 Jan;27(1):14-25.

autodock-vina-app's People

Contributors

joshuaamedina avatar tiffanyhuff avatar wjallen avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

autodock-vina-app's Issues

Put final citation in output somewhere

Tiffany Huff, Austin Darrow, Joshua Medina, Erik Ferlanti, James Carson, John M. Fonner, Sal Tijerina, Stanley J. Watowich, and William J. Allen. 2023. Scalable and Reproducible Virtual Screening through an API-Integrated Workflow. In Practice and Experience in Advanced Research Computing (PEARC โ€™23), July 23โ€“27, 2023, Portland, OR, USA. ACM, New York, NY, USA, 4 pages. https://doi.org/10.1145/3569951.3597599

Increase number of nodes for HTSC library

Ligand library Processors/node Nodes # ligands Run time
ZINC-fragments 32 1 546,002 22 hrs*
Enamine-PC 32 1 84,359 ~7.5 hrs#
Enamine-AC 32 3 876,985 ~26 hrs*
Enamine-HTSC 32 10 3,467,770 ~40 hrs^

*search volume 14x12x18
#search volume 14x14x18
^search volume 9x12x16

Limit search time so all jobs finish within wallclocklimit

If a job hits the wallclocklimit, then likely it has not finished docking all the ligands or removing all the output that it doesn't need to keep. This creates a problem where TAPIS will then try to archive all the output files (could be 3M+ files) to the archive location. This is not ideal.

We should either be deleting temporary output files as we go, or reserving time at the end of the job for cleaning up.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.