

MetaPathways 2: A master-worker model for environmental Pathway/Genome Database construction on grids and clouds

Niels W. Hanson, Kishori M. Konwar, Shang-Ju Wu, and Steven J. Hallam


Updates

July 7, 2015: MetaPathways v2.5.2 released with minor bug fixes related to gbk input processing, SAM file RPKM calculations, and rRNA homology search.

November 27, 2014: MetaPathways v2.5 released with upgrades to the pipeline:

  • LAST homology searches with BLAST-equivalent output and E-values
  • Reads per kilobase per million mapped (RPKM) coverage measure for Contig annotations calculated from raw reads (.fastq) or mapping files (.SAM) using bwa
  • Addition of the CAZy sequence database as a new compatible functional hierarchy
  • GUI keyword search for annotation subsetting and projection onto different functional hierarchies (KEGG, COG, SEED, MetaCyc, and now CAZy)
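For readers unfamiliar with the RPKM measure mentioned above, the following is a minimal sketch of the standard formula: reads mapped to a contig, divided by the contig length in kilobases and by the total number of mapped reads in millions. It is not taken from the MetaPathways codebase; the function name `rpkm` and the example counts are hypothetical and chosen only for illustration.

```python
def rpkm(reads_mapped, contig_length_bp, total_mapped_reads):
    """Reads per kilobase per million mapped reads for one contig.

    Hypothetical helper illustrating the standard RPKM formula;
    not the MetaPathways implementation.
    """
    if contig_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("contig length and total mapped reads must be positive")
    length_kb = contig_length_bp / 1000.0            # contig length in kilobases
    mapped_millions = total_mapped_reads / 1.0e6     # library size in millions of reads
    return reads_mapped / (length_kb * mapped_millions)


# Illustrative (made-up) read counts and contig lengths.
read_counts = {"contig_1": 100, "contig_2": 50}
contig_lengths = {"contig_1": 1000, "contig_2": 2000}
total_mapped = 1000000

coverage = {
    name: rpkm(read_counts[name], contig_lengths[name], total_mapped)
    for name in read_counts
}
```

Normalizing by both contig length and library size is what makes the values comparable across contigs of different sizes and across samples of different sequencing depth.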

See the release page and the wiki for more information.

Abstract

The development of high-throughput sequencing technologies over the past decade has generated a tidal wave of environmental sequence information from a variety of natural and human-engineered ecosystems. The resulting flood of information into public databases and archived sequencing projects has exponentially expanded computational resource requirements, rendering most local homology-based search methods inefficient. We recently introduced MetaPathways v1.0, a modular annotation and analysis pipeline for constructing environmental Pathway/Genome Databases (ePGDBs) from environmental sequence information capable of using the Sun Grid Engine for external resource partitioning. However, a command-line interface and facile task management introduced user activation barriers with concomitant decrease in fault tolerance.

Here we present MetaPathways v2.0 incorporating a graphical user interface (GUI) and refined task management methods. The MetaPathways GUI provides an intuitive display for setup and process monitoring and supports interactive data visualization and sub-setting via a custom Knowledge Engine data structure. A master-worker model is adopted for task management allowing users to scavenge computational results from a number of worker grids in an ad hoc, asynchronous, distributed network that dramatically increases fault tolerance. This model facilitates the use of EC2 instances extending ePGDB construction to the Amazon Elastic Cloud.
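The master-worker idea described above can be illustrated with a small, generic sketch. This is not the MetaPathways task manager: it uses a local thread pool in place of remote grids, and the names `run_master`, `flaky_worker`, and the `batch_*` task labels are hypothetical. It only shows the general pattern in which a master hands out tasks, collects results as they arrive, and resubmits any task whose worker fails (Python 3).

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def run_master(tasks, worker, n_workers=3, max_attempts=3):
    """Dispatch tasks to a pool of workers and resubmit tasks that fail.

    Returns a dict mapping each completed task to its result. Tasks that
    still fail after max_attempts are left out of the result.
    """
    results = {}
    attempts = {task: 0 for task in tasks}
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        pending = {}
        for task in tasks:
            attempts[task] += 1
            pending[pool.submit(worker, task, attempts[task])] = task
        while pending:
            # Iterate over a snapshot; resubmitted tasks are picked up
            # on the next pass of the outer loop.
            for future in as_completed(list(pending)):
                task = pending.pop(future)
                try:
                    results[task] = future.result()
                except Exception:
                    if attempts[task] < max_attempts:
                        attempts[task] += 1
                        pending[pool.submit(worker, task, attempts[task])] = task
    return results


def flaky_worker(task, attempt):
    """Hypothetical worker that simulates a node failing on its first try."""
    if attempt == 1 and task.endswith("0"):
        raise RuntimeError("simulated worker failure")
    return f"{task}: done"


tasks = ["batch_0", "batch_1", "batch_2"]
results = run_master(tasks, flaky_worker)
```

Because the master tracks each task rather than each worker, a failed or slow worker costs only a resubmission, which is the source of the fault tolerance the abstract refers to.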

Installation

MetaPathways v2.5 requires Python 2.7 or greater and Pathway Tools developed by SRI International for full functionality.

The MetaPathways Python codebase as well as the compiled GUI binaries for Mac OSX and Ubuntu are self-contained in this GitHub distro. GUI source code can be obtained here.

Please see the MetaPathways v2.5 wiki for more installation details.

A template MetaPathways_DBs.zip (updated October 2014) contains starter protein and taxonomic databases.

Citation

If you use MetaPathways for research work, please cite the following:

  • Kishori M. Konwar, Niels W. Hanson, Maya P. Bhatia, Dongjae Kim, Shang-Ju Wu, Aria S. Hahn, Connor Morgan-Lang, Hiu Kan Cheung, and Steven J. Hallam. MetaPathways v2.5: Quantitative functional, taxonomic, and usability improvements. Bioinformatics, 1–3 (2015). doi:10.1093/bioinformatics/btv361

  • Niels W. Hanson, Kishori M. Konwar, Shang-Ju Wu, Steven J. Hallam. MetaPathways v2.0: A master-worker model for environmental Pathway/Genome Database construction on grids and clouds. Proceedings of the 2014 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB 2014), Honolulu, HI, USA, May 21-24, 2014. doi:10.1109/CIBCB.2014.6845516
