#MeGAMerge

MeGAMerge: a tool to merge assembled contigs and long reads from metagenomic sequencing runs.

##Description

MeGAMerge is a Perl-based wrapper that accepts any number of sequence files in Multi-FASTA format, containing assembled contigs of any length, and produces an improved contig set using overlap-layout-consensus (OLC) assembly. All overlap parameters (minimum overlap length, identity, etc.) can be set by the user at runtime. It is written to run on Linux.
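As a quick sanity check before merging, it can help to confirm that each input really is a Multi-FASTA file and to see how many contigs it holds. The sketch below is not part of MeGAMerge; the function name `count_contigs` and the demo sequences are made up for illustration.

```shell
# Count sequence records in a FASTA file.
# Each record starts with a header line beginning with '>'.
count_contigs() {
    grep -c '^>' "$1"
}

# Demo on a tiny two-contig Multi-FASTA file (made-up sequences).
demo=$(mktemp)
printf '>contig_1\nACGTACGT\n>contig_2\nGGCCTTAA\n' > "$demo"
count_contigs "$demo"
```

A count of 0 suggests the file is empty or not in FASTA format.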

##Requirements

The following tools must be installed and either available in $PATH or added to $binpath in the tool:

  • Newbler (specifically runAssembly)
  • Minimus2 (part of AMOS, also requires MUMmer)
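One way to check these requirements up front is a small shell sketch like the following. It assumes the executables are named `runAssembly`, `minimus2`, `nucmer`, `delta-filter`, and `show-coords`; the helper name `missing_tools` is hypothetical and not part of MeGAMerge.

```shell
# Print the name of every listed executable that is not found in $PATH.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Check Newbler, Minimus2, and the MUMmer programs that AMOS relies on.
missing=$(missing_tools runAssembly minimus2 nucmer delta-filter show-coords)
if [ -n "$missing" ]; then
    printf 'Not found in PATH:\n%s\n' "$missing"
else
    echo "All required tools found."
fi
```

Tools that are installed outside $PATH will be reported as missing here; those can instead be supplied through the directory options described under Options.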

###Installation notes

MUMmer: For larger genome projects, the MUMmer package must be compiled in 64-bit mode. This can be done with: make all CPPFLAGS="-O3 -DSIXTYFOURBITS"

AMOS: When installing AMOS, its tools must be able to find nucmer, delta-filter, and show-coords as compiled above. Either add them to the path before running ./configure, or specify the variables NUCMER, DELTAFILTER, and SHOWCOORDS when running ./configure in the amos directory. Example: ./configure NUCMER=/usr/local/bin/nucmer/bin/nucmer --prefix /usr/local/amos

##Usage:

MeGAMerge-1.0.pl [options] output_directory

##Options:

-overlap=NN Minimum overlap length in minimus2/Newbler (default 80)

-minID=NN Minimum % identity for overlap in minimus2/Newbler (default 98)

-conserr=NN Maximum consensus error for minimus2 (default 0.06)

-cpu=NN Number of CPUs for Newbler (default 4)

-bindir=directory Directory containing MUMmer executables and AMOS executables

-newblerdir=directory Directory containing the Newbler executable (runAssembly)

-o=outfile Name of final file to output in output_directory (default MergedContigs.fasta)

-minLen=NN Minimum length to include in newbler assemblies (default 150)

-minIncludeLen=NN Minimum length to include in minimus assembly (default 200)

-d Turns on debug information

-force Force overwrite of previous runs.

-single_genome=1 Runs assuming a single genome, reducing auto-options (one Newbler run, fewer contigs excluded; overrides -minLen and -minIncludeLen)
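To illustrate how the options are written in -name=value form, the sketch below prints a candidate command line instead of running it. The option values, the output directory `merge_out`, and the file names are hypothetical. The placement of the input FASTA files after the output directory is an assumption, since the usage line above does not show where they go; check the script's own help output before relying on it.

```shell
# Print (do not run) a MeGAMerge command line built from documented options.
# ASSUMPTION: input FASTA files follow the output directory; the usage line
# does not show this, so verify against the script's help text.
build_megamerge_cmd() {
    outdir=$1
    shift
    printf 'MeGAMerge-1.0.pl -overlap=100 -minID=99 -cpu=8 -o=merged.fasta %s' "$outdir"
    for fasta in "$@"; do
        printf ' %s' "$fasta"
    done
    printf '\n'
}

# Hypothetical output directory and input assemblies.
build_megamerge_cmd merge_out assemblyA.fasta assemblyB.fasta
```

Printing the command first makes it easy to review the chosen thresholds before starting a long assembly run.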

##Citation

If you use this software in your publications, please cite:

Scholz, M., Lo, C.-C., & Chain, P. S. G. (2014). Improved Assemblies Using a Source-Agnostic Pipeline for MetaGenomic Assembly by Merging (MeGAMerge) of Contigs. Scientific Reports, 4, 6480. Retrieved from http://dx.doi.org/10.1038/srep06480
