Giter VIP home page Giter VIP logo

ablab / spades Goto Github PK

View Code? Open in Web Editor NEW
728.0 32.0 134.0 99.3 MB

SPAdes Genome Assembler

Home Page: http://ablab.github.io/spades/

License: Other

HTML 0.03% Python 1.82% Shell 0.12% C++ 80.39% C 16.11% Perl 0.31% CMake 0.49% Makefile 0.08% TeX 0.01% Java 0.02% Lua 0.06% Roff 0.10% JavaScript 0.07% Jupyter Notebook 0.30% R 0.01% D 0.03% CSS 0.03% Starlark 0.03% Dockerfile 0.01% VBScript 0.01%
genome-assembly sequence-assembler sequencing transcriptome-assembly illumina metagenome-assembly next-generation-sequencing

spades's Introduction

About SPAdes

SPAdes is a versatile toolkit designed for assembly and analysis of sequencing data. SPAdes is primarily developed for Illumina sequencing data, but can be used for IonTorrent as well. Most of SPAdes pipelines support hybrid mode, i.e. allow using long reads (PacBio and Oxford Nanopore) as a supplementary data.

SPAdes package contains assembly pipelines for isolated and single-cell bacterial, as well as metagenomic and transcriptomic data. Additional modes allow to discover bacterial plasmids and RNA viruses, as well as perform HMM-guided assembly. Besides, SPAdes package includes supplementary tools for efficient k-mer counting and k-mer-based read filtering, assembly graph construction and simplification, sequence-to-graph alignment and metagenomic binning refinement.

Current version: see VERSION file.

Quick start

  • Complete user manual can be found here. Information below is provided merely for your convenience and cannot be considered as the user guide.

  • SPAdes assembler supports:

    • Assembly of second-generation sequencing data (Illumina or IonTorrent);
    • PacBio and Nanopore reads that are used as supplementary data only.
  • SPAdes allows to assemble genomes, metagenomes, transcriptomes, viral genomes etc.

  • Download SPAdes binaries for Linux or MacOS here. You can also compile SPAdes from source (requires g++ 9.0+, cmake 3.16+, zlib and libbz2). SPAdes requires only Python 3.8+ to be installed.

  • Test your SPAdes installation by running

    bin/spades.py --test
  • A single paired-end library (separate files, gzipped):
    bin/spades.py -1 left.fastq.gz -2 right.fastq.gz -o output_folder
  • IonTorrent data:
    bin/spades.py --iontorrent -s it_reads.fastq -o output_folder
  • A paired-end library coupled with long PacBio reads:
    bin/spades.py -1 left.fastq.gz -2 right.fastq.gz --pacbio pb.fastq -o output_folder
  • Available assembly modes: --isolate, --sc, --plasmid, --meta, --metaplasmid, --metaviral, --rna, --rnaviral, --bio, --corona, --sewage.

  • Standalone tools in SPAdes package: k-mer counting, k-mer cardinality estimation, k-mer-based read filtering, assembly graph construction, assembly graph simplification, alignment of long reads to an assembly graph, refinement of metagenomic binning.

Citation

If you use SPAdes in your research, please cite our latest paper.

In case you perform hybrid assembly using PacBio or Nanopore reads, you may also cite Antipov et al., 2015. If you use multiple paired-end and/or mate-pair libraries you may additionally cite papers describing SPAdes repeat resolution algorithms Prjibelski et al., 2014 and Vasilinetc et al., 2015.

If you use other pipelines, please cite the following papers:

You may also include older papers Nurk, Bankevich et al., 2013 or Bankevich, Nurk et al., 2012, especially if you assemble single-cell data.

Feedback and bug reports

Please, leave your comments and bug reports at our GitHub repository tracker. If you have any troubles running SPAdes, please attach params.txt and spades.log from the output folder.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.