
neutronstar's People

Contributors

ewels, remiolsen


neutronstar's Issues

Tigmint integration

Use tigmint for assembly error correction using linked-read information, usually before ar{c,k}s scaffolding.

Same use cases as related issue #16.

Arcs integration

Have a think about integrating EMA mapping + Arcs (or EMA bc whitelisting + Arks) to do additional and more aggressive scaffolding than Supernova does.

With the coming of DSL 2, it could be a good idea to write this out as a sub-workflow with its own entrypoint. A common use case would also be running these tools on an existing long-read assembly (e.g. PacBio).

These tools exist on bioconda! 👍

Remis master To-do list

From @remiolsen on December 11, 2017 17:4

For transparency, here's my "design document"


Main requirements

  • Nextflow? yes
  • Supernova running in $SNIC_TMP (Irma compatible?)
    • 1.20 compatible — multiple input parameter assemblies
    • [ ] Use nextflow publishdata instead of rsync. Couldn't make it work. Use rsync!
    • Make this optional
  • Rsync supernova assembly back to workdir
  • supernova mkoutput - pseudohap, megabubbles
    • gunzip
    • parameter of additional outputs — always output .phased.fasta
    • parameter for minimum length
  • QUAST
    • make it run on Irma
  • BUSCO
    • UPPMAX — beforeScript
  • MultiQC
    • Needs testing
  • support for --no-preflight flag
  • Documentation
    • Readme.md
  • dump software versions & commands that were run
  • Send mail when the pipeline is finished
  • Clean up and generalize the configs
    • Common HPC config
    • Common Uppmax config
    • Make a general local run config
  • Release tags

Docker / Singularity

  • Supernova (copyright issues?)
  • Quast
  • BUSCO
  • Script for automatic singularity/docker download / installation

NX script

  • input configuration:
    • id
      • fastqs
      • sample
      • maxreads
      • bcfrac
    • genomesize
  • memory parameter
  • cpu parameter
  • make Longranger / fastqc optional

Input_validation

  • id — only numbers, letters, dash, and underscore allowed
  • bcfrac (0,1)
  • maxreads - num
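
The three checks above could be sketched roughly as follows. This is only an illustration in Python, not the pipeline's actual code: the function name `validate_input` and the parameter name `sample_id` are hypothetical, "(0,1)" is read as an open interval (endpoints excluded), and "num" is taken to mean a positive integer. Both readings are assumptions and may need loosening.

```python
import re

# Assumption: an id may contain only letters, digits, dash, and underscore.
ID_PATTERN = re.compile(r"[A-Za-z0-9_-]+")


def validate_input(sample_id, bcfrac=None, maxreads=None):
    """Return a list of error messages; an empty list means the input passed."""
    errors = []

    # id: the whole string must match the allowed character set.
    if not isinstance(sample_id, str) or not ID_PATTERN.fullmatch(sample_id):
        errors.append("id: only numbers, letters, dash, and underscore allowed")

    # bcfrac: assumed here to lie strictly between 0 and 1.
    if bcfrac is not None:
        is_number = isinstance(bcfrac, (int, float)) and not isinstance(bcfrac, bool)
        if not is_number or not 0 < bcfrac < 1:
            errors.append("bcfrac: must be a number between 0 and 1")

    # maxreads: assumed here to be a positive integer.
    if maxreads is not None:
        is_int = isinstance(maxreads, int) and not isinstance(maxreads, bool)
        if not is_int or maxreads <= 0:
            errors.append("maxreads: must be a positive integer")

    return errors
```

Collecting all messages instead of stopping at the first failure lets a user fix every problem in one pass, e.g. `validate_input("bad id!", bcfrac=1.5, maxreads=-3)` reports three errors.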

MultiQC

  • Fix when having empty molecule.yaml files
  • Does having “ASSEMBLER_CS” folders break multiqc?
  • Fix QUAST module. It breaks when running with -s option

Testing

  • Test data from NA12878 run.
  • Travis-CI integration

Could haves

  • Tigmint evaluation
  • Delivery template mail / output folder structure
  • BWA align
    • picard-tools
    • remove dups
    • collectinsertsize
  • qaTools-singularity
  • FRC-singularity
  • BUSCOv2 datasets in config
    • auto-script to download datasets

Copied from original issue: SciLifeLab#3
