Giter VIP home page Giter VIP logo

bacterial_reseq's People

Contributors

bamarshall2 avatar mikewolfe avatar

Watchers

 avatar

Forkers

bamarshall2

bacterial_reseq's Issues

fix line 32 in quality_control.smk

Change line 32 in quality_control.smk from
out = lookup_sample_metadata(sample, "file_path", pep)
to
out = lookup_sample_metadata(sample, "infile_path", pep)

"infile_path" is the name of the column in the sample sheet, not "file_path"

Add de-novo assembly as an option for analysis

Although breseq is a great tool, it is not designed to deal with large changes containing exogenous DNA not found in the reference. To quote the breseq methods paper (https://doi.org/10.1007/978-1-4939-0554-6_12):

The re-sequencing strategy employed by breseq breaks down when there is sufficient sequence divergence between the sample and the reference genome to lead to mismapping of reads, such as when there is a high level of local nucleotide divergence or there are novel sequences in the sample that are not in the reference genome. In these cases, it may be valuable to use de novo assembly tools, such as Velvet (10.1101/gr.074492.107) or ALLPATHS-LG (10.1101/gr.141515.112), on either the unmapped reads after a breseq run (located at data/unmatched.*) or all reads and then compare assemblies to the reference genome or to one another using tools for comparing whole-genome sequences, such as MUMmer (10.1186/gb-2004-5-2-r12).

I think we should follow their suggestion and use these tools to perform a de-novo assembly of the sequencing samples.

  • Decide on which de-novo assembler to use. Are these the best for bacterial genomes and short read sequencing? Is there something newer that is better?
  • Check to make sure any tool is conda installable (i.e. MUMmer is https://anaconda.org/bioconda/mummer)
  • Create a new module in workflow/rules/denovo_assembly.smk to run the assembly
  • Add rules to compare the assembled genome against the reference genome. MUMmer is likely the best for this.
  • Decide what set of output files need to be given to the end user
  • Define how a user of the pipeline will indicate they want de-novo assembly ran. This could be a required column to the metadata sheet that tells the pipeline whether to run assembly or not on a sample

BAM request

Consumers would like a BAM file to scrutinize for each sample. General parameters for alignment should be fine.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.