Giter VIP home page Giter VIP logo

hiv-ngs's People

Contributors

spond avatar stevenweaver avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

hiv-ngs's Issues

Exporting sequences

Having the ability to export the collapsed sequences (based on 99% homology) from each dataset has been helpful. However, I realize now that we still need the output of aligned nucleotide sequences the way old Datamonkey UDS pipeline presented it. To screen for DI, we would like to take random representative samples from each dataset for downstream phylogenetic analysis, and we cannot do that with the collapsed sequence set. Thanks much. Gabe

Folder naming / additional level

We would like to submit a specific dataset including NGS sequences from a cohort of MSM pairs (source and recipient).
Is it possible to add a new level (5th) for the pair identification (source / recipient) or to replace the actual optional Level 4 ("replicate", not necessary for our cohort)
Level 1: Patient ID (anything, unique, required)
Level 2: Sample date (YYYYMMDD format, required)
Level 3: Compartment (anything, optional)
Level 4: Replicate (anything, optional)
+/-Level 5: Pair ID (for example: S3 for the Source of Pair #3 or R12 for the recipient of Pair #12...)

thx
Antoine

Octamonkey report

Hi
Here are several comments about the octamonkey pipeline:
-There are some duplicates with the ID of the sequences in the clusters alignment ('region' column) and there are also some 'space' characters that are conflicting.
-Problems with the colors of the FST figures and no option to download these figures.
-A very brief 'help' page with explanation of each option/output may help (for example, what does the 'divergence column' refer to?)
-Need to add an export option to download all the haplotypes (and not only the consensus) from a cohort.

Many thx for this fantastic job.
A

Exporting Consensus Sequence snafu

When attempting to 'Export Consensus Sequences' from ProtC data (>1000 records), it only captured 133 sequences. Looking at individual datasets that were not captured in the export revealed no significant problems; I was able to generate a consensus sequence myself from them.
Thanks!
Gabe

Requested features on new NGS pipeline

  1. Raw read metrics (number of raw reads, how many reads fulfill quality metrics or how many low quality reads are filtered out from subsequent analyses).
  2. Number of clean reads per coding region.
    Thanks!
    Gabe

Exporting sequences

For the Protocol C dataset, it is nice to be able to export individual sequences (major homology clusters). But to be able to export them all 'en masse' would be nice too. There is an 'Export' feature at the top right of the page which allows one to export all sequences (I think), but given the large number of sequences, the uploading portion never finishes.
Thanks,
G

Requested feature

The sequence titles in the downloadable identical sequence alignment should be more informative. Perhaps (in addition to cluster # and # reads collapsed) also include PID, date/Visit code and region.
Request from Antoine/Gabe.
Thanks

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.