Comments (7)
What about adding a few optional columns (such as fw_index, rv_index) to the sample sheet. If those columns are present, demultiplexing will run. If that might mess too much with existing routines, a separate input file (e.g. --demultiplex "sheet.tsv") that contains the necessary information (samplesheet & demultiplexsheet have identical IDs) might be an option?
While ampliseq does not require a samplesheet (folder input & fasta input are also allowed), for demultiplexing that would be fine. After all, a samplesheet can handle more info than a folder input. Not all input options need to support all functionality, imho.
from ampliseq.
This might be a helpful feature. As far as I know there is work ongoing for wrapping DADA2 directely in this pipeline instead of QIIME2 using DADA2. Therefore I am unsure how to integrate this feature sustainably with the major changes that are planned to the early workflow. However, PRs are welcome.
from ampliseq.
@DiegoBrambilla is planning to implement dada2 for PacBio analysis and could immediately add that demultiplexing step :)
from ampliseq.
We take it into consideration.
For the time being, implementing the R-DADA2 pipeline, taxonomy annotation from several sources and dealing with PacBio reads take priority.
from ampliseq.
Demultiplexing could be done via cutadapt as documented here. I never come across the need for demultiplexing in the pipeline, but if anyone does, please mention it here and I might further look into it.
from ampliseq.
I want to add demultiplexing (with Cutadapt) to Ampliseq. The way I've handled demultiplexing in my own nf-core style pipeline is to ask the user to specify the path to their raw data in the command line --raw_data "/path/to/data/*{R1,R2}*.fastq.gz"
. Then in the sample sheet the user has to add the columns fw_index, rv_index, fw_primer, and rv_primer (the two rv_ columns can be empty for single-end data). I use the _index columns for demultiplexing and the _primer columns for trimming after demultiplexing. The main issue I see is that Ampliseq doesn't require a sample sheet as input, so I'm wondering if anyone has a suggestion for a better way of adding this feature to Ampliseq? Maybe the sample sheet should be required if the user wants to demultiplex?
from ampliseq.
To me, adding columns to the sample sheet sounds best.
from ampliseq.
Related Issues (20)
- Edge case: Clustering with VSEARCH fails at QIIME2_INSEQ HOT 1
- Allow to analyse 454 sequencing data HOT 2
- Add option to assign ASV to multiple species with DADA2 HOT 3
- Debug information for docker-based run. HOT 4
- Allow stratified output from picrust2 HOT 4
- nf-core/ampliseq with conda - change bioconductor-biostrings HOT 2
- Launch webpage not working HOT 4
- Adding qza file for downstream analysis in R HOT 3
- When using `--vsearch_cluster`, if you have many thousands of clusters, `AMPLISEQ:FILTER_CLUSTERS` will fail with an `Argument list too long` error. HOT 8
- test_full Cannot access file fastq HOT 1
- Multipe region amplicon sequencing analysis support (5R / SMURF / q2-sidle) HOT 1
- Getting ca 50% more ASVs than when using DADA2 on QIIME2 HOT 2
- ampliseq fails during taxonomy assignation when processing ITS sequences HOT 14
- Error No subject alternative DNS name matching zenodo.org found HOT 2
- minor improvement of sort() before denoising with method = "radix HOT 2
- 12S taxonomic classification databases HOT 3
- Does the `gtdb` database only include Bacteria? HOT 5
- Remove PhytoRef as it's included in PR2 5.0.0 HOT 1
- Template update 2.13.1 HOT 1
- Barrnap filtration HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ampliseq.