Comments (8)
I added a fix linked above to dev branch, it will give a proper error message with the request to change the sampleID. Will be in the next release. SO I close that issue here.
from ampliseq.
Thanks for reporting it, sorry for the trouble.
I never encountered this issue.
I am speculating that the input file is also picked up as output file by
This might be because the sampleID
D17_1
is the base of your forward name D17_1.fastq.gz
.If that is true, changing your samplesheet from
D17_1 /hdd0/susbus/nf_core/data/hebe_16S/00.RawData/D17/D17_1.fastq.gz /hdd0/susbus/nf_core/data/hebe_16S/00.RawData/D17/D17_2.fastq.gz
to
D17 /hdd0/susbus/nf_core/data/hebe_16S/00.RawData/D17/D17_1.fastq.gz /hdd0/susbus/nf_core/data/hebe_16S/00.RawData/D17/D17_2.fastq.gz
should do the trick. Could you test that?
from ampliseq.
Yeah, I think that was the issue indeed. I renamed my input files 🙈, 'cos I like to make things complicated. I suppose the rename would have been the easiest and simplest option. Thanks for responding so quickly though.
I know someone already raised this but maybe an input file validation
step will help in future releases, to avoid these cases. I expect they might happen with replicate samples, those which one doesn't want to treat as run
in the input file.
Thanks again!
from ampliseq.
Yes, thanks, there is already some input validation going on and more is done in the next release, but I think this problem wouldn't be identified by any test!
Potential solutions:
- Check (1) whether
sampleID
is ending with_1
/_2
, and if yes, whether it is identical to file base name offorwardReads
&reverseReads
(before .fastq.gz) --> error message with request to changesampleID
. - Change rename_raw_data_files.nf output regex so that it doesnt care (not sure how thats done atm).
Will need to investigate further. Lets keep that issue open because its clearly a bug.
from ampliseq.
Great, thank you!
from ampliseq.
@d4straub quick question: I have reads from the Novogene where the barcode and the primer sequences were removed apparenlty.
When I run the following:
nextflow run nf-core/ampliseq -r 2.6.1 -profile singularity --input sample_hebe_edited.tsv --FW_primer GTGCCAGCMGCCGCGGTAA --RV_primer CCGTCAATTCCTTTGAGTTT --outdir "./hebe_16Sresults" --max_cpus 24 --max_memory 256.GB
I'm getting the below issue:
The following samples had too few reads (<1) after trimming with cutadapt:
Is it better to run the skip_cutadapt
or the retain_untrimmed
flags?
Thanks!
from ampliseq.
It might be also that you use the wrong primer sequences.
Is it better to run the skip_cutadapt or the retain_untrimmed flags?
If its fine to you to run ampliseq potentially twice, use --skip_cutadapt
first. If a large portion of reads (lets say, >10 or 15%) are removed due to being flagged as chimeric, use --retain_untrimmed -resume
instead of --skip_cutadapt
. That should reduce chimeric reads considerably.
Such a question would be better suited to be asked in the nf-core slack channel #ampliseq, see https://nf-co.re/join
from ampliseq.
Awesome, thank you. To my knowledge, I'm using the primer sequences the company provided. And thanks for the link the slack channel. Will use that going forward.
from ampliseq.
Related Issues (20)
- ERROR ~ Error executing process > 'NFCORE_AMPLISEQ:AMPLISEQ:DADA2_MERGE .........has been broken HOT 3
- ANCOM-BC for differentially abundant taxa HOT 2
- ERROR: R_HOME ('/usr/local/lib/R') not found HOT 5
- Misleading error message when samples are not passing filterandtrim HOT 1
- Accessing 16S gene identifiers HOT 13
- DADA2 split regions singularity HOT 13
- multi-region analysis: sidle/reconstructed/reconstructed_merged.tsv OCCATIONALLY mis-formatted HOT 1
- Running test error HOT 1
- Abundance plots for qiime2 results without metadata provided HOT 3
- Adding ONT read support for ampliseq HOT 2
- `overall_summary.tsv` sometimes with misleading numbers in 2.9.0 HOT 11
- Analyse data set that contains unknown primer set HOT 5
- Cutadapt with "-u" instead of "fw/rv_primer seq" HOT 1
- There is no qiime2 result file in the results HOT 2
- Misleading text in output documentation HOT 1
- Add MACSE to enable frame-shift detection for COI HOT 6
- Pipeline fails when run with a lot of cores HOT 6
- Error "Can't stage file https://files.plutof.ut.ee:" Download fail HOT 7
- Taxonomy database choices not validated against permitted values since 2.9.0 HOT 3
- New (10.0) version of Unite databases available to `--dada_ref_taxonomy`
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ampliseq.