Giter VIP home page Giter VIP logo

Comments (8)

d4straub avatar d4straub commented on September 23, 2024 1

I added a fix linked above to dev branch, it will give a proper error message with the request to change the sampleID. Will be in the next release. SO I close that issue here.

from ampliseq.

d4straub avatar d4straub commented on September 23, 2024

Thanks for reporting it, sorry for the trouble.
I never encountered this issue.
I am speculating that the input file is also picked up as output file by

tuple val(meta), path("${meta.id}{_1,_2,}.fastq.gz", includeInputs: true), emit: fastq

This might be because the sampleID D17_1 is the base of your forward name D17_1.fastq.gz.
If that is true, changing your samplesheet from
D17_1 /hdd0/susbus/nf_core/data/hebe_16S/00.RawData/D17/D17_1.fastq.gz /hdd0/susbus/nf_core/data/hebe_16S/00.RawData/D17/D17_2.fastq.gz
to
D17 /hdd0/susbus/nf_core/data/hebe_16S/00.RawData/D17/D17_1.fastq.gz /hdd0/susbus/nf_core/data/hebe_16S/00.RawData/D17/D17_2.fastq.gz
should do the trick. Could you test that?

from ampliseq.

susheelbhanu avatar susheelbhanu commented on September 23, 2024

Yeah, I think that was the issue indeed. I renamed my input files 🙈, 'cos I like to make things complicated. I suppose the rename would have been the easiest and simplest option. Thanks for responding so quickly though.

I know someone already raised this but maybe an input file validation step will help in future releases, to avoid these cases. I expect they might happen with replicate samples, those which one doesn't want to treat as run in the input file.

Thanks again!

from ampliseq.

d4straub avatar d4straub commented on September 23, 2024

Yes, thanks, there is already some input validation going on and more is done in the next release, but I think this problem wouldn't be identified by any test!
Potential solutions:

  • Check (1) whether sampleID is ending with _1/_2, and if yes, whether it is identical to file base name of forwardReads & reverseReads (before .fastq.gz) --> error message with request to change sampleID.
  • Change rename_raw_data_files.nf output regex so that it doesnt care (not sure how thats done atm).

Will need to investigate further. Lets keep that issue open because its clearly a bug.

from ampliseq.

susheelbhanu avatar susheelbhanu commented on September 23, 2024

Great, thank you!

from ampliseq.

susheelbhanu avatar susheelbhanu commented on September 23, 2024

@d4straub quick question: I have reads from the Novogene where the barcode and the primer sequences were removed apparenlty.

When I run the following:

nextflow run nf-core/ampliseq -r 2.6.1 -profile singularity --input sample_hebe_edited.tsv --FW_primer GTGCCAGCMGCCGCGGTAA --RV_primer CCGTCAATTCCTTTGAGTTT --outdir "./hebe_16Sresults" --max_cpus 24 --max_memory 256.GB

I'm getting the below issue:

The following samples had too few reads (<1) after trimming with cutadapt:

Is it better to run the skip_cutadapt or the retain_untrimmed flags?

Thanks!

from ampliseq.

d4straub avatar d4straub commented on September 23, 2024

It might be also that you use the wrong primer sequences.

Is it better to run the skip_cutadapt or the retain_untrimmed flags?

If its fine to you to run ampliseq potentially twice, use --skip_cutadapt first. If a large portion of reads (lets say, >10 or 15%) are removed due to being flagged as chimeric, use --retain_untrimmed -resume instead of --skip_cutadapt. That should reduce chimeric reads considerably.
Such a question would be better suited to be asked in the nf-core slack channel #ampliseq, see https://nf-co.re/join

from ampliseq.

susheelbhanu avatar susheelbhanu commented on September 23, 2024

Awesome, thank you. To my knowledge, I'm using the primer sequences the company provided. And thanks for the link the slack channel. Will use that going forward.

from ampliseq.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.