Comments (7)
Found the problem in the Docker thing as well after ~10 hours of debugging and without help at the QIIME2 user forum.
docker.temp = 'auto'
env {
JOBLIB_TEMP_FOLDER="/tmp"
}
Nobody tells you about this here:
https://forum.qiime2.org/t/error-no-28-out-of-memory/5758
https://forum.qiime2.org/t/error-no-28-no-space-left-on-device-with-feature-classifier-sklearn/5961
https://forum.qiime2.org/t/error-in-filtering-using-dada2/3543
@KochTobi and I found this here:
https://scikit-learn.org/stable/modules/generated/sklearn.utils.Parallel.html
and that the scikit learn application apparently looks for the env variable JOBLIB_TEMP_FOLDER
first, setting this fixed to /tmp
and using nextflows docker.temp = 'auto'
resolved the issue.
That means, we'Ll have working CI tests today!!!!
from ampliseq.
It even runs through entirely without a single error message.
=======================================================
Pipeline Name : nf-core/rrna-ampliseq
Pipeline Version: 1.0dev
Run Name : cheesy_coulomb
Reads : data/*_L001_R{1,2}_001.fastq.gz
Data Type : Paired-End
Max Memory : 6 GB
Max CPUs : 2
Max Time : 2d
Output dir : ./results
Working dir : /home/alex/IDEA/nf-core/rrna-ampliseq/work
Container Engine: singularity
Container : nfcore/rrna-ampliseq:latest
Current home : /home/alex
Current user : alex
Current path : /home/alex/IDEA/nf-core/rrna-ampliseq
Script dir : /home/alex/IDEA/nf-core/rrna-ampliseq
Config Profile : test,singularity
=========================================
[warm up] executor > local
[11/ed9b34] Cached process > get_software_versions
[8d/ab2a66] Cached process > output_documentation
[7e/7b98fb] Cached process > metadata_category_all (1)
[30/a8b348] Cached process > metadata_category_pairwise (1)
[c7/254f41] Cached process > fastqc (1_S103)
[6f/0948dd] Cached process > fastqc (1a_S103)
[6d/3a1a3c] Cached process > trimming (2_S115)
[03/385ca3] Cached process > trimming (1a_S103)
[7f/648968] Cached process > trimming (2a_S115)
[60/b91fd1] Cached process > fastqc (2a_S115)
[37/35e535] Cached process > trimming (1_S103)
[cc/5e3356] Cached process > fastqc (2_S115)
[9e/71976f] Cached process > qiime_import
[61/037f86] Cached process > qiime_demux_visualize
[74/df79da] Cached process > multiqc
[40/71b95b] Cached process > dada_trunc_parameter
[49/3a2962] Cached process > dada_single
[5c/2e4105] Submitted process > classifier (1)
[3f/937df3] Submitted process > filter_taxa (1)
[05/470625] Submitted process > RelativeAbundanceASV (1)
[6e/30bc88] Submitted process > RelativeAbundanceReducedTaxa (1)
[f3/3619b3] Submitted process > export_filtered_dada_output (1)
[82/49c195] Submitted process > tree (1)
[e3/b2432f] Submitted process > ancom (1)
[1a/f08022] Submitted process > barplot (1)
[da/aad654] Submitted process > combinetable (1)
[a3/20e214] Submitted process > report_filter_stats (1)
[01/69a2c5] Submitted process > diversity_core (1)
[5f/af8a16] Submitted process > alpha_rarefaction (1)
[47/28fc6c] Submitted process > beta_diversity_ordination (1)
[33/35de42] Submitted process > beta_diversity (1)
[f4/e7c281] Submitted process > alpha_diversity (1)
[nf-core/rrna-ampliseq] Pipeline Complete
from ampliseq.
Open an issue at the QIIME2 dev forum asking for more details on what is done in that specific step. It feels weird that neither submitting TMP or TMPDIR as environment variables is used by the environment at all.
from ampliseq.
from ampliseq.
Sorry @d4straub , I invested ~4hours today and can't get this to run at all. Need to wait for the qiime2 guys to help in here.
from ampliseq.
https://forum.qiime2.org/t/docker-qiime2-no-space-left-on-device/6851
from ampliseq.
"Fun" fact:
[74/df79da] Cached process > multiqc
[40/71b95b] Cached process > dada_trunc_parameter
[49/3a2962] Cached process > dada_single
[5c/2e4105] Submitted process > classifier (1)
[3f/937df3] Submitted process > filter_taxa (1)
This runs well when using Singularity...
from ampliseq.
Related Issues (20)
- Adding the new Greengenes2 database for classification HOT 2
- Phyloseq object creation will fail if any samples have all reads removed by the tax filtering step HOT 5
- Add blast-consensus support to Ampliseq HOT 1
- Add greengenes2 2022.10 support to Ampliseq HOT 1
- Add custom qiime reference database support to Ampliseq. HOT 2
- Edge case: Clustering with VSEARCH fails at QIIME2_INSEQ HOT 1
- Allow to analyse 454 sequencing data HOT 2
- Add option to assign ASV to multiple species with DADA2 HOT 3
- Debug information for docker-based run. HOT 4
- Allow stratified output from picrust2 HOT 4
- nf-core/ampliseq with conda - change bioconductor-biostrings HOT 2
- Launch webpage not working HOT 4
- Adding qza file for downstream analysis in R HOT 3
- When using `--vsearch_cluster`, if you have many thousands of clusters, `AMPLISEQ:FILTER_CLUSTERS` will fail with an `Argument list too long` error. HOT 8
- test_full Cannot access file fastq HOT 1
- Multipe region amplicon sequencing analysis support (5R / SMURF / q2-sidle) HOT 1
- Getting ca 50% more ASVs than when using DADA2 on QIIME2 HOT 2
- ampliseq fails during taxonomy assignation when processing ITS sequences HOT 14
- Error No subject alternative DNS name matching zenodo.org found HOT 2
- minor improvement of sort() before denoising with method = "radix HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ampliseq.