nf-feelnc is a Nexflow pipeline for detection of lncRNA in reference and non reference genome annotation using feelnc program. It takes care of uncompressing genome and run the 3 feelnc scripts. This simple pipeline has been extracted and modified from TAGADA pipeline.
To use this pipeline you will need:
- Nextflow >= 21.04.1
- Docker >= 19.03.2 or Singularity >= 3.7.3
A small dataset is provided to test this pipeline. To try it out, use this command:
nextflow run sguizard/nf-feelnc -profile test,docker -revision 1.2
The pipeline is written in Nextflow, which provides the following default options:
Option | Parameters | Description | Requirement |
---|---|---|---|
-profile |
profile1,profile2 |
Profile(s) to use when running the pipeline. Specify the profiles that fit your infrastructure among singularity ,docker , kubernetes ,slurm . |
Required |
-revision |
version |
Version of the pipeline to launch. |
Optional |
-work-dir |
directory |
Work directory where all temporary files are written. |
Optional |
-resume |
Resume the pipeline from the last completed process. |
Optional |
For more Nextflow options, see Nextflow's documentation.
Option | Parameters | Description | Requirement |
---|---|---|---|
--ref_annotation |
ref_annotation.gtf |
Input reference annotation file or url. |
Required |
--new_annotation |
new_annotation.gtf |
Input reference annotation file or url. |
Required |
--genome |
genome.fa |
Input genome sequence file or url. |
Required |
--feelnc-args |
'--mode shuffle' |
Custom arguments to pass to FEELnc's coding potential script when detecting long non-coding transcripts. |
Optional |
--max-cpus |
16 |
Maximum number of CPU cores that can be used for each process. This is a limit, not the actual number of requested CPU cores. |
Optional |
--max-memory |
64GB |
Maximum memory that can be used for each process. This is a limit, not the actual amount of alloted memory. |
Optional |
--max-time |
12h |
Maximum time that can be spent on each process. This is a limit and has no effect on the duration of each process. |
Optional |
The pipeline executes the following processes:
- Decompress annotations and/or genome files if its gziped.
- Detect long non-coding transcripts with FEELnc.
The annotation saved toresults/annotation
is updated with the results. - Aggregate quality controls into a report with MultiQC.
The report is saved toresults/control
in a.html
file.
Many thanks to Sarah Djebali (@sdjebali), Sylvain Foissac, Cervin Guyomar and Cyril Kurylo for their help and advices.
The GENE-SWitCH project has received funding from the European Union’s Horizon 2020 research and innovation program under Grant Agreement No 817998.
This repository reflects only the listed contributors views. Neither the European Commission nor its Agency REA are responsible for any use that may be made of the information it contains.