Comments (8)
from edta.
Yes. There are files being produced in the folder '*.fa.mod.EDTA.raw/TIR/Module3_New/TIR-Learner'. I'll keep waiting.
Thank you for the quick reply.
Augusto
from edta.
Hi Augusto,
Just want to check back, did you finish the run successfully?
Thanks,
Shujun
from edta.
Hi Shujun,
Unfortunately, I had to kill the process because after 30 days it didn't finish running. I'm dealing with a large polyploid genome and the assembly is highly fragmented. When I run EDTA.pl on a small slice of contigs, it works fine. What do you think about running the pipeline on subsets of contigs?
Thanks,
Augusto
from edta.
Hi Augusto,
Thanks for your feedback. Both genome size and fragment number are deterministic for the run time of TIR-Learner. I've run through the chromosomal-scaffold wheat genome and the TIR part took about 26 days (#61). I also tested on the apple genome which is not big but with 127k sequences and it took about 10 days with large memory (>128Gb and I allocate 1 TB on it). You may split the genome into different batches such as split by ploidy, and combine them afterward (also see #61 and other issues).
Best,
Shujun
from edta.
Close due to no activity. Please reopen if the issue persists.
from edta.
Hi @oushujun,
I have approximately the same problem, except that I have 0 process corresponding to the program. I don't have any error message either. Penaeus vanamei genome (1.6 Gb)
nohup singularity exec -B /mystore /biolo/EDTA/1.9.5/edta.sif EDTA.pl --genome GCF_003789085.1_ASM378908v1_genomic.fna --overwrite 1 --sensitive 1 --anno 1 --evaluate 1 --threads 48 > run_EDTA.out 2>&1 &
########################################################
Extensive de-novo TE Annotator (EDTA) v1.9.4
Shujun Ou ([email protected])
########################################################
Mon Mar 15 16:45:20 CET 2021 Dependency checking:
All passed!
Mon Mar 15 16:45:34 CET 2021 The longest sequence ID in the genome contains 136 characters, which is longer than the limit (15)
Trying to reformat seq IDs...
Attempt 1...
Mon Mar 15 16:45:42 CET 2021 Seq ID conversion successful!
Mon Mar 15 16:45:42 CET 2021 Obtain raw TE libraries using various structure-based programs:
Mon Mar 15 16:45:42 CET 2021 EDTA_raw: Check dependencies, prepare working directories.
Mon Mar 15 16:45:47 CET 2021 Start to find LTR candidates.
Mon Mar 15 16:45:47 CET 2021 Identify LTR retrotransposon candidates from scratch.
from edta.
from edta.
Related Issues (20)
- 文件缺失 HOT 1
- Stuck by BLAST in LTR finding HOT 2
- PanEDTA test output
- [No LINE, EDTA 2.2.0] Empty LINE file after RM2
- LINE and SINE results files has 0 bp!
- ERROR: TE annotation stats results not found in B.purpurea.fasta.mod.EDTA.TE.fa.stat! HOT 1
- '调用失败' HOT 8
- Statistical genome size
- solve '*.mod.EDTA.TEanno.sum' empty HOT 5
- Unusual Output and Failure During Regular Annotation
- TE_XXX in gff3 from panEDTA
- panEDTA timing out on large genomes HOT 2
- For the RepeatModeler step why throw logs to 2>null?
- Using CDS from multiple species
- several issues in the ruuning of EDTA2
- test data HOT 6
- Test file failed
- Locate source of annotation HOT 1
- .mod.EDTA.TEanno.sum was empty HOT 1
- Can I concatenate EDTA library and Repbase library manually? HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from edta.