Comments (8)
Hi!
Generally, for assembling any kind of organelle genome, you have to firstly get the organelle-sufficient graph, which I defined to be an assembly graph with contigs completely covers the whole organelle genome. In your case, the assembly graph is not sufficient, meaning that there are about three gaps. Considering the k-mer coverage, this is nice data and it is very likely to get the mitogenome-sufficient graph after adjusting some options when running GetOrganelle. Could you please attach the get_org.log.txt file? I can barely tell what to do to improve the assembly graph.
Only when we did obtain the mitogenome-sufficient graph, we could disentangle the graph into complete sequences. You actually do not need to watch the video and manually disentangle a nice graph like this after you make it "sufficient". You can simply use the script disentangle_organelle_assembly.py (which would be automatically called by the main script). However, for mitogenome with large scale repeats like this graph, there would be lots possible paths, which are potential configurations for the mitogenome. Even if you had really long insert size for the sequencing library, different isomers of mitogenome (even of plastome) coexist in plant is quite common.
Let's look at the log file first~
from getorganelle.
Alright! I'm attaching the log-file! I went through the video while going through the paper and I thought it showed good clarity!
Please do let me know, currently I have just used the default parameters with the anti-seed being set as a chloroplast genome
from getorganelle.
I would try using a broader range of k-mer gradient, "-k 21,35,45,55,75,95,105,115,127" first.
You could make a duplicate for previous result and deleted the filtered_spades folder and re-run get_organelle_reads.py with your previous options together with "-k 21,35,45,55,75,95,105,115,127 --continue", thus the previous extension part would be skipped.
Let me know the progress!
from getorganelle.
Sure, I'm currently starting the run again.
from getorganelle.
Hi,
I have finished the run for the same sample using the extended kmer gradient as per your suggestion, I'm also attaching the log file.
It seems that the graph is still mitogenome-insufficient and as such can't be circularized.
Thanks for the help
get_org.log.txt
from getorganelle.
Hi,
I mean you should try "-k 21,35,45,55,75,95,105,115,127" rather than 75,85,95,105,115,127 in your log file. Also, you have to delete previous assembly_graph.fastg files in the output folder, otherwise the slim & disentangle process would be skipped as well when "--continue" used.
If you cannot quite get the usage of continue, you can start a new run by using:
get_organelle_reads.py -1 ../../1_R1.clean.fastq -2 ../../1_R2.clean.fastq -s /data/ref_genomes/Eucalyptus_grandis/Eucalyptus_mitochondrion.fasta -a /data/ref_genomes/pomogranate/chloroplast.fasta --genes /data/ref_genomes/Eucalyptus_grandis/Eucalptus_MT_coding_sequnce.fasta -o mitochondria_Sample1 -R 30 -t 20 -k 21,35,45,55,75,95,105,115,127 -F plant_mt
If by doing this you still fail with using "-k 21,35,45,55,75,95,105,115,127", you may try to remove the "-a /data/ref_genomes/pomogranate/chloroplast.fasta" option, for the target mitogenome might share some chloroplast gene/sequence and was removed by the anti-seed. But I would try the "-k 21,35,45,55,75,95,105,115,127" first.
from getorganelle.
Thank you, I'll do that and get back to you.
from getorganelle.
Since there are no follow-ups, I'm closing this issue. Feel free to reopen it if you have further questions.
from getorganelle.
Related Issues (20)
- ERROR: Disentangling failed: Failed in 'from scipy import stats, inf, log'! HOT 3
- Regarding the Use of Incomplete Plant Mitochondrial Genome Assembly Outputs for Subsequent Analysis HOT 2
- failing from flye assembly graph of a phage genome with 4 circular paths HOT 5
- Disentangling failed: 'Unable to generate result with single copy vertex percentage < 50%' HOT 20
- TypeError: execute_blast() got an unexpected keyword argument 'percent_identity' HOT 2
- An input/output error while running Getorganelle HOT 1
- Increase the insert size length to improve assembly completeness HOT 1
- difference in assembly size of circular and scaffold cp genomes #326 HOT 2
- No target organelle contigs found / no sequence hit our LabelDatabase HOT 1
- No animal_mt seed reads found!
- TypeError: can't convert complex to int. HOT 1
- math domain error
- get_organelle_from_reads.py can not find the embplant_nr db in SGE
- something went wrong when using -F anonym without --target-genome-size
- Abnormal assembly size due to incorrectly including shallow-coverage mt contig(s) in the embplant_pt selected_graph HOT 2
- ValueError: max() iterable argument is empty HOT 1
- Extended SPAdes was created for low quality sequence read instead complete assembly HOT 2
- 叶绿体基因组组装不成环 HOT 1
- (ERR): bowtie2-align died with signal 9 (KILL)
- Error with running SPAdes: == Error == exception caught: <class 'AttributeError'> HOT 8
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from getorganelle.