Comments (6)
Hi Xiaotong,
Yes, that's correct. If you do not use SpeedSeq for alignment, you'll have to generate discordant and split reads. We already have a tool for doing this, called Extract SV Reads. You can find the software and documentation here: https://github.com/hall-lab/extract_sv_reads.
Best,
Colby
from speedseq.
What aligner were you using? We have only tested samblaster's splitter/discordant extraction on BWA-MEM aligned files (using the -M flag to mark shorter split reads as secondary).
Also, is your BAM file sorted by read ID?
from speedseq.
We are using bwa mem (0.7.5a-r405) , yes we do mark them using -M and yes the bam file was read name sorted for samblaster to run. These is the example command:
bwa mem -t 4 -PM
Thanks for your quick reply.
from speedseq.
Interesting, I'm not sure why that wouldn't work. Could you send me the first 10000 or so lines from your BAM to cc2qe at virginia dot edu? I probably won't have time to look at it until tomorrow at the earliest though.
In the meantime, we have this script that may work on your file:
https://github.com/hall-lab/svtools/blob/master/extractSplitReads_BwaMem
from speedseq.
BAM file lacked secondary reads, potentially due to different behavior of BWA MEM prior to v0.7.7.
closing.
from speedseq.
Hi @cc2qe ,
So if the BAM is aligned by bwa aln
and not sorted by QNAME, then the internal default spl extraction of speedseq sv
won't work, right?
I noticed the discordant reads were generated properly but not split reads. In contrast, if I explicitly run speedseq realign
on my BAMs first, then the split reads extraction is successful, but this step also does the entire BAM which is a big waste of time when we are only interested in the reads supporting potential SVs.
Do you have the plan to make the default behavior of speedseq sv
to be compatible with bwa aln
BAMs?
Thanks,
Xiaotong
from speedseq.
Related Issues (20)
- compilation of freebayes module fails on Ubuntu 18.04 HOT 2
- Error -running speedseq SV - missing pair end parameters:mean stdev
- Parallel link is dead HOT 1
- problems about vcf files.
- Warning: --split_bam (-S) is deprecated
- speedseq var and sv only on genomic regions
- Read block operation failed with error HOT 1
- is there a container? HOT 2
- Linker error when doing `make cnvnator`
- -M parameter for BWA MEM alignment
- error: static assertion failed: comparison object must be invocable as const
- Bam file not generated using speedseq align
- mismatched line lengths at line 3 within sequence
- Speedseq is runnning infinitely HOT 1
- [Question] Is ARM64 supported ? HOT 1
- configure: error: cannot guess build type; you must specify one
- error: cannot guess build type; you must specify one
- make :can not find -lCore -lRIO -lHist -lGraf -lGpad -lTree -lMathCore
- Are CRAM files supported by SpeedSeq?
- Compile Errors: *** [Makefile:59: cnvnator] Error 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from speedseq.