Comments (2)
@matthdsm This is actually a "feature" in the sense that it reproduces exactly the same behavior as in GATK: When you pass the -L option to ApplyBQSR, it also produces a BAM file that filters out reads that are not in the specified regions. This also has an impact on HaplotypeCaller: When you pass the -L option to HaplotypeCaller, it will work only on the specified regions, but adds some padding around those. If we wouldn't filter out reads in our BQSR step, our HaplotypeCaller would therefore take more reads into account than the original HaplotypeCaller. (GATK's ApplyBQSR with -L option effectively cancels out the padding in GATK's HaplotypeCaller with -L option.)
If you have a better idea how we should handle this case while remaining compatible, we would be very interested to hear suggestions. :)
Pascal
from elprep.
Hi Pascal,
Thanks for the clarification.
I get the need for feature parity between Elprep and the GATK, it's a big selling point. With this explanation, I think it's handled as it should be.
Thanks for clearing up, feel free to close.
Cheers
Matthias
from elprep.
Related Issues (20)
- [RFE] haplotypecaller: support for custom GVCF GQ slices HOT 1
- Comment: handling RG's in SFM HOT 4
- GATK equivalency - version HOT 1
- Issue during build of v5.1.0 HOT 3
- Variant calling using `sfm` does not adhere to thread limit HOT 13
- "exit status 2" error HOT 2
- vcf-to-elsites HOT 3
- Proper running using sam input file? HOT 1
- coverage vs vcf.gz size discrepancy HOT 4
- elprep5: gVCF does not match BAM read pileup HOT 2
- `elprep split` output directory HOT 5
- Add wiki page about "creating your own `sfm` script" HOT 3
- header tag differs between runs HOT 2
- elprep merge: add input for intermediate files
- feature: index output HOT 2
- Very low number of variants called by elprep in an heterozygous genome
- elfasta-conversion error: bufio.Scanner: token too long HOT 6
- Option to output realigned BAM file during --haplotypecaller step
- build fails on ALMA linux 9
- NO_COOR error when trying to index the output bam file
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from elprep.