Comments (4)
Hi,
Good point. The way coverm is interpreting this is that you want a BAM file with only proper pairs that don't pass the --min-read-percent-identity 0.95 --min-read-aligned-percent 0.7
thresholds. It doesn't "invert" the --proper-pairs
flag.
I suggest taking a two step approach - grab reads that fail in the same way you did above, and then concatenate unpaired reads and unmapped reads with samtools and its flags for filtering.
Does that make sense? I understand this is a bit confusing, and the documentation could be better - will fix.
ben
from coverm.
Thanks for this quick and helpful response! I see, I think that makes sense. One follow up question, I guess I would then expect the bam file to still contain a matching number of reads in the forward and reverse file with this proper pairs flag, rather than having a different number between the files. I.e., if only proper pairs are passed to this output bam file, and the bam file contains output of properly paired reads which do not meet the identity and alignment requirements, should the fwd and rev fastq files have the same number of reads? Could you explain this piece a bit more?
from coverm.
Good q.
I think what is happening is that they are aligned as a proper pair, but then since you used thresholds which apply to a single side of each pair, then only one of the pair is present in the output BAM. Does that make sense?
I have to admit it has been a while since I looked at that code though. Perhaps it would be worth verifying by inspecting the mappings of now-singleton mappings in the original file?
from coverm.
I see, yes that does make sense. I'll definitely inspect the singletons to see if that is the case. Thanks!
from coverm.
Related Issues (20)
- Read minimum length & small RNA mapping HOT 3
- A question on RPKM calculation HOT 2
- Checkm2 tsv for `coverm cluster --checkm-tab-table` HOT 1
- Question about TPM HOT 2
- Calculations of min-read-percent-identity and min-read-aligned-length HOT 2
- Zero covered bases and mean coverage but non-zero reads mapping
- --contig-end-exclusion doesn't work with -m not set to mean HOT 2
- Feature request: Handle secondary alignments and new release HOT 5
- CoverM reports near 0 reads mapped despite almost all reads seemingly being mapped HOT 5
- Extracting bam files HOT 1
- CoverM to assess genomes relative proportion in Metatranscriptomics data?
- How can I provide extra parameters to BWA using the "--bwa-params" option? HOT 3
- [2023-06-04T07:45:16Z INFO coverm::contig] In sample 'cdhit_rep_seq.fna/SRR13083091_1.fq.gz', found 0 reads mapped out of 0 total (NaN%) HOT 3
- libtinfow.so.6: no version information available (required by samtools) HOT 2
- Q: coverM compatible with bowtie2 HOT 1
- Usage of --sharded
- default rep seq picking method HOT 2
- Unable to find BAM file when file exists HOT 2
- thread 'main' panicked at 'index out of bounds: the len is 6564 but the index is 6586' HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from coverm.