Comments (2)
Hello @JulianSpagnuolo
I missed your issue sorry for the late response.
You will almost always see a discrepancy between rna and tag data for droplet based protocols. This is because the lowest RNA content cells that might not be selected in the end will still be valid from a tag perspective.
As an example, High mitochondrial/dying cells will come up with tags but won't be kept for RNA because of the low amount RNA.
There is not cell barcode correction in CITE-seq-Count, so you definitely lose a few reads. But the sensitivity of CITE-seq is so high that I would not worry about this.
You can "adjust" this by using a whitelist of cell barcodes. Either take the top cells in RNA and use those for the tags or the other way around. You could also be very broad and take the union of top cells from each.
There is definitely an incentive to integrate both data in dropSeqPipe. It will be done in the future, but there is no fixed plan as for a specific release date yet.
from cite-seq-count.
Thanks @Hoohm , thanks for the feedback, I have found that it is better to use cell barcodes that have high antibody counts to find the "good" cell barcodes in the transcriptome data (because of the high sensitivity of the method).
As for the barcode correction - I have already gone ahead and written my own Rcpp solution (currently available on my git: julianspagnuolo/adtseq) that is compatible with the dropseq-tools pipeline written by @jamesnemesh
I also modified the antibody barcode detection to take into account the phred quality of the base call since I was concerned about mismatches at low quality base-calls (not always a problem but I had a bad sequencing library to get data from).
from cite-seq-count.
Related Issues (20)
- Different "UMIs corrected" results between v1.4.3 and v1.4.5 HOT 2
- Code for running multiple lanes
- ValueError: columns cannot be a set HOT 5
- Python error when there are too many cells? Script never finishes HOT 2
- Cell Hashing on Multiome samples HOT 11
- bug in preprocessing.py -> get_read_paths() (version 1.4.3) HOT 1
- method keeps looking for whitelist when not provided. HOT 8
- 100% unmapped
- Discussion question -- is it normal to take so long? HOT 3
- Include path to tag file in run report HOT 1
- barcodes with 0s in all protein tags and unmapped roll HOT 2
- Running Cite-Seq-count with next GEM protocol HOT 2
- Pandas error after correcting UMIs HOT 1
- Allow to split fastq or get list of readID per cell HOT 2
- Unable to locate temp file while merging results
- Does not compile using Python version > 3.9.x please fix.
- Gene and Antibody both have a pair of fastq files, how to run CITE-seq-Count?
- Please, trim all sequences at the same length.
- CITE-seq-Count run gets stuck HOT 4
- ValueError: columns cannot be a set HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cite-seq-count.