Comments (27)
Hi. Do you mean mapping bulk TCR repertoire sequencing (assembled clones without cell barcodes) to scRNA-seq?
from mixcr.
Thanks. I generated TCR repertoire using mixcr with scRNA-seq data. I would like to know which cells actually contain the clonotypes. Does that make sense?
from mixcr.
What MiXCR command did you use?
from mixcr.
mixcr analyze rna-seq -s hsa --rna 1.fastq.gz 2.fastq.gz outdir
and
mixcr analyze rna-seq -s hsa --rna {{CELL0ROW:a}}_1.fastq.gz {{CELL0ROW:a}}_2.fastq.gz outdir
but the data is for scRNA-seq (I run into errors when using the 10x and Smart-seq2 presets).
from mixcr.
This preset is designed for a bulk RNA seq data.what protocol did you use for scRNA seq? Was is it 10x? What error did you get?
from mixcr.
I used 10x and Smart-seq2. For 10x, there's no preset for 3'.
from mixcr.
What chemistry did you use for 10x v2 or v3. Both are available for 3' and they differ in cell barcode length.
Also, did you encounter any errors using 'smart-seq2'? Can you share the text of the error?
from mixcr.
What are the presets for 10x v2 or v3 for 3'? Can I map the TCR clonotypes to the individual cells?
from mixcr.
We originally didn't include the 3' preset because the clonotype yield from this type of data is significantly low.
If you're using 3'10x V2, the barcodes are identical to 5', so you can use the following preset:
mixcr analyze 10x-sc-5gex \
--species hsa \
input_R1.fastq.gz \
input_R2.fastq.gz \
output
For 3'10x V3, use the command below:
mixcr analyze 10x-sc-5gex \
--species hsa \
-MrefineTagsAndSort.whitelists.CELL=builtin:3M-febrary-2018 \
--tag-pattern "^(CELL:N{16})(UMI:N{12})\^(R2:*)" \
input_R1.fastq.gz \
input_R2.fastq.gz \
output
In the resulting output tables, there will be a column showing the CELL barcoded sequence. You can utilize this to map the cells to their respective expression data.
Do you require assistance with smart-seq2 data?
from mixcr.
Thanks. In the above examples, does input_R2.fastq.gz contain the barcode+UMI sequence?
from mixcr.
Yes, everything is according to 10x protocol.
from mixcr.
Thanks. What if there are two coding fastq files?
from mixcr.
Sorry, I mislead you, in 10x CELL and UMI barcodes are in R1 and the "coding" sequence is in R2.
Do you mean that you have a longer R1 which also covers some sequence in addition to barcodes?
from mixcr.
Thanks. Yes.
from mixcr.
Then you can use --tag-pattern "^(CELL:N{16})(UMI:N{10})(R1:*)\^(R2:*)"
and --tag-pattern "^(CELL:N{16})(UMI:N{12})(R1:*)\^(R2:*)"
for V2 and V3 respectively.
from mixcr.
Thanks! I would also like to consult re smart-seq2 but I have to review my data first.
from mixcr.
Then you can use
--tag-pattern "^(CELL:N{16})(UMI:N{10})(R1:*)\^(R2:*)"
and--tag-pattern "^(CELL:N{16})(UMI:N{12})(R1:*)\^(R2:*)"
for V2 and V3 respectively.
I read here https://www.biostars.org/p/9529864/#9572200 that in cases where R1 is barcode+UMI, the rest of the read can be effectively ignored. What tag pattern should I use in that case?
from mixcr.
Also, do you have a preset for Smart-seq2 cel-seq?
from mixcr.
We originally didn't include the 3' preset because the clonotype yield from this type of data is significantly low.
If you're using 3'10x V2, the barcodes are identical to 5', so you can use the following preset:
mixcr analyze 10x-sc-5gex \ --species hsa \ input_R1.fastq.gz \ input_R2.fastq.gz \ output
For 3'10x V3, use the command below:
mixcr analyze 10x-sc-5gex \ --species hsa \ -MrefineTagsAndSort.whitelists.CELL=builtin:3M-febrary-2018 \ --tag-pattern "^(CELL:N{16})(UMI:N{12})\^(R2:*)" \ input_R1.fastq.gz \ input_R2.fastq.gz \ output
In the resulting output tables, there will be a column showing the CELL barcoded sequence. You can utilize this to map the cells to their respective expression data.
Do you require assistance with smart-seq2 data?
do you also have a preset for 10x 3' v1?
from mixcr.
Hi,
- Preset for Smartseq2 is
smart-seq2-vdj
. You can read about it here. - Use the original commands provided above (where the pattern doesn't have
R1
) to skip the rest of R1 read. - I couldn't find any info regarding 3' v1 on their website. If you can share the barcode structure and the whitelist I can help you with that.
from mixcr.
- I couldn't find any info regarding 3' v1 on their website. If you can share the barcode structure and the whitelist I can help you with that.
Thanks. https://www.biostars.org/p/462568/
10x v1
Whitelist, 737K-april-2014_rc.txt
CB length, 14
UMI start, 15
UMI length, 10 (courtesy ATpoint)
from mixcr.
Hi,
How can I run mixcr with SureCell sequences?
SureCell (18 bp barcode, 8 bp UMI): surecell, ddseq, biorad
Thanks and good day.
from mixcr.
Hi, are you certain you're using 3' v1? To my knowledge, that kit version was discontinued a long time ago.
From the link you provided, the tag pattern appears as follows:
--tag-pattern "^(CELL:N{14})(UMI:N{10})\^(R2:*)"
The corresponding command would be:
mixcr analyze 10x-sc-5gex \
--species hsa \
-MrefineTagsAndSort.whitelists.CELL=builtin:737K-april-2014_rc \
--tag-pattern "^(CELL:N{14})(UMI:N{10})\^(R2:*)" \
input_R1.fastq.gz \
input_R2.fastq.gz \
output
However, based on this link I found, the cell barcode is actually located in the Index read.
Given this, the command should be:
mixcr analyze 10x-sc-5gex \
--species hsa \
-MrefineTagsAndSort.whitelists.CELL=builtin:737K-april-2014_rc \
--tag-pattern "^(UMI:N{10})\^(R2:*)\^(CELL:N{14})" \
input_R1.fastq.gz \
input_R2.fastq.gz \
input_I1.fastq.gz \
output
Illumina Bio-Rad SureCell 3' WTA for ddSEQ
We haven't tried this protocol before. Since it's a 3' transcriptome, we don't anticipate a substantial number of TCR/BCR clones from it. But, based on the link shared above, the preset should look like this:
mixcr analyze generic-single-cell-gex-with-umi \
--species hsa \
--tag-pattern "^(CELL1:N{6})tagccatcgcattgc(CELL2:N{6})tacctctgagctgaa(CELL3:N{6})acg(UMI:N{8})\^(R2:*)" \
input_R1.fastq.gz \
input_R2.fastq.gz \
output
Sincerely,
Mark
from mixcr.
Thanks!
Do you also have a preset for Fluidigm C1?
from mixcr.
Can you share the protocol that you used fro cDNA library construction? To my knowledge Fluidigm C1 platform can be used with different chemistry.
If you can provide the library structure I will help you with the command. UMIs, CELL barcodes any technical sequences, are all cells in the single pair of FASTQ files or each cells' sequences are in a distinct pair of FSTQ files, etc.
from mixcr.
Is it related to this #1105
from mixcr.
Yes. Many thanks!
from mixcr.
Related Issues (20)
- Analysis of single chain Fragment variable (scFv) HOT 1
- Error exportPlots shmTrees - Duplicate library: repseqio.v4.0_with_found_alleles:10090 HOT 2
- transitioning from postanalysis to exportPlots HOT 1
- Somatic Hypermutation Status of IGHV status HOT 2
- Question about filtering reads prior to Mixcr HOT 1
- Preset for long-read RNAseq with cell barcode HOT 1
- How to perform MiXCR on my spatial TCR data HOT 1
- The problem of mixcr exportPlots shmTrees HOT 3
- The majority of results in 'clones_TR[A|B].tsv' are "region_not_covered" HOT 1
- allVHitsWithScore allDHitsWithScore allJHitsWithScore allCHitsWithScore HOT 1
- FASTQ and BAM give discordant results HOT 6
- Help with TCR-seq alignment rate
- findAlleles get empty clone data HOT 11
- Rhapsody BCR+TCR full length data input HOT 2
- Generating all all_contig_annotations.json to run enclone after MiXCR
- Postanalysis output explanation HOT 3
- `Feature for allele search doesn't intersect JGene` error when running `findAlleles` HOT 1
- Runtime error of "Can't apply step BuildingInitialTrees" when running findShmTrees HOT 3
- Question: Does mixcr take sequence stagger into consideration? HOT 1
- ERROR: picocli.CommandLine$ExecutionException: Error while running command align java.lang.IllegalStateException: Check failed. HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mixcr.