Comments (18)
Hi Guys. Just to let you know that I am working on adding a module for that, which is also mentioned in issue #89
😄
from scrnaseq.
Thought scverse IO package when you mentioned was the whole scverse of functions 😅
But they are actually planning to have everything as a single package? Is that so?
Anyways, so we have a plan!
Monday I come back with my saga to find a docker image. The module itself is finished with scanpy 😃
Have a nice weekend guys
from scrnaseq.
Yes, the module I am writing is using scanpy from scverse IO package to handle convertion.
Well, the scverse IO package doesn't exist yet which is a common complaint :) You have to use scanpy as you were planning to do anyways.
"What do you guys think?"
Good plan!
from scrnaseq.
"But they are actually planning to have everything as a single package? Is that so?"
"they" are the people you're talking to here :) Yes, this has been a frequent discussion.
from scrnaseq.
Yeah because I think that the actual count generation is not done in the pipeline (yet). Could be added, I think this is just lacking some extra parameters / another step that generates the matrices that @ivirshup just mentioned above.
from scrnaseq.
Available in dev
now!
from scrnaseq.
Sounds like a good thing to do - maybe work on this already in DSLv2 as the conversion to DSLv2 is likely going to be finalized during the hackathon ...?
from scrnaseq.
@fmalmeida it might make sense to make the module more general -> create scverse datastructures.
Especially with #99 the output should be MuData objects and not AnnData objects. Happy to get more detailed if you want me to.
from scrnaseq.
Would be great to align here @fmalmeida with @Zethson 👍🏻
from scrnaseq.
Of course Lukas is right - but wouldn't it be a good start to add AnnData support and extend to mudata whenever we address #99?
I guess it depends if that feature is planned for the 2.1 release...
from scrnaseq.
@grst I also think that this is another great use case for the scverse IO package @ivirshup
The containers get a bit bloated with the big packages.
from scrnaseq.
Hi @grst, @apeltzer and @Zethson.
Yes, the module I am writing is using scanpy from scverse IO package to handle convertion. It is basically done. I just need to find a working docker image for it (It is not in bioconda — helps on that are welcomed)
I believe we can do as Greg mentioned because they are different use cases.
I also saw that there is an issue #89 about converting to Seurat format (which I will also work on).
Since they are all diferente and valid use cases, I believe all of them should be options.
We can have modules for AnnData, Seurat and MuData. And later, if necessary, make then optional for users.
We could, for example, do:
- First have AnnData
- Have Seurat
- Work on #99 and then add MuData.
What do you guys think?
from scrnaseq.
Didn’t know.
Great work on the packages 😁
from scrnaseq.
Hi Guys,
A question.
How can I do conversion for other aligners? For now I was able to do for cellranger, alevin and kallistobustools with scanpy.read_10x_mtx
and scanpy.read_mtx
but am not sure about starsolo
. It that does not seem to produce .mtx
file.
from scrnaseq.
I have opened a draft PR so we can try to address stuffs like right concept, right usage of data and storage of outputs directly from there.
from scrnaseq.
AFAICT, start solo should output something like that? From the bottom of the docs:
soloOutFileNames Solo.out/ features.tsv barcodes.tsv matrix.mtx
string(s) file names for STARsolo output:
file_name_prefix gene_names barcode_sequences cell_feature_count_matrix
from scrnaseq.
Interesting ...
This is not what I am seeing.
run_star
└── star
├── Sample_X.Aligned.sortedByCoord.out.bam
├── Sample_X.Log.final.out
├── Sample_X.Log.out
├── Sample_X.Log.progress.out
├── Sample_X.SJ.out.tab
├── Sample_Y.Aligned.sortedByCoord.out.bam
├── Sample_Y.Log.final.out
├── Sample_Y.Log.out
├── Sample_Y.Log.progress.out
├── Sample_Y.SJ.out.tab
└── star
├── Genome
├── Log.out
├── SA
├── SAindex
├── chrLength.txt
├── chrName.txt
├── chrNameLength.txt
├── chrStart.txt
├── exonGeTrInfo.tab
├── exonInfo.tab
├── geneInfo.tab
├── genomeParameters.txt
├── sjdbInfo.txt
├── sjdbList.fromGTF.out.tab
├── sjdbList.out.tab
└── transcriptInfo.tab
from scrnaseq.
from scrnaseq.
Related Issues (20)
- Switch to use central nf-core modules
- Autorenaming of files in new cellranger module leads to erroneous renaming of sample ID containing R1 in it, like in SRR ids HOT 2
- Multi-omics support (meta-issue) HOT 1
- exceeded running time limit (8h) HOT 3
- Crash when specifying existing --star_index using star aligner HOT 2
- Bump up kb to 0.28.0 HOT 3
- Use nf-validation plugin HOT 1
- Update or remove universc HOT 8
- Tests for cellranger-arc HOT 1
- MuData conversion
- Utilize central modules for simpleaf workflow HOT 1
- The cellrangerarc cannot start successfully HOT 2
- When aligner is cellrangerarc the step MTX_TO_SEURAT failed HOT 3
- Avoid error on unknown headers in input.csv HOT 4
- Issues with samplesheet.csv as well as additional columns in the samplesheet HOT 3
- Samplesheet Check Bug HOT 1
- Flexible input read format with STARsolo with --soloBarcodeReadLength HOT 1
- Make scrnaseq pipeline v2.5.1 compatible with nextflow v22.04.0 HOT 1
- Clean up mtx conversion code
- Update to the latest simpleaf HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from scrnaseq.