
Comments (18)

fmalmeida avatar fmalmeida commented on May 26, 2024 6

Hi Guys. Just to let you know that I am working on adding a module for that, which is also mentioned in issue #89
😄

from scrnaseq.

fmalmeida avatar fmalmeida commented on May 26, 2024 2

When you mentioned the scverse IO package, I thought you meant the whole scverse collection of functions 😅

But they are actually planning to have everything as a single package? Is that so?

Anyways, so we have a plan!
On Monday I'll be back to continue my saga of finding a Docker image. The module itself is finished and uses scanpy 😃

Have a nice weekend, guys.


Zethson avatar Zethson commented on May 26, 2024 1

"Yes, the module I am writing is using scanpy from scverse IO package to handle conversion."

Well, the scverse IO package doesn't exist yet which is a common complaint :) You have to use scanpy as you were planning to do anyways.

"What do you guys think?"
Good plan!


Zethson avatar Zethson commented on May 26, 2024 1

"But they are actually planning to have everything as a single package? Is that so?"

"they" are the people you're talking to here :) Yes, this has been a frequent discussion.


apeltzer avatar apeltzer commented on May 26, 2024 1

Yeah because I think that the actual count generation is not done in the pipeline (yet). Could be added, I think this is just lacking some extra parameters / another step that generates the matrices that @ivirshup just mentioned above.


grst avatar grst commented on May 26, 2024 1

Available in dev now!


apeltzer avatar apeltzer commented on May 26, 2024

Sounds like a good thing to do - maybe work on this already in DSLv2 as the conversion to DSLv2 is likely going to be finalized during the hackathon ...?


Zethson avatar Zethson commented on May 26, 2024

@fmalmeida it might make sense to make the module more general, so that it creates scverse data structures.
Especially with #99, the output should be MuData objects and not AnnData objects. Happy to go into more detail if you want me to.


apeltzer avatar apeltzer commented on May 26, 2024

Would be great to align here @fmalmeida with @Zethson 👍🏻


grst avatar grst commented on May 26, 2024

Of course Lukas is right, but wouldn't it be a good start to add AnnData support and extend to MuData whenever we address #99?

I guess it depends if that feature is planned for the 2.1 release...


Zethson avatar Zethson commented on May 26, 2024

@grst I also think that this is another great use case for the scverse IO package @ivirshup
The containers get a bit bloated with the big packages.


fmalmeida avatar fmalmeida commented on May 26, 2024

Hi @grst, @apeltzer and @Zethson.

Yes, the module I am writing uses scanpy from the scverse IO package to handle the conversion. It is basically done. I just need to find a working Docker image for it (it is not in Bioconda; help with that is welcome).

I believe we can do as Greg mentioned because they are different use cases.

I also saw that there is an issue #89 about converting to Seurat format (which I will also work on).

Since they are all different and valid use cases, I believe all of them should be options.

We can have modules for AnnData, Seurat and MuData. And later, if necessary, make them optional for users.

We could, for example, do:

  1. First have AnnData
  2. Have Seurat
  3. Work on #99 and then add MuData.

What do you guys think?


fmalmeida avatar fmalmeida commented on May 26, 2024

Didn’t know.
Great work on the packages 😁


fmalmeida avatar fmalmeida commented on May 26, 2024

Hi Guys,
A question.

How can I do the conversion for other aligners? So far I have been able to do it for cellranger, alevin and kallistobustools with scanpy.read_10x_mtx and scanpy.read_mtx, but I am not sure about STARsolo. It does not seem to produce a .mtx file.
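
For reference, the conversion described above can be sketched roughly as follows. This is only a minimal illustration, not the module from the PR: the function name `convert_to_h5ad` and the `is_10x_dir` flag are hypothetical, and scanpy is imported inside the function so that the file loads even where scanpy is not installed. Note that `scanpy.read_mtx` loads only the matrix, so barcodes and gene names would still have to be attached separately, and the matrix may need transposing depending on the aligner.

```python
def convert_to_h5ad(input_path, output_path, is_10x_dir):
    """Read a count matrix with scanpy and write it out as an .h5ad file.

    Hypothetical helper for illustration only.
    input_path  -- a 10x-style directory (matrix, barcodes, features) when
                   is_10x_dir is True, otherwise a single .mtx file
    output_path -- destination .h5ad file
    """
    # Lazy import: scanpy is only required when the function is called.
    import scanpy as sc

    if is_10x_dir:
        # Reads the matrix together with barcodes and feature names.
        adata = sc.read_10x_mtx(input_path)
    else:
        # Reads the bare matrix only; obs/var names are not populated.
        adata = sc.read_mtx(input_path)

    adata.write_h5ad(output_path)
```

Usage would look like `convert_to_h5ad("path/to/10x_dir", "sample.h5ad", is_10x_dir=True)`, where both paths are placeholders.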


fmalmeida avatar fmalmeida commented on May 26, 2024

I have opened a draft PR so we can sort out things like the right concept, the right usage of the data and the storage of outputs directly from there.


ivirshup avatar ivirshup commented on May 26, 2024

AFAICT, STARsolo should output something like that? From the bottom of the docs:

soloOutFileNames            Solo.out/          features.tsv barcodes.tsv        matrix.mtx
    string(s)               file names for STARsolo output:
                            file_name_prefix   gene_names   barcode_sequences   cell_feature_count_matrix
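
A quick way to check whether a results folder actually contains that triplet is sketched below. It uses only the three file names from the excerpt above. The function name `find_count_matrix_dirs` is made up for illustration, and the search is recursive because I am not assuming any particular subdirectory layout. It matches uncompressed file names only.

```python
from pathlib import Path

# File names taken from the soloOutFileNames excerpt above.
EXPECTED_FILES = ("matrix.mtx", "barcodes.tsv", "features.tsv")


def find_count_matrix_dirs(root):
    """Return the sorted directories under `root` that hold all three files."""
    root = Path(root)
    hits = []
    # Use matrix.mtx as the entry point, then confirm its two companions.
    for mtx in root.rglob("matrix.mtx"):
        folder = mtx.parent
        if all((folder / name).is_file() for name in EXPECTED_FILES):
            hits.append(folder)
    return sorted(hits)
```

An empty list for the pipeline's output directory would suggest the counting step did not run, rather than the files being stored somewhere unexpected.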


fmalmeida avatar fmalmeida commented on May 26, 2024

Interesting ...
This is not what I am seeing.

run_star
└── star
    ├── Sample_X.Aligned.sortedByCoord.out.bam
    ├── Sample_X.Log.final.out
    ├── Sample_X.Log.out
    ├── Sample_X.Log.progress.out
    ├── Sample_X.SJ.out.tab
    ├── Sample_Y.Aligned.sortedByCoord.out.bam
    ├── Sample_Y.Log.final.out
    ├── Sample_Y.Log.out
    ├── Sample_Y.Log.progress.out
    ├── Sample_Y.SJ.out.tab
    └── star
        ├── Genome
        ├── Log.out
        ├── SA
        ├── SAindex
        ├── chrLength.txt
        ├── chrName.txt
        ├── chrNameLength.txt
        ├── chrStart.txt
        ├── exonGeTrInfo.tab
        ├── exonInfo.tab
        ├── geneInfo.tab
        ├── genomeParameters.txt
        ├── sjdbInfo.txt
        ├── sjdbList.fromGTF.out.tab
        ├── sjdbList.out.tab
        └── transcriptInfo.tab


grst avatar grst commented on May 26, 2024

