Giter VIP home page Giter VIP logo

Comments (4)

walaj avatar walaj commented on September 29, 2024

Hi Danshu,
I think the best I could advise would be to make a histogram of the BX counts and find a reasonable cutoff for your own data. bxtools was a sort of weekend-type project to help with some 10X analyses led by some collaborators, so I wouldn't presume at this point to give much advice on interpreting the outputs. e.g. the value of "100" was completely ad hoc. I'm not really sure what is meant by their filter either, so you might reach out to the authors.

That said, I'd like to learn more sometime about 10X data. In the meantime, if you have any suggestions for the types of low-level data manipulation (like stats or split) that would be useful from bxtools, do let me know. I'd be interested in expanding this!

Best,
Jeremiah

from bxtools.

danshu avatar danshu commented on September 29, 2024

Hi Jeremiah,

Thanks for your kind advice. Actually I have tried to plot the BX counts to find a reasonable threshold. But what matters should be the characteristics of the technology, e.g. low frequency of BX count indicates poor quality of that pool, which I'm not sure. Anyway I will try to ask the company for more details.

Actually I'm trying to use my 10x genomics data for scaffolding my assembly. One common requirement of those scaffolders are attaching BX to the read name, while different tools have different styles of attaching BX. For example, ARCS (https://github.com/bcgsc/arcs) will be expecting the barcode at the end of the read name, in this format:
READNAME_TAGCATAGACATCAGA
Great if bxtools may combine filtering with attaching BXs to read names.

I managed to do this job by extracting filtered reads using "seqtk subseq" and then rename them according to the format required: "awk '{ if (NR%4==1) {split($2,a,":"); split(a[3],b,"-"); print $1"_"b[1] } else { print } }' test.fastq > rename_test.fastq".

Best,
Danshu

from bxtools.

walaj avatar walaj commented on September 29, 2024

I'm in a mood to procrastinate on other things at the moment, decided to give this a shot... Just added relabel. Does this do what you need? You can pipe the output from relabel to samtools view and AWK to get to a fastq from a BAM if you need.

from bxtools.

danshu avatar danshu commented on September 29, 2024

Thanks and I will try it later!

from bxtools.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.