
Comments (5)

Dfupa commented on June 9, 2024

Hi @SaelinB,

Only adjacent contigs connected by at least 5 read pairs (the default value of the -j, --joins parameter in the script) are joined into scaffolds by SSPACE3. With --nogapclosing set, no further action is taken to fill the gaps before the final reduction step.
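
To illustrate the join threshold, here is a minimal sketch in Python. It is not SSPACE3's actual code: the function name `select_joins`, the `min_links` argument, and the contig names are hypothetical, and orientation and distance checks are left out. It only shows the idea of keeping contig pairs supported by at least a minimum number of read pairs.

```python
from collections import Counter


def select_joins(pair_links, min_links=5):
    """Keep contig pairs supported by at least min_links read pairs.

    pair_links holds one (contig_a, contig_b) tuple per read pair whose
    two mates map to different contigs. min_links mirrors the default
    of 5 mentioned for -j/--joins; everything else is an assumption.
    """
    # Count links per unordered contig pair.
    counts = Counter(tuple(sorted(pair)) for pair in pair_links)
    return {pair: n for pair, n in counts.items() if n >= min_links}


# Hypothetical example: ctg1-ctg2 has 6 supporting pairs, ctg2-ctg3 only 3.
links = [("ctg1", "ctg2")] * 6 + [("ctg2", "ctg3")] * 3
print(select_joins(links))
```

With the default threshold only the ctg1/ctg2 pair is kept. Lowering `min_links` to 3 would keep both pairs.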

Let me know if this answers your question!

from redundans.

SaelinB commented on June 9, 2024

Thanks for your response!

I'm a little new to this, and I guess I'm a bit confused as to exactly what gap closing is doing. I saw that after running redundans both normally and with --nogapclosing, scaffolds.fa and scaffolds.reduced.fa both have gaps as Ns of variable lengths. So, how are these Ns different between scaffolds.fa/scaffolds.reduced.fa with or without gap closing? And when gap closing is on, how are the Ns different between scaffolds.fa, scaffolds.filled.fa and scaffolds.reduced.fa?


Dfupa commented on June 9, 2024

Hi @SaelinB,

Redundans uses GapCloser from SOAPdenovo to close the gaps that emerge during scaffolding. It remaps the paired-end reads to the scaffolds. If one read of a pair maps to a scaffold and its mate falls in a gap, GapCloser tries to use the read sequences to reconstruct the gap sequence. Keep in mind that gap closure depends on sequencing depth: sufficient depth in each insert-size library can significantly improve the filling of gaps of the corresponding sizes.

After each step (reduction, scaffolding, gap closing, etc.), the number of 'N's can change. This can happen when chunks of heterozygous contigs are removed, when scaffolding creates new gaps, or when gaps are filled with high-quality sequence. These changes often lead to fewer 'N's at the later steps, although this isn't always the case. The number of 'N's is informative, but other metrics also help you assess assembly quality before and after the process: a higher N50, longer contigs, fewer total contigs, and a higher percentage of homozygous contigs.
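
If you want to compare the files yourself, a small script can report the gaps and a few of these metrics for each FASTA. This is a minimal sketch, not part of redundans: it assumes plain, uncompressed FASTA files, treats every run of N as one gap, and the function names and the script name are made up for illustration.

```python
import re
import sys


def read_fasta(path):
    """Yield (name, sequence) pairs from an uncompressed FASTA file."""
    name, chunks = None, []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if name is not None:
                    yield name, "".join(chunks)
                name, chunks = line[1:], []
            elif line:
                chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def n50(lengths):
    """Length L such that sequences of length >= L hold at least half of all bases."""
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length
    return 0


def assembly_stats(seqs):
    """Summarise sequence count, total size, gap count, gap size and N50."""
    seqs = list(seqs)
    # Each uninterrupted run of N (or n) is counted as one gap.
    gaps = [m.end() - m.start() for s in seqs for m in re.finditer(r"[Nn]+", s)]
    return {
        "sequences": len(seqs),
        "total_bp": sum(len(s) for s in seqs),
        "gaps": len(gaps),
        "gap_bp": sum(gaps),
        "n50": n50([len(s) for s in seqs]),
    }


if __name__ == "__main__":
    for path in sys.argv[1:]:
        print(path, assembly_stats(seq for _, seq in read_fasta(path)))
```

Saved under a name of your choice (say gap_stats.py), it could be run as `python gap_stats.py scaffolds.fa scaffolds.filled.fa scaffolds.reduced.fa` to see how the gap count and gap size change from one step to the next.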

Hope this answer is informative!


SaelinB commented on June 9, 2024

Okay, thanks for the explanation!


Dfupa commented on June 9, 2024

@SaelinB Glad it helped

If you don't mind, I'll close this issue for now. If you ever need it reopened, just comment on it.

