Giter VIP home page Giter VIP logo

Comments (9)

kmnip avatar kmnip commented on July 18, 2024

Can you please report the version and the exact command you used?

from rna-bloom.

xiekunwhy avatar xiekunwhy commented on July 18, 2024

Hi,

I use latest conda version, and jar v1.4.3 also has the same problem.

I assembled 30 samples separately, only 3 of them were finished normally, 27 samples got above errors.

here is the log file
rna-bloom.zip

Best,
Kun

from rna-bloom.

kmnip avatar kmnip commented on July 18, 2024

Thanks. I am posting part of your command here for future references:

rnabloom -sensitive true -bound 500000 -f true -left ZD_E28_3_1.fq.gz -right ZD_E28_3_2.fq.gz -revcomp-right -t 8 -outdir ZD_E28_3

There are several things wrong with your command:

  1. -sensitive does not take any arguments. -sensitive true would not work.
  2. -f does not take any arguments. -f true would not work. Typically, you don't need to use this option unless the command is a re-run and you want to overwrite all existing files.
  3. -bound 500000 is set way too large. I would not expect the fragment size for your paired-end reads to be 500Kbp. Usually, it is well under 1,000 for a typical Illumina RNA-seq sample. Setting a bound that high will result in an extremely long runtime for stage 2.
  4. Since you have installed from conda, please include the -ntcard option to automatically calculate the appropriate Bloom filter sizes. From one of the log files, I noticed that stage 1 was re-run automatically due to a very high Bloom filter false positive rate.

Taking all this together, this modified command should work:

rnabloom -ntcard -sensitive -bound 500 -f -left ZD_E28_3_1.fq.gz -right ZD_E28_3_2.fq.gz -revcomp-right -t 8 -outdir ZD_E28_3

from rna-bloom.

xiekunwhy avatar xiekunwhy commented on July 18, 2024

Thank you for pointing out.

Sorry, the .sh files above is the second run, and the log files are the first run's log.
here is the first run command line: rnabloom -left ZD_E28_3_1.fq.gz -right ZD_E28_3_2.fq.gz -revcomp-right -t 8 -outdir ZD_E28_3
all are default except input and output parameters.

I changed the parameters as I want to let rnabloom run through, but failed again.

Best,
Kun

from rna-bloom.

kmnip avatar kmnip commented on July 18, 2024

What is the error this time? And, can you please remove the output directory ZD_E28_3 if it already exists and try again?

from rna-bloom.

kmnip avatar kmnip commented on July 18, 2024

Also, please use the ntcard option as mentioned previously.

from rna-bloom.

xiekunwhy avatar xiekunwhy commented on July 18, 2024

Hi,

rnabloom finished normally this time (rnabloom -left ZD_E28_3_1.fq.gz -right ZD_E28_3_2.fq.gz -revcomp-right -t 20 -outdir ZD_E28_3 -ntcard), but produced too many transcripts (126327 transcript in rnabloom.transcripts.nr.fa file), the gene number of this species is about 20000, so 126327 transcript is too many. Is there a way or some parameters to reduce transcript number?

Best,
Kun

from rna-bloom.

kmnip avatar kmnip commented on July 18, 2024

You may consider using EvidentialGene:
http://arthropods.eugenes.org/EvidentialGene/about/EvidentialGene_trassembly_pipe.html

from rna-bloom.

xiekunwhy avatar xiekunwhy commented on July 18, 2024

EvidentialGene is not so easy to use.

from rna-bloom.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.