Giter VIP home page Giter VIP logo

Comments (3)

BurntSushi avatar BurntSushi commented on June 27, 2024

It looks like your compressed database is corrupt. Every compressed database contains a blast database of the compressed sequences, and this seems to be where cablastp-search is failing.

The blast database is created at the end of compression, and if it fails, it outputs an error message. But perhaps the error message was ignored? (The rest of the database would still be created.)

In either case, it should be possible to fix this without re-running compression. (Although it would be good to run compression on a smaller database and make sure everything runs correctly. If you get an error, please come back here with it and open a new issue.)

We can fix it by generating the BLAST database:

cd path/to/cablastp-UNIREF90
makeblastdb -dbtype prot -in ./coarse.fasta -out blastdb-coarse

Then re-running your cablastp-search command should do the trick.

from cablastp.

nsarode avatar nsarode commented on June 27, 2024

Thanks for the help. There was some serious issue with the compressed database I guess, cause the coarse.fasta file is zero bytes! As a result I obviously got the "BLAST options error: File ./coarse.fasta is empty" error. All I can do for it is re-compress the database myself (someone else had done that earlier). That should take a while.
I had another COG database that was compressed properly and that seems to be running smoothly (so far).

There is another question though, even though my command is
~/path/to/capblast/cablastp-search path/database query_file --p="4" --blast-args -max_target_seqs 10 -num_threads 4 -out out.blast -outfmt '6 qseqid sseqid pident qcovs length mismatch qstart qend sstart send evalue bitscore stitle'

Why does the program ignore the num_threads and outfmt value given by me (see below) ? Should every individual parameter be preceded by --blast-args ?

Blasting query on coarse database...
blastp -db /projects/home/nsarode3/Database/capblastp-cogs_db_aa/blastdb-coarse -num_threads 16 -outfmt 5 -dbsize 47558392

from cablastp.

BurntSushi avatar BurntSushi commented on June 27, 2024

All I can do for it is re-compress the database myself (someone else had done that earlier). That should take a while.

Dang, that sucks. Sorry about that. If the compressed database is still corrupt after that, could you please post the output of your cablastp-compress command, including the command itself?

Why does the program ignore the num_threads and outfmt value given by me (see below) ? Should every individual parameter be preceded by --blast-args ?

A search in compressed space actually involves two invocations of blast. First is a blast search against the compressed database, which always uses a particular output format so that it can be read by cablastp-search. The second blast search is on the expanded hits from the first search, and that invocation of blast will use your parameters to --blast-args verbatim. But since your search is failing on the first invocation, you're seeing the results of the compressed blast search, which won't include the --blast-args.

However, it is a bug that -num_threads is set to 16. It should probably be set to the value of the -p flag. I've filed a new issue #4.

from cablastp.

Related Issues (8)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.