Giter VIP home page Giter VIP logo

Comments (6)

martin-steinegger avatar martin-steinegger commented on August 18, 2024

Congratulations. :)

Since you can expect the same result as the x86_64 implementation. I would pick 100 random proteins from the Uniprot (I can do this for you if you are not familiar with the Uniprot) and run the both HHblits versions (x86 and ppc64le) and diff the results.
Be aware that using the ppc64le SIMD float units might result in slight differences in the score (+- 0.1).

from hh-suite.

sowani avatar sowani commented on August 18, 2024

@martin-steinegger Thanks for your help! Could you please send me results generated on x86 for 100 random proteins? If the resulting data is going to be too big to transfer, will it be possible for you to create a script which I can execute in my x86 environment and generate the results locally?

from hh-suite.

martin-steinegger avatar martin-steinegger commented on August 18, 2024

I picked 100 random sequences from the Unprot database and wrote a small script to generate for each of the 100 sequences a hhr and a A3M file. Just call this script for the x86 and the ppc64le version and diff the results. Please let me know if you have any questions.
100randsequences.zip

from hh-suite.

sowani avatar sowani commented on August 18, 2024

@martin-steinegger Thanks for the sequences and the script. I am using this data to generate the baseline on x86. I modified the run_benchmark.sh script slightly to tidy-up /tmp slightly. Here is the diff:

*** a/run_benchmark.sh 2016-11-20 17:10:44.000000000 +0530
--- b/run_benchmark.sh 2016-11-21 18:30:37.292000000 +0530
***************
*** 1,7 ****
#!/bin/bash
CPU=1
! tmpdir="/tmp"
IT=1
for seq in $(seq 1 100); do
awk -v line="$seq" '/>/{i++}i==line{print; next; print; exit}' 100.random.seq > $tmpdir/${seq}.fasta
hhblits -i $tmpdir/${seq}.fasta -d $DB -n $IT -oa3m $tmpdir/${seq}.a3m -o $tmpdir/${seq}.hhr -cpu $CPU
--- 1,11 ----
#!/bin/bash
CPU=1
! tmpdir="/tmp/hh-results"
IT=1
+ DB=/root/hhsuite-3.0.1-Linux/dbs/scop70_1.75
+ if [ ! -d /tmp/hh-results ]; then
+ mkdir -p $tmpdir
+ fi
for seq in $(seq 1 100); do
awk -v line="$seq" '/>/{i++}i==line{print; next; print; exit}' 100.random.seq > $tmpdir/${seq}.fasta
hhblits -i $tmpdir/${seq}.fasta -d $DB -n $IT -oa3m $tmpdir/${seq}.a3m -o $tmpdir/${seq}.hhr -cpu $CPU

Thanks!
Atul.

from hh-suite.

sowani avatar sowani commented on August 18, 2024

The script executed successfully. I timed the script and got following results on my x86 VM:
real 6m26.182s
user 6m19.844s
sys 0m5.688s
I am closing this issue now. With this as baseline I will start checking the ppc64le port now.

BTW, a suggestion - could you please include these 2 files (i.e. the contents of 100randomsequences.zip) along with hh-suite source code so that there will be a readymade test suite available for the users.

Thanks,
Atul.

from hh-suite.

lydonchandra avatar lydonchandra commented on August 18, 2024

hi @sowani , how are you ? did you end up adding more validation / regression tests ?

from hh-suite.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.