Comments (11)
I checked the bgq micro-kernels, and they do in fact honor both row and column strides. However, they do not implement the "beta == 0" case in a nan/inf-safe manner. The "beta == 0" case is used by the operations you quote, so my best guess is that this is the problem.
from blis.
Dominic: Is it just the complex cases that are broken?
Field: Yeah, I never went back and fixed the 'beta == 0' case when that became a requirement. I should do that, but since the single-precision kernels are just the reference ones, that isn't the only problem that we're seeing here.
What are the tests with names like blis__4mh__? Are they the only ones that fail?
-Tyler
On Mon, Aug 31, 2015 at 8:56 AM, Field G. Van Zee [email protected]
wrote:
I checked the bgq micro-kernels, and they do in fact honor both row and
column strides. However, they do not implement the "beta == 0" case in a
nan/inf-safe manner. The "beta == 0" case is used by the operations you
quote, so my best guess is that this is the problem.
Tyler: Hmm, now that I think about it, my theory about "beta == 0" would not explain why the cgemm4mh is failing. I would be willing to debug this, but unless/until we reproduce it on an Intel architecture, I can't really justify making it a priority. (I don't have access to, much less experience with, bgq hardware.)
@fgvanzee I can get you access easily. You just have to say you want it :-)
Tyler: Apparently all of the failing cases are complex cases. Thanks!
Field: We can also arrange an account if you need access to our system to debug BLIS.
Hi, is there any update on this thread?
Is anyone trying to fix this problem?
@fgvanzee Is this the same thing as issue #51?
@tlrmchlsmth Probably. However, the vulnerability to NaNs/Infs due to not handling "beta == 0" still needs to get fixed eventually, even if it is not causing the symptoms originally reported by Dominic, so I figured we should keep the issue open.
@tlrmchlsmth, according to #51, setting the compiler flag to -O0 should fix that problem, but it does not fix the problem on BGQ (version 0.2.0):
blis_cgemm3m1_hh_ccc 400 400 400 0.000 4.31e-07 PASS
blis_cgemm4mh_nn_ccc 100 100 100 0.000 4.21e-04 FAILURE
blis_cgemm4mh_nn_ccc 200 200 200 0.000 1.34e-04 FAILURE
blis_cgemm4mh_nn_ccc 300 300 300 0.000 7.35e-05 MARGINAL
blis_cgemm4mh_nn_ccc 400 400 400 0.000 5.06e-05 MARGINAL
blis_cgemm4mh_nc_ccc 100 100 100 0.000 3.29e-04 FAILURE
blis_cgemm4mh_nc_ccc 200 200 200 0.000 1.50e-04 FAILURE
blis_cgemm4mh_nc_ccc 300 300 300 0.000 7.29e-05 MARGINAL
blis_cgemm4mh_nc_ccc 400 400 400 0.000 4.83e-05 MARGINAL
blis_cgemm4mh_nt_ccc 100 100 100 0.000 3.95e-04 FAILURE
blis_cgemm4mh_nt_ccc 200 200 200 0.000 1.42e-04 FAILURE
blis_cgemm4mh_nt_ccc 300 300 300 0.000 7.96e-05 MARGINAL
blis_cgemm4mh_nt_ccc 400 400 400 0.000 4.71e-05 MARGINAL
blis_cgemm4mh_nh_ccc 100 100 100 0.000 3.93e-04 FAILURE
blis_cgemm4mh_nh_ccc 200 200 200 0.000 1.41e-04 FAILURE
blis_cgemm4mh_nh_ccc 300 300 300 0.000 7.86e-05 MARGINAL
I am closing this issue because ALCF Blue Gene/Q systems have been retired and thus friends of BLIS like me lack the ability to reproduce user errors.