Giter VIP home page Giter VIP logo

Comments (4)

fgvanzee avatar fgvanzee commented on August 24, 2024 1

Hi there, I saw there is some code in the sandbox/power10 folder for BF16 GEMM. I suppose that is just for POWER10 machines? Is it possible to build and run code with bli_sbgemm on AMD CPU? Thanks!

Thanks for your question. Yes, that code is specific to POWER10 systems. The author (@nicholaiTukanov) likely did not intend for it to run on AMD CPUs. That said, we always encourage power users (pun not intended) to tinker around and see what you can get working!

from blis.

moderato avatar moderato commented on August 24, 2024

Hi there, I saw there is some code in the sandbox/power10 folder for BF16 GEMM. I suppose that is just for POWER10 machines? Is it possible to build and run code with bli_sbgemm on AMD CPU? Thanks!

Thanks for your question. Yes, that code is specific to POWER10 systems. The author (@nicholaiTukanov) likely did not intend for it to run on AMD CPUs. That said, we always encourage power users (pun not intended) to tinker around and see what you can get working!

Thanks for the reply. Does that mean there is no BF16 support for AMD CPUs for now?

from blis.

BhaskarNallani avatar BhaskarNallani commented on August 24, 2024

Hi Zhongyi Lin,

You can use BF16 implementation designed for zen4 and above, which are available in aocl_gemm addon in amd/blis https://github.com/amd/blis/tree/master/addon/aocl_gemm

You can build clone amd version of blis and build with aocl_gemm addon and call one of the below api's which has similar arguments, one can pass null for post-ops structure argument if intended to use only for gemm. API definitions available in this file https://github.com/amd/blis/blob/master/addon/aocl_gemm/aocl_gemm_interface_apis.h

aocl_gemm_bf16bf16f32of32( ) - This API accumulates at float (f32) precision and gives the output in float (f32)
aocl_gemm_bf16bf16f32obf16( ) - This API accumulates at float (f32) precision and gives the output in bf16 format (which is half the size)

Bhaskar

from blis.

moderato avatar moderato commented on August 24, 2024

Hi Zhongyi Lin,

You can use BF16 implementation designed for zen4 and above, which are available in aocl_gemm addon in amd/blis https://github.com/amd/blis/tree/master/addon/aocl_gemm

You can build clone amd version of blis and build with aocl_gemm addon and call one of the below api's which has similar arguments, one can pass null for post-ops structure argument if intended to use only for gemm. API definitions available in this file https://github.com/amd/blis/blob/master/addon/aocl_gemm/aocl_gemm_interface_apis.h

aocl_gemm_bf16bf16f32of32( ) - This API accumulates at float (f32) precision and gives the output in float (f32) aocl_gemm_bf16bf16f32obf16( ) - This API accumulates at float (f32) precision and gives the output in bf16 format (which is half the size)

Bhaskar

Hi Bhaskar, thank you for this valuable information. Will try and let you know.

from blis.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.