Giter VIP home page Giter VIP logo

Comments (4)

tridao avatar tridao commented on August 27, 2024 1

Great to hear about the Squad result!

I don't have much experience with CPU or ONNX deployment. I mostly work with GPUs.
The model architecture is quite different from what's commonly used (Transformers), so there's probably quite a bit to do to make it compatible with the ecosystem.
Some of the operations are implemented in CUDA (selective scan), idk if that would work with ONNX.
The causal_conv1d has a CUDA implementation but you can also just call pytorch.
Triton is used for RMS norm I think, and you can replace that with a reference Pytorch implementation.

from mamba.

maedoc avatar maedoc commented on August 27, 2024

I was also interested in a CPU implementation; I was curious if the functions suffixed with _ref are reference implementations (like this one https://github.com/state-spaces/mamba/blob/main/mamba_ssm/ops/selective_scan_interface.py#L311) that can be used to guide new implementations?

In the past I had quite some success porting CUDA/OpenCL kernels to CPU using ISPC (which has very similar programming model) but I saw that C10 & CUB are used here and I don't know them very well.

from mamba.

kroggen avatar kroggen commented on August 27, 2024

Modified version to work on CPU is available on this fork

I chopped a lot of code, but it works (for inference only)

from mamba.

Fiona-cn avatar Fiona-cn commented on August 27, 2024

Congratulations! But I have a question that why the 130M model doesn't preform well in training, only 15% accuracy. Am I doing something wrong?

from mamba.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.