Neural Magic's Projects
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Hackathon 2022
A framework for the evaluation of autoregressive code generation language models.
Causal depthwise conv1d in CUDA, with a PyTorch interface
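To illustrate what the entry above computes: in a causal depthwise conv1d, each channel is convolved with its own kernel, and the input is left-padded so that output `t` depends only on inputs at times ≤ `t`. This is a minimal pure-Python sketch of the operation, not the CUDA implementation the repo provides:

```python
def causal_depthwise_conv1d(x, weights):
    """Causal depthwise 1-D convolution (pure-Python sketch).

    x: list of channels, each a list of T values.
    weights: one kernel (a list of K taps) per channel.
    Left-padding with K-1 zeros enforces causality: out[t] only
    sees inputs at positions t-(K-1) .. t.
    """
    out = []
    for channel, kernel in zip(x, weights):
        k = len(kernel)
        padded = [0.0] * (k - 1) + channel  # left-pad for causality
        out.append([
            sum(kernel[j] * padded[t + j] for j in range(k))
            for t in range(len(channel))
        ])
    return out
```

The real kernel fuses this loop across channels on the GPU; the sketch only shows the indexing convention.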
CLIP-like model evaluation
A safetensors extension to efficiently store sparse quantized tensors on disk
CUDA Templates for Linear Algebra Subroutines
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Sparsity-aware deep learning inference runtime for CPUs
Repo for building and packaging a 1-click app for DigitalOcean
Top-level directory for documentation and general content
NeuralMagic fork of EvalPlus (Rigorous evaluation of LLM-synthesized code - NeurIPS 2023)
Notebooks using the Neural Magic libraries
woop wooop
Helm charts for deploying NM VLLM
Reference implementations of MLPerf™ inference benchmarks
⚡ Building applications with LLMs through composability ⚡
NM fork of LLM foundry for compatibility with SparseAutoModel.
A framework for few-shot evaluation of language models.
A framework for few-shot evaluation of autoregressive language models.
Mamba SSM architecture
NM fork of MixEval compatible with SparseAutoModel.
Neural Magic GitHub Actions (GHA)
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
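For context on the quantization entry above, this is a hedged sketch of the simplest form of weight quantization: symmetric round-to-nearest onto a 4-bit integer grid. GPTQ itself is more sophisticated (it uses second-order information to compensate rounding error), but the storage idea is the same: low-bit integers plus a per-row scale.

```python
def quantize_rtn(weights, bits=4):
    """Symmetric round-to-nearest quantization (illustrative only;
    assumes at least one nonzero weight)."""
    qmax = 2 ** (bits - 1) - 1            # e.g. 7 for 4-bit symmetric
    scale = max(abs(w) for w in weights) / qmax
    q = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float weights from integers and a scale."""
    return [v * scale for v in q]
```

A 4-bit row stores one integer per weight and a single float scale, which is where the memory savings come from.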
Neural Magic Docker