Neural Magic's Projects
A high-throughput and memory-efficient inference and serving engine for LLMs
General information, model certifications, and benchmarks for nm-vllm enterprise distributions
Various utilities for use with nm-vllm
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Framework-agnostic sliced/tiled inference + interactive UI + error analysis plots
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
ML model optimization product to accelerate inference
LLM training code for MosaicML foundation models
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
Supercharge Your Model Training
LLM training code for MosaicML foundation models
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
A high-throughput and memory-efficient inference and serving engine for LLMs
Benchmarking Repo for vLLM
Simple benchmarking utility for vLLM Server
A simple, fully convolutional model for real-time instance segmentation.
YOLOv3 in PyTorch > ONNX > CoreML > TFLite
YOLOv5 in PyTorch > ONNX > CoreML > TFLite