Driss Guessous's Projects
The torchao repository contains APIs and workflows for quantizing and pruning GPU models.
Helpful tools and examples for working with flex-attention
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
Continuous builder and binary build scripts for PyTorch
Final Project for CS 513 Data Curation
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples showing how to use it
CUDA Templates for Linear Algebra Subroutines
Different projects in data science
DCGAN project
CUDA extensions for PyTorch
A place to share my data science projects
On-device AI across mobile, embedded and edge for PyTorch
C++ extensions in PyTorch
Fast and memory-efficient exact attention
🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.
A small classifier and server
Topic Modelling for Humans
Compiler for Neural Network hardware accelerators
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110).
Hackable implementation of state-of-the-art open-source LLMs based on nanoGPT. Supports flash attention, 4-bit and 8-bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code
This is my current playlist and order of operations for setting up a dev environment on a new macOS computer.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
The website for PyTorch
2D wave equation simulator