Jinwei's Projects
AIOS: LLM Agent Operating System
A beautiful, simple, clean, and responsive Jekyll theme for academics
My blogs
Store Blog Assets
Store the comments for my homepage
A Self-Learning Guide to Computer Science
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
Gorums simplifies fault-tolerant quorum-based protocols
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
A library for advanced large language model reasoning
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
About Me
Hi!
Record my life on GitHub.
latest
A high-throughput and memory-efficient inference and serving engine for LLMs