BurstGPT

A GPT-3.5 & GPT-4 Workload Trace to Optimize LLM Serving Systems

This repository contains public releases of a real-world trace dataset of LLM serving workloads for the benefit of the research and academic community.

The LLM service behind this trace is powered by Microsoft Azure.

There are currently two files in /data:

  • BurstGPT.csv contains the full 2-month trace, including failed requests, which are recorded with 0 response tokens. 1429.7k lines in total.

  • BurstGPT_without_fails.csv contains the same 2-month trace with the failed requests removed. 1404.3k lines in total.
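The relationship between the two files can be sketched as a filter on the response token count. This is a minimal illustration, not the authors' actual preprocessing: it assumes the CSV header uses the column names listed under Schema, and the sample rows are made up for the example.

```python
import csv
import io

# Made-up sample standing in for data/BurstGPT.csv (values are illustrative only).
# Assumption: the header matches the column names listed under Schema.
SAMPLE = """Timestamp,Model,Request tokens,Response tokens,Total tokens,Log Type
5,ChatGPT,472,18,490,Conversation log
45,ChatGPT,1087,0,1087,Conversation log
118,GPT-4,417,276,693,API log
"""


def drop_failed(rows):
    """Keep only rows whose response token count is non-zero."""
    return [r for r in rows if int(r["Response tokens"]) != 0]


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
kept = drop_failed(rows)
print(len(rows), len(kept))  # 3 2
```

To run this on the real file, replace `io.StringIO(SAMPLE)` with an opened file handle, for example `open("data/BurstGPT.csv", newline="")`.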

Usage

  1. You may scale the RPS in the trace according to your evaluation setups.
  2. You may also model the patterns in the trace as indicated in our paper and scale the parameters in the models.
  3. If you have specific needs, we are happy to help you explore and make full use of the trace. Please send any issues or questions to the mailing list.
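One simple way to scale the RPS (item 1) is to compress or stretch the time axis, which preserves the order and relative burstiness of requests. The sketch below is only one possible approach, not a method prescribed by the authors; `scale_rps` and `mean_rps` are hypothetical helper names introduced for this example.

```python
def scale_rps(timestamps, factor):
    """Return timestamps with the time axis compressed by `factor`.

    factor > 1 raises the request rate; factor < 1 lowers it.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    return [t / factor for t in timestamps]


def mean_rps(timestamps):
    """Average request rate, taken as the inverse of the mean inter-arrival time."""
    span = max(timestamps) - min(timestamps)
    if span <= 0:
        raise ValueError("need at least two distinct timestamps")
    return (len(timestamps) - 1) / span


# Made-up submission times in seconds (illustrative only).
ts = [0, 10, 20, 40]
scaled = scale_rps(ts, 2)
print(scaled)                           # [0.0, 5.0, 10.0, 20.0]
print(mean_rps(ts), mean_rps(scaled))   # 0.075 0.15
```

Dividing every timestamp by the same factor multiplies the average rate by that factor while leaving token counts untouched. If you need to keep the original daily and weekly periods intact, subsampling or duplicating requests may be a better fit than time compression.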

Future Plans

  1. We will continue to update the time range of the trace and add the end time of each request.
  2. We will release the conversation log, including question IDs, timestamps, etc., for each conversation, so that researchers can optimize conversation services.
  3. We will open-source the full benchmark suite for LLM inference soon.

Paper

arXiv: 2401.17644

If you use the trace in your research, please cite our paper:

@misc{wang2024efficient,
      title={Towards Efficient and Reliable LLM Serving: A Real-World Workload Study}, 
      author={Yuxin Wang and Yuhan Chen and Zeyu Li and Zhenheng Tang and Rui Guo and Xin Wang and Qiang Wang and Amelie Chi Zhou and Xiaowen Chu},
      year={2024},
      eprint={2401.17644},
      archivePrefix={arXiv},
      primaryClass={cs.DC}
}

Main characteristics

  • Duration: 61 consecutive days spanning 2 consecutive months.
  • Dataset size: 1.4M lines, ~50 MB.

Schema

  • Timestamp: request submission time, in seconds since 0:00:00 on the first day of the trace.
  • Model: the model called, either ChatGPT or GPT-4.
  • Request tokens: number of tokens in the request.
  • Response tokens: number of tokens in the response.
  • Total tokens: request tokens plus response tokens.
  • Log Type: how the user called the model, either in conversation mode (Conversation log) or through the API (API log).
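The schema above can be mapped onto a typed record as in the following sketch. It assumes the CSV header uses exactly the field names listed above; `TraceRecord`, `parse_row`, and `day_and_offset` are hypothetical names introduced for this example, and the sample rows are made up.

```python
import csv
import io
from dataclasses import dataclass

SECONDS_PER_DAY = 86400


@dataclass
class TraceRecord:
    timestamp: float      # seconds since 0:00:00 on the first day
    model: str            # "ChatGPT" or "GPT-4"
    request_tokens: int
    response_tokens: int
    total_tokens: int
    log_type: str         # "Conversation log" or "API log"


def parse_row(row):
    """Convert one CSV row (a dict of strings) into a TraceRecord."""
    rec = TraceRecord(
        timestamp=float(row["Timestamp"]),
        model=row["Model"],
        request_tokens=int(row["Request tokens"]),
        response_tokens=int(row["Response tokens"]),
        total_tokens=int(row["Total tokens"]),
        log_type=row["Log Type"],
    )
    # Per the schema, total tokens is request tokens plus response tokens.
    if rec.total_tokens != rec.request_tokens + rec.response_tokens:
        raise ValueError(f"inconsistent token counts: {row}")
    return rec


def day_and_offset(timestamp):
    """Split a timestamp into (day index from 0, seconds into that day)."""
    day = int(timestamp // SECONDS_PER_DAY)
    return day, timestamp - day * SECONDS_PER_DAY


# Made-up rows for illustration only.
SAMPLE = """Timestamp,Model,Request tokens,Response tokens,Total tokens,Log Type
5,ChatGPT,472,18,490,Conversation log
90000,GPT-4,417,276,693,API log
"""

records = [parse_row(r) for r in csv.DictReader(io.StringIO(SAMPLE))]
print(day_and_offset(records[1].timestamp))  # (1, 3600.0)
```

Because timestamps are relative to midnight of the first day, the day index and the offset within the day are enough to group requests by weekday or hour when studying periodicity.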

Data Overview


Figure 1: Weekly Periodicity in BurstGPT.


Figure 2: Daily Periodicity in BurstGPT.


Figure 3: Average Daily Request and Response Throughput in BurstGPT.


Figure 4: Statistics of Request and Response Tokens in BurstGPT.
