Giter VIP home page Giter VIP logo

habvt / infless Goto Github PK

View Code? Open in Web Editor NEW

This project forked from tanklabtju/infless

0.0 0.0 0.0 222.42 MB

The source code of INFless,a native serverless platform for AI inference.

License: GNU General Public License v3.0

Python 0.32% Shell 1.11% Dockerfile 2.18% C# 0.09% Go 48.90% Java 12.92% HTML 1.14% JavaScript 2.84% PHP 0.02% Ruby 0.02% Makefile 0.28% Mustache 0.10% PowerShell 0.11% R 0.01% COBOL 0.04% CoffeeScript 0.01% C 0.01% CSS 11.21% MATLAB 18.71%

infless's Introduction

Source Code for INFless

INFless is a domain-specific serverless platform. It caters to AI inference as BaaS offerings. Function developers can submit their inference source code to INFless; INFless accepts the function code of inference models and automates the deployment and scaling under varying access workloads; Users can get AI services from INFless. INFless guarantees subsecond latency for user requests and achieves high resource efficiency through the elaborately designed resource allocation and function management mechanisms. The current design of INFless relies on docker container for resource management and isolation among inference services, and it runs on Kuberneter cluster.

Contents

  • sourceCode: The source code of INFless for implementation and evaluation.
  • configuration: The cluster configuration file that needs to be loaded when INFless launches.
  • developer: The inference functions that could be deployed into INFless.
  • models: The metadata of inference models evaluated in INFless.
  • profiler: The model performance-resource profiles.
  • workload: The function workload trace used for evaluation
  • scripts: Some scripts used in the evaluation.

INFless Installation

Deployment guide of INFless for Kubernetes is available here.

Notice: INFless is fully implemented with OpenFaaS, which is a serverless frameworks built with Kubernetes. The installation of INFless is similar as OpenFaaS, and we strongly recommend you have some preliminary knowledges about OpenFaaS and understand how it works with Kubernetes. The deployment guide of OpenFaaS for Kubernetes is available here: https://docs.openfaas.com/deployment/kubernetes/.

Inference Function Deployment

Guidance for function developer is available here.

Notice: Once INFless is deployed successfully, developers could use faasdev-cli tools to upload their inference functions and build them as FaaS service.

Load Generator Installation

Guidance to install LoadGen tool is available here.

Notice: Be sure that the deployed inference functions and workload generator work well before evaluating INFless platform.

Result Reproduction

We have provided a complete instructions for the evalution result reproduction, and the commands and scripts files are available here.

For the paper publication, the plotting source code is available here.

Bug report & Question

We have test the deployment guidances and fixed some bugs that have been found. If you have some questions about the reproduction process, please contact us via Email: [email protected]

Reference

[1] Yanan Yang, Laiping Zhao, Yiming Li, Huanyu Zhang, Jie Li, Mingyang Zhao, Xingzhen Chen, Keqiu Li. INFless: A Native Serverless System for Low-latency, High-throughput Inference. The 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'22), Lausanne, Switzerland. Feb 2022.

infless's People

Contributors

ynyang1 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.