Giter VIP home page Giter VIP logo

kylehh / nim-deploy Goto Github PK

View Code? Open in Web Editor NEW

This project forked from nvidia/nim-deploy

0.0 0.0 0.0 1.18 MB

A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deployment.

Home Page: https://build.nvidia.com/

License: Apache License 2.0

Shell 12.22% JavaScript 0.25% Python 11.06% TypeScript 3.77% Smarty 0.99% Dockerfile 0.64% Jinja 1.15% Jupyter Notebook 49.27% HCL 20.65%

nim-deploy's Introduction

Introduction

This repo showcases different ways NVIDIA NIMs can be deployed. This repo contains reference implementations, example documents, and architecture guides that can be used as a starting point to deploy multiple NIMs and other NVIDIA microservices into Kubernetes and other production deployment environments.

Note The content in this repository is designed to provide reference architectures and best-practices for production-grade deployments and product integrations; however the code is not validated on all platforms and does not come with any level of enterprise support. While the deployments should perform well, please treat this codebase as experimental and a collaborative sandbox. For long-term production deployments that require enterprise support from NVIDIA, looks to the official releases on NVIDIA NGC which are based on the code in this repo.

Deployment Options

Category Deployment Option Description
On-premise Deployments Helm
LLM NIM
LLM NIM on OpenShift Container Platform (coming soon)
Open Source Platforms
KServe
Independent Software Vendors
Run.ai (coming soon)
Cloud Service Provider Deployments Azure
AKS Managed Kubernetes
Azure ML
Azure prompt flow
Amazon Web Services
EKS Managed Kubernetes
Amazon SageMaker
Google Cloud Platform
GKE Managed Kubernetes
Google Cloud Vertex AI
Cloud Run
NVIDIA DGX Cloud
NVIDIA Cloud Functions
Documents Deployment Guide
Hugging Face NIM Deployment

Contributions

Contributions are welcome. Developers can contribute by opening a pull request and agreeing to the terms in CONTRIBUTING.MD.

Support and Getting Help

Please open an issue on the GitHub project for any questions. All feedback is appreciated, issues, requested features, and new deployment scenarios included.

nim-deploy's People

Contributors

supertetelman avatar crookedstorm avatar liveaverage avatar gabwow avatar azeltov avatar edemiraydin avatar fortunazhang avatar tmc avatar mayani-nv avatar vikalluru avatar sujituk avatar kylehh avatar angudadevops avatar dfisk avatar dpadmanabhan03 avatar josephwnv avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.