Giter VIP home page Giter VIP logo

ia-llama-2-dev's Introduction

Local Llama-2

Intorduction

This repository contains docker-compose file for running Llama-2 locally. I have tested this on Linux using NVIDIA GPUs (Driver 535.86.05, CUDA version 12.2), your experience may vary on other platforms.

There are three main components to this repository:

  • Huggingface text-generation-inference: we pass the model name to this service. You can change the model name in the docker-compose file.
  • mongodb: this is the database that stores the conversations for the UI.
  • chatbot-ui: this is the UI for the chatbot. The UI handles the prompting, you can customize it by changing the configuration in chat-ui/.env.local. More information is available in the huggingface post and chat-ui issue.

Requirements

Usage

In order to run the chat model locally, you only need to run the following command. Note that the first time you run this command, it will download the docker images for the services. This may take a while depending on your internet connection.

cd local-llm 

docker compose up huggingface_inference # takes a while to download the model
docker compose up mongo_chatui
docker compose up chat_ui


# Later runs can be done with the following command! 
docker compose up

Tested Environment

  • NVIDIA RTX 3090, Driver 535.86.05, CUDA version 12.2
  • 2 x NVIDIA RTX 3090 Ti, Driver 525.125.06 , CUDA version 12.0

Models

  • daryl149/llama-2-7b-chat-hf (unofficial, default)
  • meta-llama/Llama-2-7b-chat-hf (needs approval)
  • georgesung/llama2_7b_chat_uncensored (supposedly it is less politically correct, not tested)

ia-llama-2-dev's People

Contributors

mo-arvan avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.