Giter VIP home page Giter VIP logo

bhanu-pappala / serge Goto Github PK

View Code? Open in Web Editor NEW

This project forked from serge-chat/serge

0.0 0.0 0.0 2.04 MB

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

Home Page: https://serge.chat

License: Apache License 2.0

Shell 3.55% JavaScript 1.51% Python 21.79% TypeScript 2.66% CSS 6.49% HTML 0.47% Smarty 1.80% Dockerfile 1.68% Svelte 60.07%

serge's Introduction

Serge - LLaMA made easy 🦙

License Discord

Serge is a chat interface crafted with llama.cpp for running GGUF models. No API keys, entirely self-hosted!

  • 🌐 SvelteKit frontend
  • 💾 Redis for storing chat history & parameters
  • ⚙️ FastAPI + LangChain for the API, wrapping calls to llama.cpp using the python bindings

🎥 Demo:

demo.webm

⚡️ Quick start

🐳 Docker:

docker run -d \
    --name serge \
    -v weights:/usr/src/app/weights \
    -v datadb:/data/db/ \
    -p 8008:8008 \
    ghcr.io/serge-chat/serge:latest

🐙 Docker Compose:

services:
  serge:
    image: ghcr.io/serge-chat/serge:latest
    container_name: serge
    restart: unless-stopped
    ports:
      - 8008:8008
    volumes:
      - weights:/usr/src/app/weights
      - datadb:/data/db/

volumes:
  weights:
  datadb:

Then, just visit http://localhost:8008, You can find the API documentation at http://localhost:8008/api/docs

🖥️ Windows

Ensure you have Docker Desktop installed, WSL2 configured, and enough free RAM to run models.

☁️ Kubernetes

Instructions for setting up Serge on Kubernetes can be found in the wiki.

🧠 Supported Models

Category Models
Alfred 40B
CodeLLaMA 7B, 13B
Falcon 7B, 7B-Instruct, 40B, 40B-Instruct
LLaMA 2 7B, 7B-Chat, 13B, 13B-Chat, 70B, 70B-Chat, 70B-OASST
Med42 70B
Meditron 7B, 70B
Mistral 7B, 7B-Instruct, 7B-OpenOrca
Neural-Chat 7B-v3.2
Notus 7B-v1
OpenChat 7B-v3.5
OpenLLaMA 3B-v2, 7B-v2, 13B-v2
Orca 2 7B, 13B
PsyMedRP 13B-v1, 20B-v1
Starling LM 7B-Alpha
Vicuna 7B-v1.5, 13B-v1.5
Zephyr 7B-Alpha, 7B-Beta

Additional models can be requested by opening a GitHub issue.

⚠️ Memory Usage

LLaMA will crash if you don't have enough available memory for the model:

💬 Support

Need help? Join our Discord

⭐️ Stargazers

Stargazers over time

🧾 License

Nathan Sarrazin and Contributors. Serge is free and open-source software licensed under the MIT License and Apache-2.0.

🤝 Contributing

If you discover a bug or have a feature idea, feel free to open an issue or PR.

To run Serge in development mode:

git clone https://github.com/serge-chat/serge.git
cd serge/
docker compose -f docker-compose.dev.yml up --build

serge's People

Contributors

dependabot[bot] avatar gaby avatar nsarrazin avatar pabl-o-ce avatar smartappli avatar k0gen avatar mzen17 avatar rakete avatar snxraven avatar axolotlite avatar agronholm avatar fenarksec avatar mms-gianni avatar jsonsmth avatar johncadengo avatar justinguese avatar knight-001 avatar louisoutin avatar mavaa avatar steelalloy avatar migelo avatar noproto avatar paraskevasleivadaros avatar thomasleveil avatar robotdjman avatar security-companion avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.