Comments (5)
I'm currently trying with the new image that has been published. I'll give you the answer in the following minutes.
from serge.
Newly pushed image returns an error in the chat (see below), building it locally and using it fixes the original issue and works.
main: seed = 1679851058
llama_model_load: loading model from '/usr/src/app/weights/ggml-alpaca-7B-q4_0.bin' - please wait ...
llama_model_load: n_vocab = 32000
llama_model_load: n_ctx = 512
llama_model_load: n_embd = 4096
llama_model_load: n_mult = 256
llama_model_load: n_head = 32
llama_model_load: n_layer = 32
llama_model_load: n_rot = 128
llama_model_load: f16 = 2
llama_model_load: n_ff = 11008
llama_model_load: n_parts = 1
from serge.
I have also provided the fix here (look for #71). It shouldn't impact other deployments. If this issue gets fixed, a new docker image can be built and pushed to the repository and it'll be deployable on Docker and K8S
from serge.
@FenarkSEC Can this be close now that #71 was merged?
from serge.
09471f2 have fixed the issue with the newly built image, issue can be closed.
from serge.
Related Issues (20)
- 🚀 [Feature]: Add OpenVino / OpenVino Model Server HOT 1
- 🐛 [Bug]: Web interface does not render properly on mobile devices HOT 1
- 🚀 [Feature]: Add LINCE-Mistal model HOT 1
- 🐛 [Bug]: UI components are missing accessibility labels HOT 2
- 🐛 [Bug]: response text generated by a model sometimes disappears after computer/browser is woken up from a 'sleep' HOT 4
- have a separate page which displays downloaded moddles. HOT 1
- 🚀 [Feature]: Add support for Intel ARC GPUs A750 and A770 (If Possible) HOT 2
- bug: Allow loading .gguf and .bin files HOT 3
- 🚀 [Feature]: add eagle 7b HOT 3
- 🐛 [Bug]: system reachable via ICMP and via Port 8008 but screen "navy blue" with no text whatsoever HOT 14
- 🚀 [Feature]: Add Nous-Hermes-2-Mistral-7B-DPO HOT 8
- 🚀 [Feature]: Add support for uploading files during chat conversation
- 🐛 [Bug]: New install - response keeps repeating the last line HOT 7
- 🚀 [Feature]: add characters HOT 6
- 🚀 [Feature]: Please add Gorilla: Large Language Model Connected with Massive APIs HOT 3
- 🤗 [Question]: Whats the difference between the... models?
- 🚀 [Feature]: Add meta-llama/Meta-Llama-3-70B-Instruct HOT 7
- 🐛 [Bug]: Can't use pre-existing model at /weights HOT 1
- 🐛 [Bug]: DLLAMA_BLAS_VENDOR=OpenBLAS build with pip is not enabling OpenBlas HOT 3
- how to use mixtral-8x7b-v0.1🤗 [Question]: HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from serge.