Comments (3)
With latest builds of llama.cpp, there's no need for exllama v2 anymore in terms of speed or stability.
Some have asked for exllama v2 support, and it might be interesting last month, but after new llama.cpp I no longer will do this as llama.cpp is as fast.
exllama 0.0.18 used in h2oGPT is pretty old, and if it's broken I'll just remove it instead of trying to support exllama v2, it requires too many changes. They didn't make it easy to use.
from h2ogpt.
Yes, for path you just do --base_model=llama --model_path_llama=<path>
.
https://github.com/h2oai/h2ogpt/blob/main/docs/FAQ.md#gguf--ggml
We check for the actual path (e.g. local or absolute) or if it is inside --llamacpp_path
which defaults to ./llamacpp_path
folder where we download stuff too.
from h2ogpt.
@pseudotensor with that run command python generate.py --base_model=TheBloke/Mistral-7B-Instruct-v0.2-GGUF --prompt_type=mistral --max_seq_len=4096
Can I specify the path for the model file ?
Thanks
from h2ogpt.
Related Issues (20)
- Mac OS automatic installation runs errors HOT 2
- about Add Doc to Chat HOT 1
- One Click Installers for MacOS not working on MacMini M2
- Attention sink error with h2oai/h2ogpt-4096-llama2-13b-chat HOT 1
- ImportError: libcudnn.so.8: cannot open shared object file: No such file or directory HOT 6
- Size of Tensor A must match size of Tensor B HOT 6
- auth related feature HOT 9
- Loading a Large model on Multiples GPU system HOT 12
- Permissions in VectorDB HOT 6
- Support for AWS Bedrock HOT 1
- vLLM GROQ issue HOT 1
- Mac OS auto installer doesn't work after manual uninstallation
- RuntimeError: An error occurred while downloading using `hf_transfer`. HOT 1
- python dependency module version tweaks HOT 1
- AWQ Model Works from UI in Windows, But Fails When Launched from .bat File HOT 6
- Rest API for inference locally HOT 5
- HuggingFaceM4/idefics2-8b as vision model
- How to delete content in user_paste HOT 2
- Can you make_db from documents stored on another (for example, PostgreSQL) HOT 2
- No way to save prompt/response pairs in a database?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from h2ogpt.