Comments (9)
Gemma will be supported once this PR is merged: janhq/cortex#446
from jan.
@lededev I'm using a third-party build, for reference only: https://huggingface.co/mlabonne/gemma-7b-it-GGUF
Here is my model.json:
{
  "object": "model",
  "version": 1,
  "format": "gguf",
  "sources": [
    {
      "url": "gemma-7b-it.Q4_1.gguf",
      "filename": "gemma-7b-it.Q4_1.gguf"
    }
  ],
  "id": "gemma-7b-it.Q4_1",
  "name": "gemma-7b-it.Q4_1",
  "created": 1709277804515,
  "description": "",
  "settings": {
    "ctx_len": 4096,
    "embedding": false,
    "prompt_template": "{system_message}\n### Instruction: {prompt}\n### Response:",
    "llama_model_path": "gemma-7b-it.Q4_1.gguf"
  },
  "parameters": {
    "temperature": 0.7,
    "top_p": 0.95,
    "stream": true,
    "max_tokens": 2048,
    "stop": [
      "<endofstring>"
    ],
    "frequency_penalty": 0,
    "presence_penalty": 0
  },
  "metadata": {
    "size": 5496286176,
    "author": "User",
    "tags": []
  },
  "engine": "nitro"
}
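For anyone adapting the model.json above, here is a rough Python sketch of a sanity check before dropping the file into the models folder. It only uses fields that appear in the posted config. The functions `check_model_config` and `render_prompt` are hypothetical helpers written for this example, not part of Jan or Nitro, and the plain-text placeholder substitution is an assumption about how `prompt_template` gets filled in.

```python
import json

# Trimmed copy of the model.json posted above (only the fields used here).
MODEL_JSON = r'''
{
  "sources": [
    {"url": "gemma-7b-it.Q4_1.gguf", "filename": "gemma-7b-it.Q4_1.gguf"}
  ],
  "id": "gemma-7b-it.Q4_1",
  "settings": {
    "ctx_len": 4096,
    "prompt_template": "{system_message}\n### Instruction: {prompt}\n### Response:",
    "llama_model_path": "gemma-7b-it.Q4_1.gguf"
  },
  "parameters": {"max_tokens": 2048},
  "engine": "nitro"
}
'''


def check_model_config(cfg):
    """Return a list of problems (hypothetical checks, not Jan's own validation)."""
    problems = []
    filenames = [source["filename"] for source in cfg["sources"]]
    if cfg["settings"]["llama_model_path"] not in filenames:
        problems.append("llama_model_path does not match any source filename")
    for placeholder in ("{system_message}", "{prompt}"):
        if placeholder not in cfg["settings"]["prompt_template"]:
            problems.append("prompt_template is missing " + placeholder)
    if cfg["parameters"]["max_tokens"] > cfg["settings"]["ctx_len"]:
        problems.append("max_tokens exceeds ctx_len")
    return problems


def render_prompt(template, system_message, prompt):
    """Fill the two placeholders by plain replacement (assumed behaviour)."""
    # str.replace is used instead of str.format so that any other braces
    # in the template are left untouched.
    return template.replace("{system_message}", system_message).replace("{prompt}", prompt)


cfg = json.loads(MODEL_JSON)
problems = check_model_config(cfg)
rendered = render_prompt(cfg["settings"]["prompt_template"], "You are helpful.", "Hi")
print(problems)
print(rendered)
```

An empty list from `check_model_config` only means the file is internally consistent; it says nothing about whether the engine build supports the model architecture.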
Hi @lededev, we tested Gemma on the Jan nightly build ✅ https://discord.com/channels/1107178041848909847/1209906514069028885
Please follow this guide to try it out on our latest nightly build 🙏
https://jan.ai/guides/using-models/import-manually/
Version 0.4.7-293 with the Gemma model is working now.
Thanks.✋
2024-02-22T02:53:20.198Z [NITRO]::Error: llama_model_load: error loading model: unknown model architecture: 'gemma'
llama_load_model_from_file: failed to load model
llama_init_from_gpt_params: error: failed to load model 'E:\AI\jan\datafolder\models\gemma-7b.Q4\gemma-7b.Q4_K_M.gguf'
2024-02-22T02:53:20.206Z [NITRO]::Debug: Load model success with response {}
2024-02-22T02:53:20.207Z [NITRO]::Debug: {"timestamp":1708570400,"level":"ERROR","function":"load_model","line":560,"message":"unable to load model","model":"E:\\AI\\jan\\datafolder\\models\\gemma-7b.Q4\\gemma-7b.Q4_K_M.gguf"}
20240222 02:53:20.198000 UTC 8276 ERROR Error loading the model - llamaCPP.cc:565
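Note that the log above contains a "Load model success" debug line even though the load failed, so grepping for "success" is misleading. Below is a small Python sketch that pulls the actual failure out of lines like these. The line format is taken from the log in this comment; the regular expressions and the `find_load_failures` helper are assumptions made for illustration, not part of Nitro.

```python
import json
import re

# Sample lines copied from the log above.
LOG_LINES = [
    r"2024-02-22T02:53:20.198Z [NITRO]::Error: llama_model_load: error loading model: unknown model architecture: 'gemma'",
    r"2024-02-22T02:53:20.206Z [NITRO]::Debug: Load model success with response {}",
    r'2024-02-22T02:53:20.207Z [NITRO]::Debug: {"timestamp":1708570400,"level":"ERROR","function":"load_model","line":560,"message":"unable to load model","model":"E:\\AI\\jan\\datafolder\\models\\gemma-7b.Q4\\gemma-7b.Q4_K_M.gguf"}',
]

# Assumed layout: "<timestamp> [NITRO]::<Level>: <payload>"
PREFIX_RE = re.compile(r"^\S+ \[NITRO\]::\w+: (.*)$")
ARCH_RE = re.compile(r"unknown model architecture: '([^']+)'")


def find_load_failures(lines):
    """Return (reason, detail) tuples for lines that report a failed model load."""
    failures = []
    for line in lines:
        match = PREFIX_RE.match(line)
        if not match:
            continue
        payload = match.group(1)
        arch = ARCH_RE.search(payload)
        if arch:
            # The engine build does not know this architecture.
            failures.append(("unsupported_architecture", arch.group(1)))
        elif payload.startswith("{"):
            # Some debug lines carry a JSON record with their own "level" field.
            try:
                record = json.loads(payload)
            except json.JSONDecodeError:
                continue
            if record.get("level") == "ERROR":
                failures.append((record["message"], record.get("model")))
    return failures


failures = find_load_failures(LOG_LINES)
for reason, detail in failures:
    print(reason, detail)
```

An `unsupported_architecture` result for 'gemma' points at the engine version rather than at model.json, which matches the later reports that a newer nightly build loads the model.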
I tried to add the model manually, but it fails to start with this error:
"Apologies, something's amiss!
Jan's in beta. Find troubleshooting guides here or reach out to us on Discord for assistance."
@Van-QA I think v0.4.7 already includes your merged PR. Is there an example config file for adding gemma-7b and gemma-7b-it?
Jan v0.4.7-293 also works for me.
@chenshaoju Could you post your model.json? I'm using gemma-7b-it.gguf (34.2 GB). That model is too large: I only get 0.62 t/s on an RTX 4090.
So when can we use Gemma? 4.8?