Giter VIP home page Giter VIP logo

Comments (9)

Van-QA avatar Van-QA commented on July 28, 2024 3

Gemma will be supported after this PR is merged janhq/cortex#446

from jan.

chenshaoju avatar chenshaoju commented on July 28, 2024 2

@lededev 我用的是一个第三方版本,仅供参考: https://huggingface.co/mlabonne/gemma-7b-it-GGUF

以下是 model.json :

{
  "object": "model",
  "version": 1,
  "format": "gguf",
  "sources": [
    {
      "url": "gemma-7b-it.Q4_1.gguf",
      "filename": "gemma-7b-it.Q4_1.gguf"
    }
  ],
  "id": "gemma-7b-it.Q4_1",
  "name": "gemma-7b-it.Q4_1",
  "created": 1709277804515,
  "description": "",
  "settings": {
    "ctx_len": 4096,
    "embedding": false,
    "prompt_template": "{system_message}\n### Instruction: {prompt}\n### Response:",
    "llama_model_path": "gemma-7b-it.Q4_1.gguf"
  },
  "parameters": {
    "temperature": 0.7,
    "top_p": 0.95,
    "stream": true,
    "max_tokens": 2048,
    "stop": [
      "<endofstring>"
    ],
    "frequency_penalty": 0,
    "presence_penalty": 0
  },
  "metadata": {
    "size": 5496286176,
    "author": "User",
    "tags": []
  },
  "engine": "nitro"
}

from jan.

Van-QA avatar Van-QA commented on July 28, 2024 1

hi @lededev, we tested the gemma using Jan nightly build ✅ https://discord.com/channels/1107178041848909847/1209906514069028885
Please follow this guideline to try it out on our latest nightly build 🙏
https://jan.ai/guides/using-models/import-manually/

from jan.

chenshaoju avatar chenshaoju commented on July 28, 2024 1

Version 0.4.7-293 with the Gemma model is working now.

Thanks.✋

image

from jan.

chenshaoju avatar chenshaoju commented on July 28, 2024
2024-02-22T02:53:20.198Z [NITRO]::Error: llama_model_load: error loading model: unknown model architecture: 'gemma'
llama_load_model_from_file: failed to load model
llama_init_from_gpt_params: error: failed to load model 'E:\AI\jan\datafolder\models\gemma-7b.Q4\gemma-7b.Q4_K_M.gguf'

2024-02-22T02:53:20.206Z [NITRO]::Debug: Load model success with response {}
2024-02-22T02:53:20.207Z [NITRO]::Debug: {"timestamp":1708570400,"level":"ERROR","function":"load_model","line":560,"message":"unable to load model","model":"E:\\AI\\jan\\datafolder\\models\\gemma-7b.Q4\\gemma-7b.Q4_K_M.gguf"}
20240222 02:53:20.198000 UTC 8276 ERROR Error loading the model - llamaCPP.cc:565

from jan.

jerry-f avatar jerry-f commented on July 28, 2024

I have tried to add the model manually, but it has not been able to start,
error "Apologies, something's amiss!

Jan's in beta. Find troubleshooting guides here or reach out to us on Discord for assistance.”

from jan.

lededev avatar lededev commented on July 28, 2024

@Van-QA I think v0.4.7 is already merged your PR, are there any example of config file to add gemma-7b and gemma-7b-it?

from jan.

lededev avatar lededev commented on July 28, 2024

Jan v0.4.7-293 also work for me.
@chenshaoju 能把你model.json贴一下吗?我用的是gemma-7b-it.gguf 34.2GB,那个模型太大了,用RTX 4090只有0.62t/s 。

from jan.

tsjslgy avatar tsjslgy commented on July 28, 2024

So when can use gemma, 4.8 ?

from jan.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.