Giter VIP home page Giter VIP logo

Comments (16)

Facico avatar Facico commented on June 13, 2024

你数据的问题吧,你看一下你的数据到底有没有东西

from chinese-vicuna.

xxyNeepu avatar xxyNeepu commented on June 13, 2024

你数据的问题吧,你看一下你的数据到底有没有东西
有数据,数据里面有东西,两次训练的数据是一样的 佬

from chinese-vicuna.

Facico avatar Facico commented on June 13, 2024

哦,是你的数据量太小了,当时打了一个检测tokenizer有没有问题的代码。你把finetune_chat.py的122-127行注释掉就好了

from chinese-vicuna.

xxyNeepu avatar xxyNeepu commented on June 13, 2024

哦,是你的数据量太小了,当时打了一个检测tokenizer有没有问题的代码。你把finetune_chat.py的122-127行注释掉就好了

数据就是咱们GITHUB里面的链接下载的marge.json。我试一试大佬。

from chinese-vicuna.

xxyNeepu avatar xxyNeepu commented on June 13, 2024

哦,是你的数据量太小了,当时打了一个检测tokenizer有没有问题的代码。你把finetune_chat.py的122-127行注释掉就好了

大佬,我试了,还是报错,给您看,我注释的地方截图和报错截图
微信图片_20240305151514
微信图片_20240305151520

from chinese-vicuna.

Facico avatar Facico commented on June 13, 2024

finetune_chat用这个数据:https://huggingface.co/datasets/Chinese-Vicuna/instruct_chat_50k.jsonl
merge那个数据用这个代码:finetune.py
相关的脚本在这里:https://github.com/Facico/Chinese-Vicuna/tree/master/scripts

from chinese-vicuna.

xxyNeepu avatar xxyNeepu commented on June 13, 2024

finetune_chat用这个数据:https://huggingface.co/datasets/Chinese-Vicuna/instruct_chat_50k.jsonl merge那个数据用这个代码:finetune.py 相关的脚本在这里:https://github.com/Facico/Chinese-Vicuna/tree/master/scripts

我运行错脚本了是吧大佬

from chinese-vicuna.

Facico avatar Facico commented on June 13, 2024

嗯,对话和普通SFT的数据格式不一样

from chinese-vicuna.

xxyNeepu avatar xxyNeepu commented on June 13, 2024

嗯,对话和普通SFT的数据格式不一样

那最后训练好,弄那个网页出问答,需要训练那个?

from chinese-vicuna.

Facico avatar Facico commented on June 13, 2024

两个都可以,都写了不同的前端,chat版本就是对话能力好一点可以用chat的那个

from chinese-vicuna.

xxyNeepu avatar xxyNeepu commented on June 13, 2024

两个都可以,都写了不同的前端,chat版本就是对话能力好一点可以用chat的那个
大佬,我在使用finetune.py重新训练的时候,报网络错误,这个是在下载什么,我可以手动下载么。
微信图片_20240305160544

from chinese-vicuna.

Facico avatar Facico commented on June 13, 2024

下模型,可以手动下载。很多问题你不用一个个问我的,google一下都能解决😂,网络不行这个问题问了好多issue我都没回

from chinese-vicuna.

xxyNeepu avatar xxyNeepu commented on June 13, 2024

下模型,可以手动下载。很多问题你不用一个个问我的,google一下都能解决😂,网络不行这个问题问了好多issue我都没回

大佬,我设置好了,然后出这个问题是怎么回事
微信图片_20240307164855

from chinese-vicuna.

Facico avatar Facico commented on June 13, 2024

你的环境装错了吧,装的requirements.txt那个?

from chinese-vicuna.

xxyNeepu avatar xxyNeepu commented on June 13, 2024

from chinese-vicuna.

Facico avatar Facico commented on June 13, 2024

就用这个,我刚试了一下这个环境,bash ./scripts/finetune.sh是没问题的。如果你解决不了,可以调试一下model里面有没有这个接口(对应的应该是peft的接口)

from chinese-vicuna.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.