Comments (2)
您好,llama.family上是最新的参数,我们一直在持续优化,最新的参数之后也会更新到Hugging Face,欢迎👏您持续跟进社区的进展。
from llama-chinese.
+10000086. 请问作者,Hugging Face上模型啥时候更新啊?我看时间还是1个月之前的,可否将最新的模型更新上去嘞?我在family上测试效果感觉还行,本地部署太拉跨了
from llama-chinese.
Related Issues (20)
- Vocab size mismatch causing model convert failure
- 如何创建对话的template?
- TypeError: Object of type Tensor is not JSON serializable HOT 1
- 关于atom-7b-chat长文本微调应如何进行? HOT 5
- ollama上run本地部署的atom-7b-chat模型 报错"error loading model" HOT 2
- llama-2-13b多卡推理报错 RuntimeError: CUDA error: device-side assert triggered Compile with `TORCH_USE_CUDA_DSA` to enable device-side assertions. HOT 4
- 请问llama大模型实践指南纸质版中第一章第18页文献[1]从哪里看?
- 请问有人知道这个问题该怎么解决吗TypeError:object of Type Tensor is not JSON serializable. HOT 1
- Error while deserializing header: HeaderTooLarge
- 中文社区提供的微调代码运行报错,好像是pytest有问题我也不太清楚,有没有大佬帮忙看一下 HOT 1
- 各位大神,为什么 pip install -r requirements.txt 时,里面依赖的版本有些找不到呢,请指教一下。 HOT 1
- 提交到slurm集群导致的端口冲突
- pretrain中的pretrain.sh并不是从头开始训练吧,是增量训练吧
- RuntimeError: FlashAttention only supports Ampere GPUs or newer. HOT 1
- 求大佬帮忙看看,为什么社区的微调代码刚执行到保存了一个checkpoint就报错 HOT 1
- AMD 的显卡可以用起来吗
- 这个需要什么配置合适?用一张A100 显卡跑的7B模型,80G显存用了10G,回答case中的怎么去北京 要60秒才返回结果 HOT 2
- SFT数据格式问题 HOT 1
- pretrain.sh 中预训练样例数据未提供 HOT 1
- 开源模型,做了屏蔽词管理么? HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from llama-chinese.