Comments (12)
For inference, please see the Colab demo; documentation for training will be added later.
from gpt2-ml.
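> Which config.json and vocab.txt correspond to each of the two downloadable pretrained models?
Hello, where can I download these two files for the 5.2G pretrained model trained on the 1.5B corpus?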
Thanks for sharing. Looking forward to the usage documentation.
How do I run training? Are there any requirements for the training corpus?
Thanks for sharing. How is the documentation for training and fine-tuning coming along?
+1
+1
Which config.json and vocab.txt correspond to each of the two downloadable pretrained models?
How is the documentation for training and fine-tuning coming along?
The Colab demo no longer runs at all.
> The Colab demo no longer runs at all.
Yes, it throws an error.
> The Colab demo no longer runs at all.
> Yes, it throws an error.
It worked for a while this morning, but now it is failing again...
Related Issues (20)
- Running on Linux produces garbled output
- How do I fine-tune?
- Can you make a tutorial?
- [Discussion] Why is the max_seq_len in the prepare_data script 1024 +1
- Exception occurred when running on Colab
- [Bug] No such file or directory
- [Discussion] Is it normal that almost no GPU memory is used when running the demo?
- When running on Google Colab, files cannot be downloaded and a 403 error is returned
- Unable to load the trained GPT-2 model
- Why does Colab keep crashing...
- Which vocab/config files correspond to each of the two model files?
- [Discussion] How much GPU memory does inference typically use?
- [Colab usage help] After entering a sentence in the red box, how do I proceed to the next step?
- Could you provide a Google account?
- Can this project run directly on TPU? Using a TPU here does not seem to generate text any faster; it is even slower
- Why is it called a multilingual GPT? Where is the multilingual part? Isn't it a Chinese GPT?
- 'NoneType' object has no attribute 'kernel'
- ERROR 404: Not Found.
- [Discussion] This repo is not maintained anymore, so I created a fork and fixed bugs.
- File "scripts/demo.py", line 7, in <module>