Comments (5)
请问你是怎么对cws_dat文件进行压缩的?@alexlee728
from thulac.
我是用压缩软件测试的。你可以做个开关,训练模型时候对二进制文件压缩成N个包,用到哪个解压哪个。这样能保证非服务器应用也能用。
from thulac.
另外内存过大是应为开始就申请了模型那么大的数据。我觉得要综合考虑,不一定为了快就要全部加载到内存。一般不会有人去对几M以上文本分词,至少说大多数不会。所以可以配置成服务器版本和非服务器版最好,非服务器版要压缩数据并控制内存。
from thulac.
谢谢 @alexlee728
from thulac.
非常感谢您的意见,我们也会考虑这样的做法,尽量减少一开始占用的内存~
from thulac.
Related Issues (20)
- 拼音分词
- readme中有不良链接 HOT 1
- cws_label.txt file not find Segmentation fault: 11
- 文档及模型参数提示信息的一处错误
- 内存占用3个多G,正常吗
- SEGV signal occurred when running program thulac HOT 1
- Buffer overflow occurred during training process
- Alloc-dealloc-mismatch
- 英文分词时候,标点符号分割错误
- 区域,时间等这些模型数据是如何训练出来的,可以修改吗? HOT 2
- core down when training_file is large, how to deal with it?
- how to trainging on private corpus?
- 请问Windows端如何使用
- 用户定义词典有时候不起作用 HOT 1
- 你好请问一下,算法原理是用 的什么模型,各个版本一样么
- mac g++ 编译报错1个语法错误1个async错误1个call private错误 HOT 2
- 作者c++工程水平急需提高 HOT 1
- 训练时提示longer than max如何解决?
- 不支持Tigerlake架构的Intel cpu编译 HOT 1
- 有计划开发ruby版吗?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from thulac.