Comments (4)
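Hello, the main bottleneck right now is loading the dictionary. Once the dictionary is loaded, the number of sentences or characters has little effect on run time. So when segmentation is a preprocessing step, consider segmenting all of your text in one pass instead of sentence by sentence, so that you pay the unavoidable dictionary-loading overhead (3 seconds) only once.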
from jieba-php.
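Hello, which file is the dictionary that gets loaded? I would like to trim the dictionary down. Would that improve the speed? I am writing a web application that returns a result when the user presses a button. The sentences to segment are only about ten characters long, and for the final result 1-2 words would probably be enough.
Thank you!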
Sorry, that was my typo. After switching to the small dictionary, results come back in 1 second, which should be acceptable. Thank you!
@graycatclub Oh, okay, no problem~