Comments (4)
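After trying both the trim and stop filters and finding that neither could filter out the spaces,
my solution was to filter out tokens whose value is a space in the incrementToken method of the JiebaTokenFilter class. I have tested it and it works.
However, I am not familiar with the ES plugin mechanism or with Java, so I am not sure whether this is a good approach.
If this is a workable approach, I will submit a pull request. If there is a better way, please let me know.
Thanks a lot!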
from elasticsearch-analysis-jieba.
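The idea in the comment above (skip whitespace-only tokens inside incrementToken) can be sketched without any Lucene dependency. This is only an illustration under assumptions: WhitespaceSkippingStream, term and drain are hypothetical names, the token source is a plain Iterator<String>, and none of this is the plugin's actual JiebaTokenFilter code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

/** Hypothetical stand-in for a token stream; not the plugin's JiebaTokenFilter. */
final class WhitespaceSkippingStream {
    private final Iterator<String> input;
    private String current;

    WhitespaceSkippingStream(Iterator<String> input) {
        this.input = input;
    }

    /**
     * Same shape as a token filter's incrementToken(): advance to the next
     * acceptable token and return true, or return false when input is exhausted.
     */
    boolean incrementToken() {
        while (input.hasNext()) {
            String candidate = input.next();
            // String.trim() removes leading/trailing characters up to U+0020,
            // so a token made only of ASCII spaces becomes empty and is skipped.
            if (!candidate.trim().isEmpty()) {
                current = candidate;
                return true;
            }
        }
        current = null;
        return false;
    }

    /** The token selected by the last successful incrementToken() call. */
    String term() {
        return current;
    }

    /** Helper for the example: collect every token that survives the filter. */
    static List<String> drain(List<String> tokens) {
        WhitespaceSkippingStream stream = new WhitespaceSkippingStream(tokens.iterator());
        List<String> kept = new ArrayList<>();
        while (stream.incrementToken()) {
            kept.add(stream.term());
        }
        return kept;
    }

    public static void main(String[] args) {
        // The two whitespace-only tokens are dropped.
        System.out.println(drain(Arrays.asList("Elasticsearch", " ", "Jieba", "  ", "plugin")));
    }
}
```

In a real Lucene filter, dropping a token usually also means carrying its position increment over to the next token. Lucene's FilteringTokenFilter base class handles that bookkeeping, so it may be a safer starting point than hand-editing incrementToken.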
In my answer on the synonym issue I used the whitespace tokenizer, so all the spaces were filtered out.
from elasticsearch-analysis-jieba.
I use trim and then remove the resulting empty strings with a stop filter (remove_empty):
"analysis": {
  "analyzer": {
    "norm_jieba_index": {
      "tokenizer": "jieba_index",
      "filter": [
        "lowercase",
        "trim",
        "remove_empty"
      ]
    },
    "norm_jieba_search": {
      "tokenizer": "jieba_search",
      "filter": [
        "lowercase",
        "trim",
        "remove_empty"
      ]
    }
  },
  "filter": {
    "remove_empty": {
      "type": "stop",
      "stopwords": [""]
    }
  }
}
from elasticsearch-analysis-jieba.
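To see why the chain in the settings above works when trim or stop alone does not, here is a rough plain-Java simulation of the three filters applied in order. It is a sketch under assumptions: NormJiebaChainSketch and apply are hypothetical names, the input list stands in for whatever the jieba tokenizer emits, and the real filters run inside Elasticsearch rather than in code like this.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

/** Hypothetical simulation of the lowercase -> trim -> remove_empty chain. */
final class NormJiebaChainSketch {

    static List<String> apply(List<String> tokens) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            // "lowercase": normalize case.
            String t = token.toLowerCase(Locale.ROOT);
            // "trim": strip surrounding whitespace. A token that was only
            // spaces becomes "" here, but trim by itself still keeps it.
            t = t.trim();
            // "remove_empty": a stop filter whose only stopword is "",
            // so the now-empty token is finally dropped.
            if (t.isEmpty()) {
                continue;
            }
            out.add(t);
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(apply(Arrays.asList("Elasticsearch", " ", " Jieba ", "")));
    }
}
```

The order matters in this sketch: the empty-string stopword can only match after trim has reduced a whitespace-only token to "".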
I followed the trim and remove_empty configuration above, and it works now. Thanks.
from elasticsearch-analysis-jieba.