Comments (1)
Notes: fuzzy for edit distance as 2 in stems
Code part | 1 name (exact) | 1 name (fuzzy, no stemming) | 302 names | 906 names |
---|---|---|---|---|
Exact match | ~50ms | ~50ms | ~93ms for 301 uuid and 296 can uuid, 159 matched; exact match for ~255ms | 117ms for 301 uuid and 296 can uuid, 477 matched; exact match for 440ms |
Fuzzy match (svc response) | - | ~12ms | 10ms initial filtering, 540ms: 141 recs for 532ms, 89 recs for 2ms | 150ms for initial filtering, 1721ms: 423 recs for 1561ms, 267 recs for 1ms, 200ms for transfer |
Fuzzy match (db response) | - | ~46ms | 68ms for 180 recs | 2527ms for 540 recs |
Fuzzy total | - | ~63ms | 608ms | 4254ms |
Todos:
- 2527ms for 540 recs is too slow. Explore
Explain
for PostgreSQL. Try to assemblecanonical names
table. - check bug: fuzzy is launched even match is successful?
- optimise creation of inner structures from Slick response (even exact match is too slow)
- optimise Thrift data from Matcher
- limit fuzzy matched data from Matcher
- check UUIDs existence in RAM instead of DB request
from gnindex.
Related Issues (20)
- As a User I want to have cursor on the search form, to save me some time
- Fuzzy match, partial match should not return abbreviations in results HOT 2
- Context should not contain results with empty clades
- Change DataSource and ScoredResults ordering
- preferredDataSourceIds should not correlate with dataSourceIds
- When I search by list of names results do not correspond to names clicked on the left side of the GUI HOT 1
- As a user I want to see in English what kind of match type I am getting for each result. I also want the match type phrase to be color-coded. HOT 1
- A few examples provided with GUI do not work HOT 1
- Introduce "dirty fuzzy matches"
- As a User I want partial match finds also contain results with genus and the lowest infraspecies. HOT 4
- As a User I want to see examples how to use Graphql API
- As a User I want to see `Exact` instead of `Match` result for simplified search HOT 1
- As a User I don't need dataSourceId field from AcceptedName
- As an API User I want a 'boiler plate' query
- As a User I need `matched datasources count` field HOT 1
- As an API user I want to know how many databases carry matches to my name-string HOT 1
- As a User I'd like to have flatten results HOT 1
- As an API User I want to be able to receive errors when my query is not correct HOT 6
- NameFilter: check if `Homo sapiens blabla` works properly
- Fix `dataSources` and `preferredDatasources` interplay HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gnindex.