Comments (6)
It is certainly not set to search in the content
field (not every corpus may have a field with that exact name, so naturally, the application does not make assumptions about this). As far as I can tell, the _all
field is not disabled anywhere in our code, either. Does this sufficiently answer the question?
from i-analyzer.
I see now that the regular query string query actually seems to have features like the ones described in José's manual, i.e. the user can specify which fields to search by typing "field1:this field2:that" (searches for 'this' in field1 OR 'that' in field2).
https://www.elastic.co/guide/en/elasticsearch/reference/2.3/query-dsl-query-string-query.html
from i-analyzer.
O.k. That means I might have taken the wrong decision when choosing simple query string query. Should this be changed? Or should some code be written to enable users to search in specified fields? Or, I think Berit mentioned this possibility: should we offer a button in the user interface to enable users to search in a specified field? (means writing extra code, right?
from i-analyzer.
Part of the confusion may come from the fact that the old search field actually suggested that the user enter field:term
syntax. I don't know whether the new Angularized frontend is still doing that. In any case, it made me believe that the field:term
syntax was available, too, until I read the simple query string documentation.
As to what to do, I think this depends on what use cases we believe need to be supported and which ones not. Searching the whole query in all fields at the same time seems like an important use case to me, which most users will find convenient. At least Ortal-Paz Saar was very positive about this way of searching in a corpus. In fact, she said that she thought that if searching was in all fields, she probably didn't need much other functionality. (@JelmerVNuss do you recall the same?)
A step upwards in terms of user control would be to apply the whole query only to one field or a subset of the fields. This is what would be achieved by providing a <select>
dropdown or a set of checkboxes in the search panel, as Berit suggested. Yes, this would amount to writing extra code, though not necessarily a lot. I am not sure that users actually need this, but maybe they do.
A second step upwards would be to enable users to search some parts of the query in one field and other parts in another field. This is what the field:term
format in the non-simple query string format provides. I doubt that this is actually in the interest of the end user; while it theoretically gives them more control, in practice I suspect that it mostly gives them more ways to shoot themselves in the foot. At least, this is what we saw with Texcavator. While researchers may in principle be intelligent enough to use the additional control, Daniel Kahneman predicts that most of the time they will opt out (unconsciously) of applying that intelligence.
So I think these are the options:
- Accept that searching is always in all fields, just leave the
field:term
syntax out of the documentation. - Allow users to restrict the search to a set of fields, provide user-friendly checkboxes to select the set of fields.
- Switch to non-simple query strings, accept that most users probably either don't use it or use it incorrectly. I'm somewhat against this last option because of KISS.
from i-analyzer.
Options:
- Leaving field term out of documentation is not wise. Ortal-Paz is not a very common user, her texts are like short sentences. One might want to know what newspapers are writing in their article titles for instance. You would want to search in title: only.
- Might be the best option. Offer flexibility and prevent confusion at the same time.
- Kahneman is right..........
from i-analyzer.
This issue is closed per #133. Julian implemented option 2.
from i-analyzer.
Related Issues (20)
- Github actions don't cache
- DBNL: "view book" shows reverse chapter order
- Large downloads for non-authenticated users HOT 1
- List of most frequent bigrams/trigrams HOT 1
- Add lemmatized data to ES index HOT 1
- Check corpus data status
- Robots.txt HOT 1
- View document context is not closing the document pop-up HOT 1
- Add `url` display type
- Improve documentation of `addcorpus.models.Field.language` HOT 2
- Change data model for corpus attachments
- Backend testing: allow postgres database access in elasticsearch indexing fixtures
- Infer upper/lower for RangeFilter and DateFilter HOT 1
- Indexing lifecycles hooks HOT 1
- Merge Corpus and CorpusConfiguration models?
- Default Highlight Behavior HOT 1
- Index corpus without Python definition
- Endless `formatInnerHtml` loop HOT 2
- Word models not accessible to unauthorized users?
- Anonymous users cannot see coverage data
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from i-analyzer.