Comments (5)
Compare with PropS output for the same sentence
from okr.
@rachelvov also ran evaluation on V2 conversions, also got low entity mention extraction evaluation (~0.3).
I think we should try to split noun compound as much as possible. @gabrielStanovsky , is it a significant change of the props_wrapper? we should change and run evaluaion again as you suggested. @shanybar 's evaluation code is merged already I think.
from okr.
@gabrielStanovsky - how long do you estimate that the process of fixing this will take?
I need a decent V2 baseline for my thesis, and currently with node mention score of 0.3 it's really not good enough. I'm trying to decide whether to wait for the fix in the pipeline or to change the evaluation to more flexible (take partial match into account and etc).
Also please let me know if there's any way I can help with this.
from okr.
The problem is that I'm not sure what's the behaviour we want here.
For example, for the sentence "The summer school board council announced the new dates", we currently get this PropS parse
Which has this long entity mention summer school board council
, as opposed to OKR which I assume will break it into 3 entities?
@kleinay and @rachelvov, what do you think the resulting PropSWrapper should look like?
I guess I can restore the dependency version, but I don't think that this would be very helpful, and will not improve the metrics according to V1 (but maybe in V2?)
@rachelvov, I think you can write in your thesis that a simple "noun baseline" achieves good results, but in general it deteriorates the performance for downstream tasks?
from okr.
I think for "summer school board council" V1 gold will be E1- "summer school", E2 - "board", E3 - council. you can't break summer school because it's not actually a school of/for summer, but "board council" is actually a council of boards. But this is indeed a hard case, most cases in the tweet are much easier.
Can we maybe use the baseline I made for the OKR paper? It was very simple (taking all Spacy NER mentions, and separate nouns and adjectives for everything that is not part of a NER mention) and had 85% F1 score.
@gabrielStanovsky not sure I understood what you meant in the last paragraph, let's talk tomorrow (in the lab or on phone).
from okr.
Related Issues (20)
- Artifacts from cleaning? HOT 5
- Weird parsing by PropS wrapper HOT 3
- Problems arising from errors in dependency parsing
- PropSWrapper not including arguments with more than one node HOT 1
- Make sure PropS abstract nodes don't make it to the proposition structure HOT 1
- Fix elements interchange HOT 3
- Consider introducing some sort of unit tests HOT 2
- Python import stop_words fails HOT 2
- determiner in an entity mention HOT 1
- Nominalizations? sentences without any entities or predicates in the pipeline output HOT 1
- template of implicit proposition is misformed HOT 1
- Coreference crucial enhancement - account for predicate's arguments HOT 4
- Implicit proposition argument ordering backwards HOT 1
- Important argument disregarded after implicit propositions added HOT 1
- props_wrapper: symbol A1 in template but not in list of entities; symbol P at entity name HOT 1
- not taking the head of noun compounds HOT 2
- evaluation- some input tweets has no gold annotation
- prefix "A" for Predicate in props_wrapper output HOT 1
- prepositional phrase of nouns is not captured as predicates HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from okr.