Comments (2)
Hi @agr505,
The SentenceEntityResolverApproach annotator is used for model training; SentenceEntityResolverModel is used for downloading a pretrained resolver model. When you train a model with SentenceEntityResolverApproach and save it, you can then call it by using the SentenceEntityResolverModel annotator. All "Model" extensions are used for calling pretrained models in Spark NLP.
- It is like a tree architecture but not the same; there are some other special algorithms behind it. We don't have a paper for Sentence Entity Resolution yet.
- These models are trained with an augmented version of the formal datasets, so the model training data may contain more than one line with the same code but a different concept name and embedding. When you drop a code from the model, all the lines in the model that have that code are dropped. So yes, the embeddings, codes, concept names, etc. will be dropped.
- You can train your own resolver model and update it from time to time with new rows; this would be better in your case. But if the pretrained model's performance is sufficient for you, dropping irrelevant codes from the model may help to increase accuracy, since the models return the closest embeddings in the space.
- The most important stage in resolving terms is entity extraction. A NER model specific to your concept will help you extract the appropriate entities, and this directly affects accuracy.
- You can use assertion status models to check the negation status of the entities.
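To illustrate the points above about duplicate codes and closest embeddings, here is a minimal toy sketch in plain Python. It is not Spark NLP code and does not reflect the resolver's actual internals; the codes, concept names, two-dimensional embeddings, and the helper names `drop_codes` and `resolve` are all hypothetical. It only shows the general idea: several rows can share one code, dropping a code removes every such row, and the nearest remaining embedding then determines the result.

```python
import math

# Hypothetical toy "model": each row is (code, concept_name, embedding).
# Two rows share the code "A01" but differ in concept name and embedding,
# mimicking augmented training data.
ROWS = [
    ("A01", "fever", (1.0, 0.0)),
    ("A01", "high temperature", (0.9, 0.1)),
    ("B02", "cough", (0.0, 1.0)),
    ("C03", "febrile seizure", (0.8, 0.3)),
]


def drop_codes(rows, codes_to_drop):
    """Return the rows whose code is not in codes_to_drop.

    Every row carrying a dropped code is removed, together with its
    concept name and embedding.
    """
    drop = set(codes_to_drop)
    return [row for row in rows if row[0] not in drop]


def resolve(rows, query, k=1):
    """Return the k (code, concept_name) pairs closest to the query embedding.

    Plain Euclidean distance is used here as an assumption for illustration.
    """
    ranked = sorted(rows, key=lambda row: math.dist(row[2], query))
    return [(code, name) for code, name, _ in ranked[:k]]


# Hypothetical embedding of an extracted entity.
query = (0.82, 0.28)

# With all rows present, the irrelevant code "C03" is the closest match.
before = resolve(ROWS, query)

# After dropping "C03", the closest remaining row is returned instead.
after = resolve(drop_codes(ROWS, ["C03"]), query)

print(before)
print(after)
```

In this toy setup, removing a code that is irrelevant to your use case changes which neighbour is returned for the same query, which is why pruning codes can help when the unwanted code sits close to your entities in the embedding space.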
from spark-nlp-workshop.
Also, we will have some SDOH models soon, @agr505. I hope these answers help.