tstanislawek / awesome-document-understanding
A curated list of resources for Document Understanding (DU) topic
Hello,
I know this isn't an issue, but I couldn't find a better place to ask this question.
I guess extracting data from resumes belongs to the key information extraction area.
As a start, I thought about using a plain BERT model and marking in my training data only the entities that I want to extract. But does it also make sense to create a label for the field name "english" (see the example below) to get better results, or to use relation extraction on this kind of semi-structured, form-like data? Or does BERT learn automatically that a grade follows the string "english: "?
For a simple example at extracting grades from a resume:
input:
"english: 2"
-> do I only need to annotate "2", or is it recommended to do something else?
output:
grade_english: "2"
wanted labels:
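To make the two annotation options from the question concrete, here is a minimal sketch of how each could be written as BIO token tags for token classification. The label names (`GRADE_ENGLISH`, `SUBJECT`, `GRADE`), the helper `bio_tags`, and the hand-made tokenization are assumptions for illustration only; they are not taken from the question or from any library.

```python
def bio_tags(tokens, spans):
    """Turn labeled token spans into BIO tags.

    spans is a list of (start, end_exclusive, label) token index ranges.
    Hypothetical helper, written only for this illustration.
    """
    tags = ["O"] * len(tokens)
    for start, end, label in spans:
        tags[start] = f"B-{label}"
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"
    return tags


# Hand-made tokenization of the example input "english: 2".
tokens = ["english", ":", "2"]

# Option A: annotate only the value, with a field-specific label.
option_a = bio_tags(tokens, [(2, 3, "GRADE_ENGLISH")])

# Option B: annotate the field name and the value separately,
# leaving the pairing to a later step (rules or relation extraction).
option_b = bio_tags(tokens, [(0, 1, "SUBJECT"), (2, 3, "GRADE")])

print(option_a)
print(option_b)
```

Option A needs one label per field, which can get unwieldy when there are many fields. Option B keeps the label set small but requires a second step to pair each value with its field name.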
I am sorry, I know this is not an issue, but I don't know where to ask it.
I am parsing PDF documents, and now I have a task to group entities together: I have a chemical and its characteristics. I extract them using NER (Hugging Face transformers) and the quality is OK, but I don't know how to group each chemical with its corresponding characteristics (I don't even know what this task is called). I could write some rules, for example that characteristics appearing after a chemical name belong to that chemical, but sometimes the order is different and some characteristics appear before the name of the chemical.
So I want to use some model to link chemicals to their corresponding characteristics.
Could you please help me and give me some advice on this problem?
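Linking entities to each other in this way is commonly treated as relation extraction. Before training a model for it, a distance-based baseline can handle characteristics that appear either before or after the chemical name. The sketch below is only such a baseline, not a learned relation extraction model: it attaches each characteristic to the chemical with the smallest character gap. The `Entity` class, the label names `CHEMICAL` and `PROPERTY`, and the example offsets are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Entity:
    text: str
    label: str
    start: int  # character offset, inclusive
    end: int    # character offset, exclusive


def gap(a, b):
    """Number of characters between two spans (0 if they overlap)."""
    if a.end <= b.start:
        return b.start - a.end
    if b.end <= a.start:
        return a.start - b.end
    return 0


def group_by_nearest_head(entities, head_label="CHEMICAL"):
    """Attach every non-head entity to the closest head entity.

    Returns a list of (head, [attached entities]) pairs.
    On a tie, the head that appears first in the list wins.
    """
    heads = [e for e in entities if e.label == head_label]
    groups = [(h, []) for h in heads]
    if not heads:
        return groups
    for e in entities:
        if e.label == head_label:
            continue
        nearest = min(range(len(heads)), key=lambda i: gap(heads[i], e))
        groups[nearest][1].append(e)
    return groups


# The second property comes *before* its chemical name.
text = "ethanol bp 78C; bp 65C methanol"
entities = [
    Entity("ethanol", "CHEMICAL", 0, 7),
    Entity("bp 78C", "PROPERTY", 8, 14),
    Entity("bp 65C", "PROPERTY", 16, 22),
    Entity("methanol", "CHEMICAL", 23, 31),
]

groups = group_by_nearest_head(entities)
for head, props in groups:
    print(head.text, [p.text for p in props])
```

If this baseline makes too many mistakes, the same candidate pairs (one chemical, one characteristic) could be fed to a classifier that predicts whether they belong together, using the distance as one of several features.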