doc-analysis / xfund Goto Github PK
View Code? Open in Web Editor NEWXFUND: A Multilingual Form Understanding Benchmark
Home Page: https://arxiv.org/abs/2104.08836
XFUND: A Multilingual Form Understanding Benchmark
Home Page: https://arxiv.org/abs/2104.08836
Excuse me, 1 What tools are used to label your xfund dataset. 2. What tools are used to mark the data set of funsd
In your article you describe relation extraction task. In this task you require as an output set of tuples of three in form of
(head_entity, tail_entity, relation_label). However, in the dataset you publish one may find only relations in form defined by documents/*/document/*/linking
, which allows to create tuples of two (head_entity, tail_entity). Information about relation label is missing. Is it present somewhere else in your dataset, or it was by some mistake omitted when the dataset was published?
Dear all, I am trying to train layoutLM Relation extraction model with FUNSD, as the RE model was pre-trained on XFUN, so I need to convert FUNSD to XFUN format, thus I would like to ask if anyone has any idea on how to convert the FUNSD to XFUN? Very much appreciated your help!
Ps: I have tried to build the conversion script, but am a bit confused with the derivation of word level bboxes, in another word, how can we derive the word level bboxes based on FUNSD? (because the texts in FUNSD is entity level instead of word level)
Hi, may I ask if the annotation rule is the same as FUNSD? There are a number of strange examples.
Is there any way to validate on english dataset?
Hi
I am new to the LayoutXLM model. I have taken the dataset from https://github.com/doc-analysis/XFUND/releases/tag/v1.0
these files de.train.json,de.train.zip,de.val.json,de,val.zip and uploaded to the https://github.com/jyotiyadav94/Relational-Extraction/releases/tag/xfund
whenever I try to access using the below script where the only change is URL I am getting JSONDecode error every time.
I am bit confused why i am always getting this although the dataset is the same.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.