
xfund's People

Contributors

ranpox, wolfshow


xfund's Issues

Annotation tool

Excuse me, two questions: 1. What tool was used to annotate the XFUND dataset? 2. What tool was used to annotate the FUNSD dataset?

Missing semantic relation labels

In your article you describe a relation extraction task. For this task, the required output is a set of triples of the form (head_entity, tail_entity, relation_label). However, in the published dataset one can only find relations in the form defined by documents/*/document/*/linking, which yields only pairs (head_entity, tail_entity). The relation label is missing. Is it present somewhere else in the dataset, or was it omitted by mistake when the dataset was published?
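To make the question concrete, here is a minimal sketch of reading the pairs that the `linking` field provides. It relies only on the `documents/*/document/*/linking` path mentioned above. The function name `extract_link_pairs`, the document-level `id` key, and the assumption that each link is a two-element list of entity ids are illustrative, not confirmed by the dataset authors.

```python
def extract_link_pairs(data):
    """Collect (doc_id, head_id, tail_id) tuples from every `linking` list.

    Assumption: each entry in `linking` is a two-element list
    [head_id, tail_id]. The same link may be stored on more than one
    entity, so a set is used to drop duplicates.
    """
    pairs = set()
    for doc in data.get("documents", []):
        doc_id = doc.get("id")  # assumed key; None if absent
        for entity in doc.get("document", []):
            for link in entity.get("linking", []):
                if len(link) == 2:
                    pairs.add((doc_id, link[0], link[1]))
    return sorted(pairs)


# Tiny hand-written sample, not taken from the real dataset.
sample = {
    "documents": [
        {
            "id": "doc1",
            "document": [
                {"id": 0, "linking": [[0, 1]]},
                {"id": 1, "linking": [[0, 1]]},
            ],
        }
    ]
}
pairs = extract_link_pairs(sample)
```

As the sketch shows, nothing in this path carries a third element, so any relation label would have to come from elsewhere.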

format of zh and ja

Hi, I wanted to know why the zh and ja datasets are split by character rather than word by word.
When building a dataset, couldn't sentences be split into words instead of characters?
Thank you.

How to convert FUNSD to XFUN format?

Dear all, I am trying to train the LayoutLM relation extraction model on FUNSD. Since the RE model was pre-trained on XFUN, I need to convert FUNSD to the XFUN format. Does anyone have an idea of how to do this conversion? I would very much appreciate your help!

PS: I have tried to build a conversion script, but I am a bit confused about how to derive the word-level bboxes. In other words, how can we derive word-level bboxes from FUNSD, given that the text in FUNSD is at entity level rather than word level?
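If only an entity-level box and its text are available, one rough heuristic is to slice the entity box horizontally in proportion to each word's character count. The sketch below is an approximation under that assumption, not the method used to build XFUN. The function name `split_entity_box` and the `[x0, y0, x1, y1]` box layout are illustrative choices. It is also worth checking first whether the annotation files already carry per-word boxes, in which case no estimation is needed.

```python
def split_entity_box(text, box):
    """Approximate word boxes by slicing an entity box horizontally.

    `box` is assumed to be [x0, y0, x1, y1]. Each word receives a width
    share proportional to its character count, and every gap between
    words is counted as one character.
    """
    words = text.split()
    if not words:
        return []
    x0, y0, x1, y1 = box
    total_chars = sum(len(w) for w in words) + len(words) - 1
    unit = (x1 - x0) / total_chars  # width of one character slot
    result = []
    cursor = 0  # character offset of the current word
    for w in words:
        left = x0 + round(cursor * unit)
        right = x0 + round((cursor + len(w)) * unit)
        result.append({"text": w, "box": [left, y0, right, y1]})
        cursor += len(w) + 1  # skip the word and one gap
    return result


word_boxes = split_entity_box("Date of birth", [0, 10, 130, 20])
```

For this input the three words get the horizontal ranges 0 to 40, 50 to 70, and 80 to 130. The estimate ignores font metrics and multi-line entities, so it is only a starting point.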

Getting JSONDecode error

Hi

I am new to the LayoutXLM model. I took the files de.train.json, de.train.zip, de.val.json, and de.val.zip from the dataset release at https://github.com/doc-analysis/XFUND/releases/tag/v1.0
and uploaded them to https://github.com/jyotiyadav94/Relational-Extraction/releases/tag/xfund
Whenever I try to access them using the script below, where the only change is the URL, I get a JSONDecodeError.
I am a bit confused about why I always get this error even though the dataset is the same.

image
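One way to narrow down an error like this is to look at what was actually downloaded before parsing it. A possible cause, though not confirmed for this case, is that the URL returns something other than the JSON file, such as an HTML page. The helper below is a minimal sketch; the name `parse_json_payload` is hypothetical and it is not part of the original script.

```python
import json


def parse_json_payload(raw: bytes) -> dict:
    """Parse downloaded bytes as JSON, reporting a preview on failure."""
    # "utf-8-sig" decodes UTF-8 and drops a leading byte order mark if present.
    text = raw.decode("utf-8-sig")
    try:
        return json.loads(text)
    except json.JSONDecodeError as err:
        # Show the start of the payload so an HTML page or an error
        # message is easy to recognise.
        preview = text[:80].replace("\n", " ")
        raise ValueError(
            f"payload is not valid JSON ({err.msg}); it starts with: {preview!r}"
        ) from err
```

The bytes could come from `urllib.request.urlopen(url).read()`, for example. If the preview starts with `<!DOCTYPE html>` or similar, the link points at a web page rather than at the raw asset.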

Statistics different from paper

Hi,

I found that the ZH statistics in this repo differ from those in the paper but are exactly the same as FUNSD's. Is there a mistake?

[Screenshot from this repo]

[Screenshot from LayoutXLM paper]

[Screenshot from FUNSD paper]
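A quick way to check such statistics independently is to count entities per label directly from a downloaded split. The sketch below assumes each entity under `documents/*/document/*` carries a `label` key, which is an assumption about the file layout. The function name `count_entity_labels` is illustrative.

```python
from collections import Counter


def count_entity_labels(data):
    """Count entities per label across all documents in one split."""
    counts = Counter()
    for doc in data.get("documents", []):
        for entity in doc.get("document", []):
            # Entities without a label are grouped under "<missing>".
            counts[entity.get("label", "<missing>")] += 1
    return counts


# Tiny hand-written sample, not taken from the real dataset.
sample = {
    "documents": [
        {"document": [{"label": "question"}, {"label": "answer"}]},
        {"document": [{"label": "question"}, {}]},
    ]
}
label_counts = count_entity_labels(sample)
```

Running the same count on the loaded zh.train.json and zh.val.json would give numbers to compare against the tables.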
