thiippal / gem-tools Goto Github PK
View Code? Open in Web Editor NEWTools for working with multimodal corpora annotated using the Genre and Multimodality model
Tools for working with multimodal corpora annotated using the Genre and Multimodality model
Pressing 'r' resets the manually drawn areas of interest but does not remove them from the annotation.
Hi,
Can you suggest where to find the file in test_images/ in "04_generate_gem.ipynb" thanks.
This issue arises when the base units are split between multiple layout units, for instance, in the case a sentence extends begins on one page and continues on another.
In the original XQuery script, this issue was resolved using the following code:
let $parent-link := for $b in $base-file//unit[@id = $leaf-xref]/../@id
return tokenize($b, " ")
let $child-link := for $b in $base-file//unit[@id = $leaf-xref]/@id
return tokenize($b, " ")
let $rst-segs := for $se in $rst-segments/*[@id = $segment-ids and (@xref = $child-link or @xref = $parent-link)]/@id
return tokenize($se, " ")
The models seem to be making the repository quite large.
I can think of three potential ways to minimise size. First, If there are many old model versions, maybe just wipe them: https://rtyley.github.io/bfg-repo-cleaner/
Second, there's GitHub's Large File Storage could help: https://git-lfs.github.com/
Finally, find a host for the data and have a download_models()
method?
For instance, if a layout unit "lay-1" is defined to be of type "graphics" in the realization information, and a layout unit "lay-12" is defined to be of type "text", the dictionary of layout units and their realizations is updated with a false value because "lay-12" contains "lay-1".
This can be fixed by verifying the result from the XPath expression against a list of layout units in the document.
The layout graph is not populated with layout leafs unless realization information is present, preventing the use of the script.
I have a question? when I run my data on python and come up with the RST tree and LAYOUT tree. the tree is too big cant fit with the A4 page. what can I do to make the tree fitting page? shall I redraw it? or cut it? secondly, after having tree diagrams after the Run process is done, I don't get any statistics? for example how many elaboration relations? how many 16 Arial font type occur?How can I get the frequencies of the data? please Dr help me with that. here is I tagged a file to see how long is the tree.
. any suggestion is highly appreciated .
It's be great if this project was pip installable:
pip install gem-tools
A simple guide for this is available here:
https://packaging.python.org/distributing/
I might be able to help if need be!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.