marcusklang / docforia
Semistructured Multilayer Document Model
License: Apache License 2.0
When creating a sub document using subDocument() on a document without an id, Docforia throws an exception.
Currently, a workaround is to check the id and set one before calling subDocument():
// Give the document an id if it has none, so subDocument() does not throw.
if (doc.id() == null) {
    doc.setId("<anything here>");
}
doc.subDocument(0, 3);
Hi, Docforia looks very good and I'd like to consider using it. I just have a question about the conceptual model:
Let's say I have two witnesses of a text:
W1: docforia is good
W2: docforia are good
and I have some recorded emendations:
W2-fixed: are --> is
and I would like to express that in one text with multiple layers:
layer 0: docforia is good
layer 1: W2 says is --> are
layer 2, on top of layer 1: I say it should be are --> is
and then on top of layer 0 and 2, I would have all sorts of NLP annotations.
Is that something docforia can model?
Add functions to import from and export to CoNLL-style data formats, such as TSV or whitespace-separated values.
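To make the request concrete, here is a minimal sketch of what reading and writing such data could involve, in plain Java and without any Docforia calls. Everything in it is an assumption for illustration: the class name ConllSketch, the simplified three-column layout (ID, FORM, TAG separated by tabs), and the choice to rebuild the sentence text by joining forms with single spaces. Real CoNLL variants have more columns and their own conventions.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Docforia.
public class ConllSketch {

    /** One token: surface form, tag, and [start, end) offsets into the sentence text. */
    public static final class Token {
        public final String form;
        public final String tag;
        public final int start;
        public final int end;

        public Token(String form, String tag, int start, int end) {
            this.form = form;
            this.tag = tag;
            this.start = start;
            this.end = end;
        }
    }

    /** One sentence: text rebuilt by joining forms with single spaces, plus its tokens. */
    public static final class Sentence {
        public final String text;
        public final List<Token> tokens;

        public Sentence(String text, List<Token> tokens) {
            this.text = text;
            this.tokens = tokens;
        }
    }

    /** Reads tab-separated rows of ID, FORM, TAG; a blank line ends a sentence. */
    public static List<Sentence> parse(String tsv) {
        List<Sentence> sentences = new ArrayList<>();
        StringBuilder text = new StringBuilder();
        List<Token> tokens = new ArrayList<>();
        for (String raw : tsv.split("\n")) {
            String line = raw.trim();
            if (line.startsWith("#")) {
                continue; // skip comment lines
            }
            if (line.isEmpty()) {
                // Sentence boundary: flush what has been collected so far.
                if (!tokens.isEmpty()) {
                    sentences.add(new Sentence(text.toString(), tokens));
                    text = new StringBuilder();
                    tokens = new ArrayList<>();
                }
                continue;
            }
            String[] cols = line.split("\t");
            if (cols.length < 3) {
                throw new IllegalArgumentException("Expected 3 columns: " + line);
            }
            if (text.length() > 0) {
                text.append(' ');
            }
            int start = text.length();
            text.append(cols[1]);
            tokens.add(new Token(cols[1], cols[2], start, text.length()));
        }
        // Flush the last sentence if the input did not end with a blank line.
        if (!tokens.isEmpty()) {
            sentences.add(new Sentence(text.toString(), tokens));
        }
        return sentences;
    }

    /** Writes one sentence back as ID, FORM, TAG rows followed by a blank line. */
    public static String toTsv(Sentence sentence) {
        StringBuilder out = new StringBuilder();
        int id = 1;
        for (Token t : sentence.tokens) {
            out.append(id++).append('\t').append(t.form).append('\t').append(t.tag).append('\n');
        }
        return out.append('\n').toString();
    }
}
```

The character offsets are kept on each token because a layered document model would presumably attach annotations to text ranges. Mapping these tokens onto actual Docforia nodes is left open here, since that part depends on the library's API.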
flattend --> flattened in https://github.com/marcusklang/docforia/tree/master#why
Are you using this in a project? Are there newer/better alternatives?