Comments (5)
Or could you provide some example text files that convert to TEI properly?
from openconvert.
You can do the conversion to TEI online here: http://openconvert.clarin.inl.nl/openconvert/tagger/ui#file
(you need a CLARIN account, which you should be able to get here: https://user.clarin.eu/user/register)
I didn't develop this code, so I'm not sure about the command-line tool, sorry.
@jan-niestadt Thanks. If I want to build my own corpus, how can I combine multiple TEI files into one? In practice, I would like to add one plain-text sentence containing a keyword at a time (converted to TEI with the tools you mentioned above), then upload the TEI to my BlackLab server so that users can query it. This would be useful for scientific writing, since I could then query by keyword.
Hello all, sorry for only catching up today.
- The right command line for conversion from txt to TEI is (txt, not text):
java -jar OpenConvert.jar -from txt -to TEI test/test.txt test/test.tei
- For use with BlackLab (only available in the online version), it is best to enable the tokenizer in OpenConvertWeb.
- To combine TEI files, there is no special tool. The teiCorpus element (http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-teiCorpus.html) may contain an arbitrary number of TEI elements, each containing a document. It also requires a corpus header, but for BlackLab indexing it should be sufficient to start with <teiCorpus>, then cat all the individual files, and then close the teiCorpus element.
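The concatenation described in the last point could be sketched roughly as follows in Python. This is only an illustration, not part of OpenConvert: the function names, the `*.tei` file pattern, and the directory and file names are assumptions. It also strips the `<?xml ...?>` declaration from each file, since a plain cat would leave declarations in the middle of the output, which XML parsers reject. It does not add a corpus header or handle DOCTYPE lines.

```python
import re
from pathlib import Path

# Matches a leading XML declaration such as <?xml version="1.0" encoding="UTF-8"?>
XML_DECL = re.compile(r"^\s*<\?xml\s[^>]*\?>\s*")


def combine_tei(texts):
    """Wrap TEI document strings in one bare <teiCorpus> element (no corpus header)."""
    parts = ["<teiCorpus>"]
    for text in texts:
        # An XML declaration is only allowed at the very start of a file,
        # so drop the one at the top of each individual document.
        parts.append(XML_DECL.sub("", text, count=1).strip())
    parts.append("</teiCorpus>")
    return "\n".join(parts) + "\n"


def combine_tei_files(input_dir, output_path):
    """Combine every *.tei file in input_dir (hypothetical layout) into output_path."""
    paths = sorted(Path(input_dir).glob("*.tei"))
    texts = [p.read_text(encoding="utf-8") for p in paths]
    Path(output_path).write_text(combine_tei(texts), encoding="utf-8")
    return len(paths)
```

For example, `combine_tei_files("tei_out", "corpus.xml")` would write all converted files from a (hypothetical) `tei_out` directory into a single `corpus.xml` that can then be handed to the indexer.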
Currently, I grab the data from the OpenConvert site (submit text, get TEI as output). Since the site may change, I want a local version, which means I need a similar function for converting plain text to TEI. I noticed that you provide openconvert.client.jar; was it designed for this? (In fact, I can't execute it on my server; does it need this openconvert git project?)