Components for use with UIMA
- In package com.kincaidweb.uima.cc
A UIMA AnalysisEngine that serializes the CAS to an Avro file. Each instance of the class writes its batches to separate files, each named with the provided file name followed by a "-" and an incrementing counter.
- FileName: base file name for the output files; each batch file name starts with this value
- DocumentIdField: the Avro field name to use for the document id field
- CasField: the Avro field name to use for the XMI serialized CAS
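
The batch file naming described above can be sketched roughly as follows. This is only an illustration of the scheme (FileName, then "-", then a counter), not the component's actual code: the class `BatchFileNamer` is hypothetical, and the counter's starting value of 0 is an assumption.

```java
// Hypothetical helper, not part of this project: illustrates the naming scheme only.
final class BatchFileNamer {
    private final String fileName; // value of the FileName parameter
    private int counter;           // ASSUMPTION: the counter starts at 0

    BatchFileNamer(String fileName) {
        this.fileName = fileName;
    }

    // Returns the next batch file name: FileName, a "-", then the current counter.
    String next() {
        return fileName + "-" + counter++;
    }
}
```

Under these assumptions, a FileName of `cas.avro` would produce `cas.avro-0`, `cas.avro-1`, and so on, one name per batch.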
- In package com.kincaidweb.uima.cr
A UIMA Collection Reader that reads documents from an Avro file.
- InputFileName: name of the file to read documents from
- DocumentIdField: the Avro field name that contains the document id
- ContentField: the Avro field name that contains the text content of the document
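
To show how DocumentIdField and ContentField relate to a record, here is a minimal sketch that looks up both values by their configured field names. It is not the reader's implementation: the class `DocumentFieldReader` is hypothetical, a plain `Map` stands in for an Avro record so the example has no dependencies, and failing on a missing field is an assumed behavior.

```java
import java.util.Map;

// Hypothetical helper, not part of this project: a Map stands in for an Avro record.
final class DocumentFieldReader {
    private final String documentIdField; // value of the DocumentIdField parameter
    private final String contentField;    // value of the ContentField parameter

    DocumentFieldReader(String documentIdField, String contentField) {
        this.documentIdField = documentIdField;
        this.contentField = contentField;
    }

    // Returns the document id stored under the configured id field.
    String documentId(Map<String, Object> record) {
        return require(record, documentIdField);
    }

    // Returns the document text stored under the configured content field.
    String content(Map<String, Object> record) {
        return require(record, contentField);
    }

    // ASSUMPTION: a missing field is treated as an error rather than an empty document.
    private static String require(Map<String, Object> record, String field) {
        Object value = record.get(field);
        if (value == null) {
            throw new IllegalArgumentException("Record has no value for field: " + field);
        }
        return value.toString();
    }
}
```

With DocumentIdField set to `id` and ContentField set to `text`, a record holding `id = "doc-1"` and `text = "Hello"` would yield the document id `doc-1` and the content `Hello`.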