Skip to main content
Kofax

KTA 7.9.0.1 - Expose the table extraction details provided by OCR engines

3029053

Applies to: 

KTA 7.9.0.1

Solution: 

Expose the table extraction details provided by OCR engines

The extracted raw engine data of the result is stored as a page-level string within the OCR text representation of the .xdc. JSON and XML are provided as the format of the raw data. The data definition can be downloaded as ssdoc-shema3.xsd. This allows the customer to easily process the data as desired. The feature of retrieving the raw result data from the engine is optional. The introduction of a scope of the result data is done so that the amount of data (and thus the size of the .xdc files) can be controlled. If a user needs only the table details, all other raw page data need not be provided. The OCR engine data is stored as text-extension in the KTA document repository. The name of the extension is “Kofax.CEBPM.RawPageData”

 

(attached file ssdoc-shema3.xsd for download)

 

Applies to:  

Product Version
KTA 7.9.0.1