Kofax

Kofax TotalAgility - OmniPage Raw Data

Article # 3034310

Applies to

  • TotalAgility 7.9.0.1 (and higher)

Topic

In KTA v7.9.0.1, a feature was added to the OmniPage recognition engine that exposes the raw OCR/table data in either JSON or XML format.

clipboard_eecc024add32e3900c802447905fdb355.png 

The comparison below shows the page-level properties in the xDoc with this setting enabled and disabled.

Enabled

clipboard_e7d3d1d1553027db7ef3186a2d4f36b21.png

Disabled

clipboard_ea64a2713e74327f746fe93c554995b72.png

 

This property can be accessed in a KTD script, as shown in the following example:

pXDoc.Pages.ItemByIndex(0).RawEngineData

Alternatively, it can be retrieved by calling the GetPageTextExtension() SDK method with the extension name Kofax.CEBPM.RawPageData.
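Because the raw data can be in either JSON or XML format, code that consumes it outside KTA may need to check which format it received before parsing. The following is a minimal sketch in Python, assuming the value has already been retrieved as a string by one of the two methods above. The function name parse_raw_page_data and the sample payloads are hypothetical; the actual schema of the raw data is not described in this article.

```python
import json
import xml.etree.ElementTree as ET


def parse_raw_page_data(raw: str):
    """Parse a raw page data string (hypothetical helper, not a Kofax API).

    Returns ("xml", root_element) or ("json", parsed_object).
    """
    # Drop a byte order mark and leading whitespace before inspecting the text.
    text = raw.lstrip("\ufeff \t\r\n")
    if not text:
        raise ValueError("raw page data is empty")
    # XML documents start with '<'; anything else is treated as JSON.
    if text[0] == "<":
        return "xml", ET.fromstring(text)
    return "json", json.loads(text)


# Placeholder payloads for illustration only, not the real OmniPage schema.
kind_a, data_a = parse_raw_page_data('{"example": []}')
kind_b, data_b = parse_raw_page_data('<example/>')
```

Inspecting the first non-whitespace character avoids attempting one parser and catching its failure, and it does not depend on knowing how the setting was configured.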

 

Note: Enabling this feature comes at a cost in performance and requires additional space in the database.