Char coordinates with SourceContentReuseMode = CRM_ContentOnly and SourceContentReuseMode = CRM_DoNotReuse

Description

Why char coordinates with IObjectsExtractionParams::SourceContentReuseMode = CRM_ContentOnly differ from coordinates with IObjectsExtractionParams::SourceContentReuseMode = CRM_DoNotReuse and IObjectsExtractionParams::SourceContentReuseMode = CRM_Auto?

Solution

When property IObjectsExtractionParams::SourceContentReuseMode = CRM_ContentOnly is used FineReader Engine does not rasterize PDF. In this case, char coordinates can be received only from command descriptions of this PDF. Often they contain rectangles which are bigger than visible text. It can be seen if one opens PDF-file with Adobe Reader and highlights a part of the text— the highlighted area (rectangle in PDF) will be bigger than visible text (rectangle in FREngine). This is the PDF design.

So such behavior is expected and may occur in different PDF.

Was this article helpful?

0 out of 0 found this helpful

Have more questions? Submit a request

Recently viewed