Processing PDFs with a poor-quality text layer

Symptoms

mceclip0.png 
Text from the existing PDF text layer is recognized incorrectly, for example as "                 "

Or it is pre-recognized in FlexiLayout Studio as follows: mceclip3.png

Even if you select such text in the original document and paste it elsewhere, the result may be incorrect. It may look like this: "                 ", or consist of arbitrary symbols, signs, or digits.
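The copy-and-paste check can also be approximated in a script before documents are imported. The following is a minimal sketch, not part of FlexiCapture or FlexiLayout Studio: the function name `looks_garbled` and the 0.7 threshold are assumptions chosen for illustration, and the text is assumed to have already been extracted from the PDF text layer by whatever tool you use.

```python
import unicodedata


def looks_garbled(text: str, min_readable_ratio: float = 0.7) -> bool:
    """Heuristically decide whether text taken from a PDF text layer is unusable.

    Hypothetical helper: both the name and the default threshold are
    illustrative assumptions, not ABBYY settings.
    """
    # A layer that yields only whitespace (or nothing) is treated as unusable.
    if not text.strip():
        return True

    readable = 0
    for ch in text:
        if ch == "\ufffd":
            # The Unicode replacement character signals a failed decoding.
            continue
        category = unicodedata.category(ch)
        # Letters (L), numbers (N), punctuation (P), symbols (S), and
        # ordinary whitespace count as readable; control, private-use,
        # and unassigned characters do not.
        if category[0] in ("L", "N", "P", "S") or ch in " \n\t":
            readable += 1

    return readable / len(text) < min_readable_ratio
```

If the function returns `True` for a document, that document is a likely candidate for the OCR-based processing modes described under Resolution. The heuristic cannot detect a text layer that contains readable but wrong characters, so treat it as a first filter only.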

Cause

This is usually caused by the poor quality of the existing PDF text layer.

Resolution

In FlexiCapture or FlexiLayout Studio, it is recommended to use the "Use OCR only" or "Auto" processing mode for such documents:

1. In FlexiLayout Studio:


2. In the Batch Type properties on the FlexiCapture 12 Project Setup Station -> Image Processing tab (the same setting can be applied to the whole project in Project Properties -> Image Processing tab):
mceclip1.png

3. Or for a selected page:
mceclip2.png
