How image preprocessing works in SDK products?
Once document pages are loaded, SDK offer a variety of image preprocessing options which allow amending the document images and optimize them for the actual recognition step. The goals of this preprocessing are to enhance the document pages as much as possible and reach the best OCR results during the recognition step.
Image preprocessing options:
Cleaning routines to remove noise and garbage from the image
Optimization of images taken by digital cameras and smartphones, e.g. straighten curved text lines
Dual-page splitting by photographed books
Different algorithms for deskew (up to 20 degrees)
The level of control is different, depending on the target audience
Several samples allow the users / admins to select some basic options.
The toolkits like FineReader Engine have a full API for image pre-processing:
For more details, please see the FineReader Engine sample: Image Preprocessing for OCR