Community

OCR great; can I add text to *original* images of pages, without changing those images? Answered

The trial of abbyy OCR is *so great*. But as I test it, I have a problem in the particular files I'm working on: Original test of about 10 pages, the original PDF is 1.7kB. I have a few PDFs of 100s of pages each, which I would do, if I buy abbyy. Each is about 100MB. 

Abbyy nails the ocr, but when I go to save a PDF, all the options seem to involve changing the original images of the pages. By default, it does decrease the size greatly, but the results don't look great. Adding an option -- I think it is precisescan increases size, and improves output. But still not as good as the original. 

It is like abbyy is doing a great job of sharpening up what is there. But fine lines connecting strokes on the characters get lost, making reading tough. images below. The n is particularly bad.

 

If I vary the options to not allow a decrease in quality, the filesize jumps up many times, from 1.7kB to 23MB!

Can't I in some way just add the new ocr text to the *original* images? Or are there any other recommendations?

Images below. TIA!

 

original:

after abbyy:

Was this article helpful?

1 out of 1 found this helpful

Comments

1 comment

Please sign in to leave a comment.