Hello,
I have a big project where i have to scan and OCR a lot of verry complicated documents to archive and make accessible PDFs. I really want very good image resolution as they are visual art documents. The OCR text will be behind the image. Put for best result for the OCR, i seem to have do downscale the resolution, and some times edit the images.
Is there a way, after editing the OCR, to save a PDF with the original HR images and not the LR edited ones used for the OCR?
Comments
4 comments
Hello Valérie,
The issue is already known to our developers. I have forwarded your comment to the existing issue.
Currently, in OCR Editor it is not possible to disable all changes for the images, but you can use PDF Editor if you only need to add a text layer to the original PDF as described in the article: https://support.abbyy.com/hc/en-us/articles/4406209816979-Document-recognition-in-the-PDF-Editor.
In case you need to scan images in OCR Editor, I am afraid I cannot provide you with a workaround at this moment.
Ok, thank you.
Yes i understand how it all work. In my case creating a PDF with the HR original scanned images is very important. But i need to OCR the documents as well as i have to remediate them after to make them totally accessible in CommonLook. Some times i need to transform the image for the OCR to correctly recognize, like downscaling the resolution, but i don't want the changes in the images in the final export of the PDF.
I understand it is not possible at the moment and really hope that in a near future it could eventually have an option at export for "Export with original images" or "Export with edited images". I fell i can't be that complicated! but i'm not a programmer ;)
And, The OCR is not as good in the PDF editor, and i cant control anything, so it's not an option for me.
Valérie,
Thank you for the details described. I've forwarded your suggestion to our R&D team.
Please sign in to leave a comment.