Community

Difference in extracted result with documents in Arabic language Answered

Written by Permanently deleted user

February 13, 2018 13:49
1

We are trying to extract contents of input image "original format.pdf" attached. OCR engine is not extracting all the words in it, for example "بيانات الكفيل"

But if we change the orientation of same image - slightly tilted to the left, the engine is able to extract that particular word "بيانات الكفيل" Refer attachment "adjusted format.pdf"

We need to do this kind of adjustment in orientation to extract the required words. But not able to fix a orientation that will extract all the required words from the input document.

Can you tell us a solution for this?

Was this article helpful?

0 out of 0 found this helpful

Comments

1 comment

Permanently deleted user

March 05, 2018 14:04
Hi,

As we discussed by the email, Cloud OCR SDK has limited functionality, therefore if an image doesn't meet Source Image Recommendations like for example low contrast, the result might not be satisfactory.

In this case, you may perform initial preprocessing by yourself. It may be helpful to increase brightness, exposure and contrast of a source image in a way that the text could be seen well by the human eye and other elements, like prints or background image looks overexposed and almost disappear.

This preprocessing might be done by almost every standard photo editor.

0

Please sign in to leave a comment.

Community

Difference in extracted result with documents in Arabic language Answered

Was this article helpful?

Comments

Didn't find what you were looking for?