How to implement the "Ignore text outside tables" option in Cloud OCR SDK for C# .NET

Hi,

I want to extract only the tables from a PDF into Excel using the Cloud OCR SDK. I am unable to find the "Ignore text outside tables" option in Cloud OCR SDK, although it is available in ABBYY FineReader 12 Professional (see the attached screenshot).

Comments

1 comment

  • Permanently deleted user

    This option is not supported in ABBYY Cloud OCR SDK. ABBYY products differ from one another: they all use sophisticated OCR algorithms, but each is tuned for different scenarios. FineReader 12 is a ready-to-use end-user application that exposes more options for tuning. Cloud OCR SDK is an SDK designed for the most common usage scenarios, where developers retrieve data from an image for further processing on their side.

    As a workaround, you can export the recognition result in the XML format, which contains the recognized text together with its coordinates, and then re-create the output in the format you need on your side, using the information about the detected blocks and their text. Note that this approach may not be easy to implement.
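
A rough illustration of the workaround above, written in Python for brevity (the same logic could be ported to C#). This is only a sketch under assumptions, not ABBYY's own method: it assumes the exported XML marks tables as `block` elements with `blockType="Table"` that contain `row` and `cell` elements, with the text stored in `line`/`formatting` elements and optionally split into per-character `charParams`. Check these names against the XML you actually receive. The helper names (`extract_tables`, `tables_to_csv`) are hypothetical.

```python
import csv
import io
import xml.etree.ElementTree as ET


def local(tag):
    """Return an element's tag name without its XML namespace."""
    return tag.rsplit("}", 1)[-1]


def children(elem, name):
    """Direct children of elem whose local tag name equals name."""
    return [c for c in elem if local(c.tag) == name]


def descendants(elem, name):
    """All descendants of elem whose local tag name equals name."""
    return [e for e in elem.iter() if local(e.tag) == name]


def line_text(line):
    """Join the text of one recognized line.

    Assumption: text is either stored directly in <formatting> or
    split into one <charParams> element per character.
    """
    parts = []
    for fmt in descendants(line, "formatting"):
        chars = children(fmt, "charParams")
        if chars:
            parts.extend(c.text or "" for c in chars)
        else:
            parts.append(fmt.text or "")
    return "".join(parts)


def cell_text(cell):
    """Join all lines of a table cell with single spaces."""
    return " ".join(line_text(l) for l in descendants(cell, "line")).strip()


def extract_tables(xml_string):
    """Return every table as a list of rows, each row a list of cell strings.

    Blocks whose blockType is not "Table" (plain text, pictures) are skipped,
    which approximates "ignore text outside tables".
    """
    root = ET.fromstring(xml_string)
    tables = []
    for block in descendants(root, "block"):
        if block.get("blockType") != "Table":
            continue
        rows = [[cell_text(c) for c in children(row, "cell")]
                for row in children(block, "row")]
        tables.append(rows)
    return tables


def tables_to_csv(tables):
    """Serialize the tables as CSV text, with an empty row after each table."""
    out = io.StringIO()
    writer = csv.writer(out)
    for rows in tables:
        writer.writerows(rows)
        writer.writerow([])
    return out.getvalue()
```

The resulting CSV text can be saved to a file and opened in Excel. The sketch ignores merged cells and does not use the block coordinates, so a layout with spanning cells would need extra handling.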
