コミュニティ

How ABBY is doing OCR for table structure in PDF.

Written by Permanently deleted user

2016年09月15日 14:13
2

We are doing OCR of PDF and exporting to XML. We are not able to understand how ABBYY is treating the table resides on PDF.

この記事は役に立ちましたか？

0人中0人がこの記事が役に立ったと言っています

コメント

2件のコメント

Permanently deleted user

2016年09月16日 12:40
If you use the documentConversion profile, the tables are detected during the document analysis stage. When you export the recognition result to the xml format, the output is described with the following XML scheme. In this case the recognized text is presented in proper hierarchy: document > page > block > region > etc. And the block tag has the blockType attribute, which denotes the type of the block: Text, Table, Picture, Barcode, Separator, SeparatorsBox.

0
Permanently deleted user

2016年09月20日 13:08
Thanks for your answer Oksana

0

サインインしてコメントを残してください。