PDF Text Extraction from Text Layer via OCR SDK

Written by Permanently deleted user

August 23, 2013 17:05

Having looked at PDF-Text Extraction from Text Layer, I can see it's possible to get the underlying text of a PDF document from FineReader Engine 10. Is this possible via ABBYY Cloud OCR SDK at all?

Comments

4 comments

Permanently deleted user

August 27, 2013 14:54
Unfortunately, this feature is not implemented in ABBYY Cloud OCR SDK.

1
Permanently deleted user

August 27, 2013 15:49
Hi,

If you want to extract the text layer, you can use a PDF lib like Poppler or PDFMiner.
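For illustration, here is a minimal sketch of that approach in Python. It assumes either Poppler's `pdftotext` command-line tool is on the PATH or the `pdfminer.six` package is installed; the helper names `pdftotext_command` and `extract_text_layer` are made up for this example and are not part of either library.

```python
import shutil
import subprocess
from typing import List


def pdftotext_command(pdf_path: str, keep_layout: bool = False) -> List[str]:
    """Build a Poppler `pdftotext` command line; "-" sends the text to stdout."""
    cmd = ["pdftotext", "-enc", "UTF-8"]
    if keep_layout:
        # Ask pdftotext to keep the physical page layout as far as it can.
        cmd.append("-layout")
    return cmd + [pdf_path, "-"]


def extract_text_layer(pdf_path: str, keep_layout: bool = False) -> str:
    """Return the embedded text layer of a PDF. No OCR is performed."""
    if shutil.which("pdftotext"):
        result = subprocess.run(
            pdftotext_command(pdf_path, keep_layout),
            capture_output=True,
            check=True,
        )
        return result.stdout.decode("utf-8", errors="replace")
    # Fallback (assumption: pdfminer.six is installed); imported lazily
    # so the Poppler path works without it.
    from pdfminer.high_level import extract_text
    return extract_text(pdf_path)
```

Calling `extract_text_layer("document.pdf")` returns whatever text is stored in the PDF. A scanned PDF with no text layer will come back empty or nearly empty, and that case still needs OCR.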

All the best,

Sam

0
Permanently deleted user

August 28, 2013 13:59
Sadly, I'm really interested in automatic layout detection, which is scarce.

0
Permanently deleted user

August 28, 2013 14:01
What do you mean by "automatic layout detection"? Could you give an example or more detail?

0
