Having looked at PDF-Text Extraction from Text Layer, I can see it's possible to get the underlying text of a PDF document from FineReader Engine 10. Is this possible via ABBYY Cloud OCR SDK at all?
PDF Text Extraction from Text Layer via OCR SDK
Comments
Unfortunately, this feature is not implemented in ABBYY Cloud OCR SDK.
Hi,
If you want to extract the text layer, you can use a PDF library such as Poppler or PDFMiner.
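As a rough sketch of that suggestion, the snippet below reads the text layer with `extract_text` from pdfminer.six (the maintained fork of PDFMiner), assuming it is installed via `pip install pdfminer.six`. The function names `extract_text_layer` and `has_text_layer` are made up for illustration and are not part of any library.

```python
def has_text_layer(text: str) -> bool:
    """Return True if the extracted text contains anything besides whitespace."""
    # Page separators (form feeds) and newlines count as whitespace here.
    return bool(text.strip())


def extract_text_layer(pdf_path: str) -> str:
    """Read the embedded text layer of a PDF; no OCR is performed."""
    # Imported lazily so has_text_layer() works even without pdfminer.six.
    from pdfminer.high_level import extract_text

    return extract_text(pdf_path)
```

Calling `extract_text_layer("document.pdf")` (a placeholder path) should return the embedded text as a single string. If `has_text_layer` returns False for the result, the PDF is probably a scanned image with no text layer, and it would still need OCR.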
All the best,
Sam
Sadly, I'm really interested in automatic layout detection, which is scarce.
What do you mean by "automatic layout detection"? Could you give an example or more detail?