
PDF Text Extraction from Text Layer via OCR SDK

Having read the "PDF Text Extraction from Text Layer" topic, I can see that FineReader Engine 10 can return the underlying text of a PDF document. Is this possible at all via ABBYY Cloud OCR SDK?


Comments

4 comments

  • Permanently deleted user

    Unfortunately, this feature is not implemented in ABBYY Cloud OCR SDK.

  • Permanently deleted user

    Hi,

    If you only want to extract the existing text layer, you can use a PDF library such as Poppler or PDFMiner.
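    As a rough illustration of that suggestion, here is a minimal sketch that assumes the pdfminer.six package (the maintained PDFMiner fork) is installed. The function names `extract_text_layer` and `has_text_layer` are made up for this example; only `pdfminer.high_level.extract_text` comes from the library. It reads whatever text is already embedded in the PDF and does no OCR.

```python
def extract_text_layer(pdf_path: str) -> str:
    """Return the text already embedded in a PDF (no OCR is performed)."""
    # Assumption: pdfminer.six is installed (pip install pdfminer.six).
    # Imported lazily so has_text_layer() stays usable without it.
    from pdfminer.high_level import extract_text
    return extract_text(pdf_path)


def has_text_layer(text: str) -> bool:
    """True if the extracted text holds anything besides whitespace.

    pdfminer separates pages with form feeds, which strip() also removes,
    so an image-only PDF yields an effectively empty string.
    """
    return bool(text.strip())


if __name__ == "__main__":
    import sys

    # Hypothetical usage: python extract_layer.py scanned.pdf report.pdf
    for name in sys.argv[1:]:
        text = extract_text_layer(name)
        if has_text_layer(text):
            print(text)
        else:
            print(f"{name}: no text layer found, OCR would be needed",
                  file=sys.stderr)
```

    If the result is empty, the PDF is probably image-only, and that is the case where an OCR service would still be required.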

    All the best,

    Sam

  • Permanently deleted user

    Sadly, what I'm really interested in is automatic layout detection, which few tools offer.

  • Permanently deleted user

    What do you mean by "automatic layout detection"? Could you give an example or more detail?

