コミュニティ

text layer extraction from PDF without recognition 回答済み

Written by Permanently deleted user

2018年04月19日 04:03
1

Hi All,

We are processing both PDFs with text layers and scanned PDF documents.

If I checked whether there are text layers with Engine :: IsPdfWithTextualContent, I want to extract only the text in the text layer without performing recognition, but I wonder if there are any APIs that can provide such functions.

Thank you.

----

Added:

Would you send me (troublecoder@gmail.com) a c++ hello sample using AddImageFileFromMemory, please?

この記事は役に立ちましたか？

1人中1人がこの記事が役に立ったと言っています

1件のコメント

Permanently deleted user

2018年04月24日 11:36
Hi,

In case some document contains text layer you can process it using ObjectsExtractionParams::SourceContentReuseMode = CRM_ContentOnly setting. Then FRE won't try to recognize this document.

As for your second question, we'll try to help you with this at the nearest time.

1

サインインしてコメントを残してください。

コミュニティ

text layer extraction from PDF without recognition 回答済み

この記事は役に立ちましたか？

コメント

お探しのものを見つけられませんでしたか？