I want to use the language autodetection feature in FREngine. The documentation suggests that I should specify a list of potential languages, but it does not clarify how.
The FREngine User's Guide - Linux, version 11, page 550, says:
When language autodetection is on, the recognition language is detected for each word in the text. It is selected from the list of languages specified in the TextLanguage property.
The OCR Language Auto-Detection page says:
FineReader Engine 11 is the first SDK where a new language detection is implemented, it is part of the “FRDocument Object”
- The recognition language of a document can be automatically detected, but the developer has to specify at least 3 languages that might show up in the document.
The problem is that the IRecognizerParams.TextLanguage property holds only one object, so it's not clear how to specify a list of three or more languages. Could you explain?
Comments
Hi Alexey,
You can assign more than one language to the TextLanguage property by calling RecognizerParams::SetPredefinedTextLanguage with a comma-separated list of language names. This sets multiple languages at once. For example, for three languages the call looks like this:
SetPredefinedTextLanguage("English,Chinese,German");
Please note that each language must be named exactly as shown in the “Internal name” column of the Predefined languages list. This list can be found in Help → Specifications → Predefined Languages.
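If the candidate languages are kept in a list rather than typed by hand, the comma-separated argument can be assembled first. The following is only a minimal sketch in plain C++: joinLanguageNames is a hypothetical helper, not part of FREngine, and it assumes the names are already the internal names from the Predefined languages list. It does not check the names against that list.

```cpp
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper (not an FREngine function): joins internal language
// names into one comma-separated string, with no spaces around the commas.
// Throws std::invalid_argument if a name is empty or itself contains a comma,
// since either would corrupt the resulting list.
std::string joinLanguageNames(const std::vector<std::string>& names) {
    std::string joined;
    for (const std::string& name : names) {
        if (name.empty() || name.find(',') != std::string::npos) {
            throw std::invalid_argument("invalid language name: '" + name + "'");
        }
        if (!joined.empty()) {
            joined += ',';
        }
        joined += name;
    }
    return joined;
}
```

The result would then be passed to SetPredefinedTextLanguage in place of the string literal. How the std::string is converted to the string type the engine expects depends on the API binding in use and is not shown here.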