コミュニティ

Supporting multiple languages (language detection)

We are running into some problems dealing with multiple languages where characters are not being recognised correctly. We receive sets which often contain documents in multiple languages. So for example, of the 10 pages 7 will be in English, 2 in French and 1 in Turkish. All documents have the same layout so we only have one FlexiLayout. Out project currently uses the XML export per page which we then submit to are back office application (per page).

We have added the required languages to the pre-recognition properties of the FlexiLayout however this has not had any effect. Consequently, we have added multiple document definitions to our project using the same FlexiLayout and configuring each language separately in the Regional settings of the Document Definition properties. However only the English language version is used which is the first FlexiLayout in our project.

What we would like to achieve is that FlexiCapture detects the language automatically and then applies the related language dictionary (so recognises the characters correctly) and passes the recognised language to the export XML so we can route the document accordingly in our back office system.
Is this possible, and if so how?

この記事は役に立ちましたか?

1人中1人がこの記事が役に立ったと言っています

コメント

2件のコメント

  • Avatar
    Permanently deleted user
    Hello Struggler,

    For matching necessary layout alternative (or just layout) you should have a few different "key" elements that will detect which one document belongs to.
    To do so please make a few different Required (if you know that it absolutely must be here) or Prohibited (if you know that it absolutely must not be here) elements that are present on English\French\Turkish documents and are not present on the others.
    Also please remove the checkmark "Use first acceptable FlexiLayout" in Document definition list window.

    Edit: you might probably have to make a classificator if a solution mentioned above doesn't work. Please contact me again for details.

    Hope that helps,
    Vladislav
    1
  • Avatar
    Kalyani Bajaj

    Hi, Is the issue of language detection resolved? 

    Even I want to know how to detect language of document in ABBYY FC 11. Based on that fields are detected accordingly. 

    Thanks, 

    Kalyani

    0

サインインしてコメントを残してください。