How to get list of characters for the certain recognition language

Question

How to get the list of characters for the particular recognition language?

Answer

Every recognition language consists of one or more base languages. To list all the characters in any language, use the LetterSet property of the BaseLanguage object:

// C#
TextLanguage textLanguage = Engine.PredefinedLanguages.Find("English").TextLanguage;
BaseLanguage baseLanguage = textLanguage.BaseLanguages[0];

String alphabet = baseLanguage.LetterSet[BaseLanguageLetterSetEnum.BLLS_Alphabet];

Sample output for the English language

-.ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz

Have more questions? Submit a request

Comments

0 comments

Please sign in to leave a comment.