Supported OCR Text/Print Types

ABBYY OCR technology for printed text is available for more than 200 languages, including:

  • European languages (Latin, Cyrillic, Armenian, Greek alphabets)
  • Asian & Middle East languages (Chinese Simplified and Traditional, Japanese, Korean, Thai, Vietnamese, Hebrew, Arabic, Farsi)
  • Gothic languages - an OCR module designed specifically for digitizing and archiving old documents, books and newspapers published in the 19th century.

ABBYY OCR technology is an “omni-font” technology that supports recognition of text printed in many different fonts.

To produce fast, high-quality OCR results, the OCR Engine should be given information about the font types used in the documents.

ABBYY products support the following printed text types:

  • Normal - a common typographic type of text, such as Arial, Times New Roman or Courier.
  • Typewriter - text typed on a typewriter.

  • Matrix - text printed on a dot-matrix printer.

  • Gothic - text printed in Gothic type; select it for Gothic recognition.

  • Index - a special set of characters including only digits written in ZIP-code style.

  • OCR_A - a monospaced font designed specifically for OCR. It is widely used by banks, credit card companies and similar institutions.

  • OCR_B - a font designed specifically for OCR.

  • MICR_E13B - special numeric characters printed in magnetic ink. MICR (Magnetic Ink Character Recognition) characters are found in a variety of places, including personal checks.

  • MICR_CMC7 - a special MICR barcode font (CMC-7) used on bank checks.

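As a rough illustration of passing text type information to an OCR engine, the Python sketch below models the types listed above and joins a selection into a single hint string. The names TextType, text_type_hint and is_plausible_index_result, the string values and the fallback to Normal are all hypothetical and are not taken from any ABBYY API; how a text type is actually passed depends on the product in use. The last helper relies only on the fact that Index text contains digits and nothing else.

```python
from enum import Enum


class TextType(Enum):
    """Printed text types from the list above (string values are illustrative)."""
    NORMAL = "normal"
    TYPEWRITER = "typewriter"
    MATRIX = "matrix"
    GOTHIC = "gothic"
    INDEX = "index"
    OCR_A = "ocr_a"
    OCR_B = "ocr_b"
    MICR_E13B = "micr_e13b"
    MICR_CMC7 = "micr_cmc7"


def text_type_hint(types):
    """Join the selected text types into one comma-separated hint string.

    Duplicates are dropped while keeping first-seen order. An empty
    selection falls back to NORMAL (an assumed default for this sketch).
    """
    unique = list(dict.fromkeys(types))
    if not unique:
        unique = [TextType.NORMAL]
    return ",".join(t.value for t in unique)


def is_plausible_index_result(text):
    """Return True if the text, ignoring whitespace, consists only of ASCII digits."""
    stripped = "".join(text.split())
    return stripped != "" and all(c in "0123456789" for c in stripped)
```

For a batch that mixes typewritten pages and dot-matrix printouts, text_type_hint([TextType.TYPEWRITER, TextType.MATRIX]) yields "typewriter,matrix". A check such as is_plausible_index_result can flag recognized Index fields that contain anything other than digits.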
