FineReader Engine 11 for Linux specification

Requirements

ABBYY FineReader Engine 11 for Linux is designed for glibc version 2.11 and above. If you need ABBYY FineReader Engine for older glibc versions, please contact the ABBYY office serving your region.

For the FineReader Engine dynamic library, the standard libstdc++.so.6 and libgcc_s.so.1 libraries have to be used.

ABBYY FineReader Engine 11 for Linux has been tested on the following operating systems:

  • Fedora 25, 24, 23
  • Red Hat Enterprise Linux 7.2, 6.8
  • openSUSE 42.1
  • SUSE Linux Enterprise Server 12 SP1, 11 SP1
  • Debian GNU/Linux 8.7, 8.0
  • Ubuntu 16.04.1 LTS, 15.10, 14.04.5 LTS, 12.04.5 LTS
  • CentOS 7.1

ABBYY FineReader Engine has been tested on the following cloud computing platforms:

  • Windows Azure
  • Amazon EC2

ABBYY FineReader Engine has been tested in the following virtual environments:

  • Microsoft Hyper-V (only with software protection key)
  • Oracle VM VirtualBox 3, 4 (Windows hosts only)
  • VMware Server 2 (Windows hosts only)
  • VMware ESXi 5
  • Linux KVM/QEMU 2.0.0

In addition, ABBYY offers professional services to adapt the software to other platforms.

PC with an Intel® Pentium® or compatible processor (1 GHz or higher) which supports SSE and SSE2 instruction sets.

Memory:

  • for processing one-page documents — at least 400 MB RAM, 1 GB RAM recommended
  • for processing multi-page documents — at least 1.5 GB RAM, 4 GB RAM recommended
  • for parallel processing — 700 MB * (cores number) + 900 MB RAM
  • for parallel processing of documents in Arabic or CJK languages — 1.5 GB * (cores number) + 1.5 GB RAM
  • for parallel export — at least 4GB RAM

Tmpfs — 4GB + 1GB * (cores number)

Swap size — 4GB + 1GB * (cores number)

Hard disk space: 800 MB for library installation and 100 MB for program operation, plus an additional 15MB for every processing page of a multi-page document.

Languages

Here is the list of the predefined languages that are supported in ABBYY FineReader Engine. Availability of this or that predefined recognition language depends on the availability of the corresponding modules among ABBYY FineReader Engine files.

ABBYY FineReader Engine provides core recognition languages for OCR with full built-in dictionary support. Some recognition languages are available only for OCR, or do not have full built-in dictionary support. See details in the table below.

Recognition language Can be used for OCR Full dictionary support available Can be used for classification Can be used for BCR
Abkhaz +    
Adyghe +    
Afrikaans +    
Agul +    
Albanian +    
Altaic +    
Arabic (Saudi Arabia) + +  
Armenian (Eastern) + +  
Armenian (Grabar) + +  
Armenian (Western) + +  
Avar +    
Aymara +    
Azerbaijani (Cyrillic) +    
Azerbaijani (Latin) + +  
Bashkir + +  
Basic programming language +    
Basque +    
Belarussian +    
Bemba +    
Blackfoot +    
Breton +    
Bugotu +    
Bulgarian + +  
Buryat +    
C/C++ programming language +    
Catalan + +  
Chamorro +    
Chechen +    
Simple chemical formulas +    
Chinese Simplified +   + +
Chinese Simplified and English +   + +
Chinese Traditional +   + +
Chinese Traditional and English +   + +
Chukcha +    
Chuvash +    
For MICR CMC-7 text type +    
Cobol programming language +    
Corsican +    
Crimean Tatar +    
Croatian + +  
Crow +    
Czech + + + +
Danish + + +
Dargwa +    
Numbers +    
Dungan +    
Dutch (Netherlands) + + + +
Dutch (Belgium) +    
For MICR (E-13B) text type +    
English + + + +
Eskimo (Cyrillic) +    
Eskimo (Latin) +    
Esperanto +    
Estonian + + +
Even +   +  
Evenki +    
Faeroese +    
Farsi +   +  
Fijian +    
Finnish + + +
Fortran programming language +    
French + + + +
Frisian +    
Friulian +    
Scottish Gaelic +    
Gagauz +    
Galician +    
Ganda +    
German + + + +
German (new spelling) + +  
German (Luxembourg) +    
Greek + + +
Guarani +    
Hani +    
Hausa +    
Hawaiian +    
Hebrew + +  
Hungarian + + +
Icelandic +    
Ido +    
Indonesian + + +
Ingush +    
Interlingua +    
Irish +    
Italian + + + +
Japanese + + + +
Japanese and English + + + +
Java programming language +    
Kabardian +    
Kalmyk +    
Karachay-Balkar +    
Karakalpak +    
Kasub +    
Kawa +    
Kazakh +    
Khakas +    
Khanty +    
Kikuyu +    
Kirghiz +    
Kongo +    
Korean + + + +
Korean and English + + + +
Korean (Hangul) + +  
Koryak +    
Kpelle +    
Kumyk +    
Kurdish +    
Lak +    
Sami (Lappish) +    
Latin + +  
Latvian + +  
Latvian language written in Gothic script +    
Lezgin +    
Lithuanian + +  
Luba +    
Macedonian +    
Malagasy +    
Malay +    
Malinke +    
Maltese +    
Mansi +    
Maori +    
Mari +    
Maya +    
Miao +    
Minangkabau +    
Russian and English + + + +
Mohawk +    
Mongol +    
Mordvin +    
Nahuatl +    
Nenets +    
Nivkh +    
Nogay +    
NorwegianNynorsk and NorwegianBokmal + + +
Norwegian (Bokmal) + + +
Norwegian (Nynorsk) + + +
Nyanja +    
Occidental +    
Ojibway +    
Old English + +  
Old French + +  
Old German + +  
Old Italian + +  
Old Slavonic +    
Old Spanish + +  
Ossetian +    
Papiamento +    
Pascal programming language +    
Tok Pisin +    
Polish + + + +
Portuguese (Brazil) + + +
Portuguese (Portugal) + + + +
Provencal +    
Quechua +    
Rhaeto-Romanic +    
Romanian + + +  
Romanian (Moldavia) +    
Romany +    
Ruanda +    
Rundi +    
Russian (old spelling) + +  
Russian + + + +
Russian (with accents marking stress position) + + +
Samoan +    
Selkup +    
Serbian (Cyrillic) +    
Serbian (Latin) +    
Shona +    
Sioux (Dakota) +    
Slovak + +  
Slovenian + +  
Somali +    
Sorbian +    
Sotho +    
Spanish + + + +
Sunda +    
Swahili +    
Swazi +    
Swedish + + + +
Tabassaran +    
Tagalog +    
Tahitian +    
Tajik +    
Tatar + +  
Thai + +  
Jingpo +    
Tongan +    
Tswana +    
Tun +    
Turkish + + +
Turkmen +    
Turkmen (Latin) +    
Tuvan +    
Udmurt +    
Uighur (Cyrillic) +    
Uighur (Latin) +    
Ukrainian + + +
Uzbek (Cyrillic) +    
Uzbek (Latin) +    
Vietnamese + +  
Cebuano +    
Welsh +    
Wolof +    
Xhosa +    
Yakut +    
Yiddish +    
Zapotec +    
Zulu +    

* These are compound recognition languages. They are defined internally in FineReader Engine API in the same way as comma-separated list of languages, for example "Japanese,English". We recommend setting the compound recognition language only by means of the comma-separated list. The compound predefined languages are to be removed in future versions.

Supported Image Formats:

The ABBYY FineReader Engine 11 opens and saves image files in the following formats:

Format Extension Open  Save
BMP:
uncompressed black and white
4- and 8-bit — uncompressed Palette
16-bit — uncompressed, uncompressed Mask
24-bit — uncompressed
32-bit — uncompressed, uncompressed Mask
bmp + +
BMP:
4- and 8-bit — RLE compressed Palette
bmp +  
DCX:
black and white
2-, 4- and 8-bit palette
24-bit color
dcx + +
GIF:
black and white — LZW-compressed
2-, 3-, 4-, 5-, 6-, 7-, 8-bit palette — LZW-compressed
gif +  
JBIG2:
black and white
jb2 + +
JPEG:
gray, color
jpg, jpeg, jfif + +
JPEG 2000:
gray — Part 1
color — Part 1
jp2, jpc, j2k + +
PCX:
black and white
2-, 4- and 8-bit palette
24-bit color
pcx + +
PDF (Version 1.7 or earlier) pdf + +
PNG:
black and white, gray, color
png + +
TIFF:
black and white — uncompressed, CCITT3, CCITT4, Packbits, ZIP, LZW
gray — uncompressed, Packbits, JPEG, ZIP, LZW
24-bit color — uncompressed, JPEG, ZIP, LZW
1-, 4-, 8-bit palette — uncompressed, Packbits, ZIP, LZW
(including multi-page TIFF)
tif, tiff + +
TIFF:
black and white — CCITT3FAX
tif, tiff +  

Note: The ABBYY FineReader Engine will not open images larger than 32512*32512 pixels.

Document Saving Formats

  • Rich Text Format (RTF)
  • XML-based Microsoft Office 2007 (DOCX, XLSX and PPTX)
  • PDF, PDF/A (MRC compression for PDF and PDF/A)
  • Comma Separated Values (CSV)
  • HTML. FineReader Engine supports various code pages (Windows, DOS, Mac, ISO) and Unicode (UTF-8) encoding
  • Plain Text (TXT). FineReader Engine supports various code pages (Windows, DOS, Mac, ISO) and Unicode (UTF-16, UTF-8) encoding
  • XML (file format contains recognized text which structure is described with the help of XML)
  • ODT
  • ALTO
  • FB2
  • EPUB
  • vCard

Barcode Types

ABBYY FineReader Engine 11 recognizes the following types of barcodes:

  • 1D: Codabar, Code 128, Code 39, Code 93, Code 32, EAN 8 and 13, Full ASCII Code 39, GS1-128, IATA 2 of 5, Industrial 2 of 5, Interleaved 2 of 5, Matrix 2 of 5, Patch, PostNet, UPC-A, UPC-E.
  • 2D: PDF 417, Aztec, Data Matrix, QR Code, MaxiCode.

Have more questions? Submit a request

Recently viewed