ABBYY FineReader Engine 12 for Linux - Release 4 Update (includes Update 1)
Release date: 16 December 2019
New: 'Compare Documents' Module
- The new 'Compare Documents' Module in ABBYY FineReader Engine provides your software with the ability to quickly and accurately compare two versions of the same document and detect possible differences in their content – for example to compare the originally created contract and its printed and signed version and automatically verify the document’s integrity.
- The technology can analyze documents in different formats, i.e. Microsoft Word, PDF, JPEG, TIF and other document or image formats and can provide significant value to any document processing solution, especially to applications for the legal, government and business sector.
- For easy implementation and demonstration, a new ready-to-use code sample with sample documents is available and can be used in own applications to speed up development work. The sample compares the selected files and, if necessary, saves the detected differences to a file of the specified format.
Higher accuracy for Arabic OCR due to new AI-based algorithms
- To increase recognition accuracy on low-quality images, new smart AI-based algorithms were implemented for recognition of Arabic. In addition to the ‘standard’ OCR processing approach, a newly trained Recurrent Neural Network for Arabic enables End-to-End recognition, thus delivering highly accurate recognition results – even on images with very low quality.
- To optimize recognition quality and processing speed, a new intelligent built-in classifier selects the appropriate processing methodology.
Higher accuracy for Korean OCR due to new AI-based algorithms
- To further increase the recognition accuracy of the Korean language, a new Deep Learning Language model was trained. This intelligent model selects the best recognition variant from different recognition hypotheses based on his ‘knowledge’ acquired during an extensive training on documents in Korean.
- To optimize the balance between the recognition accuracy and OCR processing speed, a new smart built-in classifier selects the usage of this new model only under specific conditions.
Enhanced: MRZ recognition
- The MRZ extraction function was enhanced by new document format enums to accurately attribute extracted data to Optional data and Personal number fields. In addition, the IMrzData has received a new property to inform the system, if a checksum digit for the whole document data is available. (The MRZ recognition functionality was introduced in the Release 3.)
Enhanced: Text-based classifier with advanced security of training data
- To further increase the security and protection of information contained in training documents for the text-based classification , hashing algorithms were implemented. Using information from checksums eliminates the possibility to recover information from the sample documents.
Enhanced: Classification Demo Sample - now with Office format documents
- To reflect the FineReader Engine’s capability of processing Office documents in the classification process, the provided Demo Sample for classification was enhanced and allows now to display Office documents in the classification results (in addition to PDFs and image formats). Furthermore, the newly added sample documents can be used to test the classification capabilities.
Improved: Document layout preservation
- To improve the detection and recreation of document layout, a new 'single-column' document model was introduced that provides more exact detection and analysis of tables and charts and significantly improves the detection and recreation of document layout.
Enhanced: Java wrapper documentation
- To simplify the usage of the API, the documentation of the ABBYY FineReader Engine 12 has been extended and the documentation for the Java wrapper is now provided in JavaDoc format in addition to the HTML and PDF formats.
New code sample for document comparison
- The extensive code samples library was extended by a new sample that allows testing and demonstrating the ability to compare two versions of the same document and detect differences in their content
Improved: Developer's Help documentation
- The Developer’s Help of the FineReader Engine 12 has been extended by additional information about different possibilities of licensing the SDK, describing the individual types of licensing options in an easy-to-understand comparison table.
GetEngine function was deprecated in R2 ⇒ To load the Engine object, please use the InitializeEngine function. It provides the unified Engine loading procedure for all license types (including the Online License).
Customers updating from previous versions of FineReader Engine such as version 9, 10, or 11 as well as upgrading from FineReader Engine 12 Release 1 to Release 2 and higher who use the GetEngine function would receive an error message if they keep using it in later releases. Please update your code and replace the GetEngine with the InitializeEngine function.
License name change: To better reflect its functionality, the license type ‘Cloud License’ was renamed into ‘Online License’ in the R3 for Windows. This license supports deployment in virtual & cloud environments, usage with Docker containers as well as on premise installations.