I am going to capture data from invoices, bills, questionnaires, application forms, and some other documents. Should I perform OCR and search for field labels in it?
There is no need to recognize the whole document and search for the data in it. Instead, you can recognize only certain text fields of a document and capture data from these fields directly into an information system or database. Please refer to the "How to Recognize Text Fields" article.
Is there an example of how to implement this? I think example code and tutorials are lacking in your documentation.
To test field-level recognition, you can use the ConsoleTest application from the .NET sample code.
To recognize a single text field, call
ConsoleTest.exe --asTextField [common options] <source_dir|file> <target_dir>
It performs recognition via the processTextField call.
Common options:
--lang=<languages>: Recognize with the specified languages. Examples: --lang=English or --lang=English,German,French
--out=<output format>: Create output in the specified format: txt, rtf, docx, xlsx, pptx, pdfSearchable, pdfTextAndImages, xml
--options=<string>: Pass additional arguments.
For example:
ConsoleTest.exe --asTextField --lang=English --options=region=0,0,200,50 D:\1.jpg D:\result
To recognize several text fields in one request, call
ConsoleTest.exe --asFields <source_file> <settings.xml> <target_dir>
It performs recognition via the processFields call. Processing settings should be specified in an XML file. A sample XML file can be found on GitHub.
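If you need to script the single-field call above for several images or regions, a small wrapper can assemble the command line for you. The following is only a sketch in Python, not part of the SDK samples: the function name build_text_field_command is hypothetical, the region is assumed to be given as left, top, right, bottom in pixels, and ConsoleTest.exe is assumed to be built and on the PATH.

```python
def build_text_field_command(image, target_dir, region, lang="English",
                             exe="ConsoleTest.exe"):
    """Build the argument list for one ConsoleTest --asTextField run.

    `region` is a 4-tuple; the (left, top, right, bottom) order is an
    assumption based on the region=0,0,200,50 example in this thread.
    """
    left, top, right, bottom = region
    return [
        exe,                      # hypothetical location: assumed to be on PATH
        "--asTextField",          # recognize a single text field
        f"--lang={lang}",         # one language or a comma-separated list
        f"--options=region={left},{top},{right},{bottom}",
        str(image),
        str(target_dir),
    ]


if __name__ == "__main__":
    cmd = build_text_field_command(r"D:\1.jpg", r"D:\result", (0, 0, 200, 50))
    # Show the command that would be executed.
    print(" ".join(cmd))
```

To actually run it, pass the returned list to subprocess.run(cmd, check=True) on a machine where ConsoleTest.exe is available. Building the arguments as a list rather than one string avoids quoting problems with paths that contain spaces.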
Can I have the same code for Android?
Anastasia, thanks a lot for the link to https://github.com/abbyysdk/ocrsdk.com/tree/master/SampleData!