The method starts the processing task with the specified parameters.
Customize the following request URL according to your application processing location:
[POST] https://<PROCESSING_LOCATION_ID>.ocrsdk.com/v2/processDocument
This method allows you to process several images using the same settings and obtain the recognition result as a multi-page document. You can upload several images to one task using the submitImage method.
It is also possible to specify up to three file formats for the result, in which case the server response for the completed task will contain several result URLs.
Only a task with the Submitted, Completed, or NotEnoughCredits status can be started using this method.
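As a rough illustration of the request described above, the following Python sketch builds the request URL and checks the status rule. It assumes that the parameters are passed in the query string; `cloud-eu` and the task identifier are placeholder values, and `can_start` and `build_process_document_url` are hypothetical helper names, not part of the API. Authentication and the HTTP call itself are left out.

```python
from typing import Mapping, Optional
from urllib.parse import urlencode

# Statuses from which a task can be started, as listed above.
STARTABLE_STATUSES = {"Submitted", "Completed", "NotEnoughCredits"}


def can_start(task_status: str) -> bool:
    """Return True if a task in this status can be started with processDocument."""
    return task_status in STARTABLE_STATUSES


def build_process_document_url(
    location_id: str,
    task_id: str,
    params: Optional[Mapping[str, str]] = None,
) -> str:
    """Build the processDocument URL for the given processing location.

    Assumption: parameters travel in the query string of the POST request.
    """
    query = {"taskId": task_id}
    query.update(params or {})
    base = f"https://{location_id}.ocrsdk.com/v2/processDocument"
    return f"{base}?{urlencode(query)}"


url = build_process_document_url(
    "cloud-eu",   # placeholder processing location ID
    "0123abcd",   # placeholder task identifier
    {"language": "English,French", "exportFormat": "pdfa,txt,xml"},
)
print(url)
print(can_start("Submitted"))
```

The resulting URL would then be sent with an HTTP POST by whichever client library the application already uses.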
Parameters
Parameter | Is required | Default value | Description |
---|---|---|---|
taskId | Yes | No | Specifies the identifier of the task. If the task with the specified identifier does not exist or has been deleted, an error is returned. |
language | No | "English" | Specifies the recognition language of the document. This parameter can contain several language names separated with commas, for example "English,French,German". See the list of available recognition languages. |
profile | No | "documentConversion" | Specifies a profile with predefined processing settings. It can be one of the following:
|
textType | No | "normal" | Specifies the type of the text in the document. This parameter may also contain several text types separated with commas, for example "normal,matrix". The following values can be used:
|
imageSource | No | "auto" | Specifies the source of the image. It can be either a scanned image or a photograph taken with a digital camera. Special preprocessing operations can be performed on the image depending on the selected source. For example, the system can automatically correct distorted text lines, poor focus, and poor lighting in photos. The value of this parameter can be one of the following:
|
correctOrientation | No | "true" | Specifies whether the orientation of the image should be automatically detected and corrected. It can have one of the following values:
|
correctSkew | No | "true" | Specifies whether the skew of the image should be automatically detected and corrected. The value can be either true or false. |
readBarcodes | No | "true" for xml export format and "false" in other cases | Specifies whether barcodes must be detected on the image, recognized, and exported to the result file. The value can be either true or false. |
exportFormat | No | "rtf" | Specifies the export format. This parameter can contain up to three export formats, separated with commas (example: "pdfa,txt,xml"). The available formats are:
If either of the XML export formats is selected, barcodes are recognized on the image and saved to the output XML regardless of the profile used for recognition. Please note that setting multiple export formats does not affect the cost of task processing. |
xml:writeFormatting | No | "false" | Specifies whether the paragraph and character styles should be written to an output file in XML format. This parameter can be used only if the exportFormat parameter contains the xml or xmlForCorrectedImage value. The parameter can have one of the following values:
|
xml:writeRecognitionVariants | No | "false" | Specifies whether the character recognition variants should be written to an output file in XML format. This parameter can be used only if the exportFormat parameter contains the xml value. The parameter can have one of the following values:
|
xml:writeWordRecognitionVariants | No | "false" | Specifies whether the collections of word recognition variants should be written to an output file in XML format. This parameter can be used only if the exportFormat parameter contains the xml or xmlForCorrectedImage value. The parameter can have one of the following values:
|
pdf:writeTags | No | "auto" | Specifies whether the result must be written as tagged PDF. This parameter can be used only if the exportFormat parameter contains one of the values for export to PDF. It can have one of the following values:
|
description | No | "" | Contains the description of the processing task. Cannot contain more than 255 characters. |
txtUnstructured:paragraphAsOneLine | No | "false" | Specifies whether each paragraph in the recognized text is exported as one line. The parameter can have one of the following values:
|
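Several constraints in the table can be checked on the client before a request is sent. The sketch below is one possible way to do so in Python, using only limits stated above: at most three export formats, a description of at most 255 characters, and the export formats that each `xml:` parameter depends on. The helper names (`split_list`, `default_read_barcodes`, `check_parameters`) are hypothetical, parameter names are assumed to be written without spaces around the colon, and treating both XML formats as enabling barcode reading by default is an interpretation of the table, not confirmed server behavior.

```python
from typing import Dict, List

MAX_EXPORT_FORMATS = 3
MAX_DESCRIPTION_LENGTH = 255
XML_FORMATS = {"xml", "xmlForCorrectedImage"}

# Export formats that each xml:* parameter requires, as stated in the table.
REQUIRED_FORMATS = {
    "xml:writeFormatting": XML_FORMATS,
    "xml:writeRecognitionVariants": {"xml"},
    "xml:writeWordRecognitionVariants": XML_FORMATS,
}


def split_list(value: str) -> List[str]:
    """Split a comma-separated parameter value into its items."""
    return [item.strip() for item in value.split(",") if item.strip()]


def default_read_barcodes(export_formats: List[str]) -> str:
    """Default for readBarcodes: "true" with an XML export format, else "false"."""
    return "true" if XML_FORMATS.intersection(export_formats) else "false"


def check_parameters(params: Dict[str, str]) -> List[str]:
    """Return a list of problems found in a set of processDocument parameters."""
    problems = []
    # exportFormat falls back to its documented default, "rtf".
    formats = split_list(params.get("exportFormat", "rtf"))
    if len(formats) > MAX_EXPORT_FORMATS:
        problems.append(
            f"exportFormat lists {len(formats)} formats; the limit is {MAX_EXPORT_FORMATS}"
        )
    if len(params.get("description", "")) > MAX_DESCRIPTION_LENGTH:
        problems.append(
            f"description is longer than {MAX_DESCRIPTION_LENGTH} characters"
        )
    for name, required in REQUIRED_FORMATS.items():
        if name in params and not required.intersection(formats):
            problems.append(
                f"{name} needs exportFormat to contain one of: {', '.join(sorted(required))}"
            )
    return problems


print(check_parameters({"exportFormat": "pdfa,txt,xml,rtf"}))
print(check_parameters({"exportFormat": "xml", "xml:writeFormatting": "true"}))
print(default_read_barcodes(["pdfa", "txt"]))
```

An empty list means that none of these particular checks failed; the server remains the authority on whether a request is valid.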
Status codes and response format
General status codes and response format of the method are described in HTTP Status Codes and Response Formats.
Comments
Ben Meddeb Lotfi
Hi, how to have a Basic-XML (Text on One Line) with API