Cloud OCR processFields extra spaces

Written by Permanently deleted user

July 15, 2015 10:47

I'm using processFields in the Cloud OCR API to extract text from a PDF document.

The output XML contains extra spaces, like this:

<value>MLTMRC5 7E2 9H2 6 4O</value>

while the PDF text field is shown in the attached screenshot (bd5d5ec7-8362-48fa-a539-a74500bff691_cattura.jpg).

How can I prevent this?

Comments

1 comment

Permanently deleted user

July 15, 2015 18:36
Please try adding the following elements to the description of this text field in your XML file with recognition settings:

<oneTextLine>true</oneTextLine>
<oneWordPerLine>true</oneWordPerLine>

The oneTextLine element specifies whether the field contains only one text line, and the oneWordPerLine element specifies whether the field contains only one word in each text line.

In your case, the letterSet and regExp elements of the text tag may also be useful. Please see more details here.
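For illustration, here is a minimal Python sketch that builds such a text field description with the elements mentioned above. Only the element names (text, oneTextLine, oneWordPerLine, letterSet, regExp) come from this thread. The helper name build_text_field, the id and coordinate attributes, their values, and the order of the child elements are assumptions and should be checked against the processFields settings schema. No regExp value is supplied because its syntax is not covered here.

```python
import string
import xml.etree.ElementTree as ET


def build_text_field(field_id, left, top, right, bottom,
                     letter_set=None, reg_exp=None):
    """Build a <text> field description as an ElementTree element.

    Hypothetical helper: the attribute names and child order are
    assumptions, not taken from the thread.
    """
    field = ET.Element("text", {
        "id": field_id,
        "left": str(left),
        "top": str(top),
        "right": str(right),
        "bottom": str(bottom),
    })
    # Optionally restrict the characters the recognizer may output.
    if letter_set:
        ET.SubElement(field, "letterSet").text = letter_set
    # Optionally constrain the field value; the caller supplies the pattern.
    if reg_exp:
        ET.SubElement(field, "regExp").text = reg_exp
    # The two settings suggested in the answer above.
    ET.SubElement(field, "oneTextLine").text = "true"
    ET.SubElement(field, "oneWordPerLine").text = "true"
    return field


# Placeholder id and coordinates; uppercase letters and digits only.
field = build_text_field(
    "code", 100, 200, 900, 260,
    letter_set=string.ascii_uppercase + string.digits,
)
fragment = ET.tostring(field, encoding="unicode")
print(fragment)
```

The printed fragment would then be placed inside the page description of the settings file that is sent along with the processFields request.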

Hope this will help!