mpraining: Some junk characters are detected, e.g. "PINOT NOIR", which is the first line of the result for the attached image. Another one is "Joan d’Anguera". We need the text with such junk characters removed. Is there any option to avoid such characters?
junk chars are detected
Comments
3 comments
The issue is not reproduced on our side. We recommend recognizing your image with the URL "http://cloud.ocrsdk.com/processImage?language=english,french&profile=textextraction&exportFormat=txt". In this case the result is
Hello Anastasia, thanks for your feedback. I got it working better, but there is still one thing I do not understand. Please check the following entry, which I got from my result:
Here actually, we expect something like this,
But the result is not right. Can you please check why this is happening? Otherwise my algorithm to detect this line will fail because of this OCR mistake. I also checked the XML format, but it is not suitable for us. I'm just expecting the contents as they appear in the image. Please check and help me.
The automatic analysis recognizes this picture as several separate areas, which is why the text order is not left to right and top to bottom. Unfortunately, it is currently impossible to export the text in this order automatically. So the only way to get this order is to sort the words by their coordinates on your side.
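For illustration, sorting by coordinates on the client side could look roughly like the Python sketch below. It assumes you have already extracted a bounding box for every word from an export that carries coordinates; the `Word` class, the `reading_order` function, and the sample coordinates are all hypothetical and not part of the OCR SDK. Words are grouped into lines by their vertical centers, then each line is sorted left to right.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Word:
    """Hypothetical container for one recognized word and its bounding box."""
    text: str
    left: int
    top: int
    right: int
    bottom: int

    @property
    def center_y(self) -> float:
        return (self.top + self.bottom) / 2


def reading_order(words, tolerance=None):
    """Return text lines ordered top to bottom, words ordered left to right."""
    if not words:
        return []
    if tolerance is None:
        # Assumption: words on one line differ vertically by less than
        # half the median word height.
        tolerance = median(w.bottom - w.top for w in words) / 2
    lines = []
    for word in sorted(words, key=lambda w: w.center_y):
        # Stay on the current line while the vertical center is close
        # to the previous word's center; otherwise start a new line.
        if lines and abs(word.center_y - lines[-1][-1].center_y) <= tolerance:
            lines[-1].append(word)
        else:
            lines.append([word])
    return [
        " ".join(w.text for w in sorted(line, key=lambda w: w.left))
        for line in lines
    ]


# Made-up coordinates, deliberately out of reading order.
sample = [
    Word("d’Anguera", 60, 51, 140, 71),
    Word("NOIR", 70, 12, 120, 32),
    Word("Joan", 10, 50, 50, 70),
    Word("PINOT", 10, 10, 60, 30),
]
print(reading_order(sample))
```

The default tolerance is only a heuristic; for skewed photos or mixed font sizes it would likely need tuning, or the image would need deskewing first.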