コミュニティ

Capturing whole page text

Just wondering, what is the best way to capture text of whole page using FlexiCapture 10. We got Distributed Flexicapture 10 for invoice processing now want to use it for another task where whole page needs to be convert into string by eliminating all separators, images and special characters.

この記事は役に立ちましたか?

0人中0人がこの記事が役に立ったと言っています

コメント

5件のコメント

  • Avatar
    Permanently deleted user


    farukhali,

    So you want to do a full text OCR of the page? i.e. like Recognition server or FineReader retail where you can convert an image to a Txt file? If that is the case, it would be better to use the other product but you could sort of do this by creating a large capture area for one field. i.e. in FlexiLayout Studio, use the Region and have it capture the whole page area. Then on the FlexiCapture side, you can use the autocorrect function to remove all your characters you don't want. From there I would create a special export that would only export this one field value.


    0
  • Avatar
    Permanently deleted user
    Thanks for the reply. How about using Paragraph instead of Region? I tried it seems to working pretty well.

    BTW is there any way to detect human snap or a particular logo on an image?
    0
  • Avatar
    Permanently deleted user
    One more thing where can I use autocorrect function. Any documentation on this. Appreciate your help!
    0
  • Avatar
    Permanently deleted user
    Farukhali,

    Paragraph can work too but I'm worry about an image or something getting picked up in the middle of the document and then you're missing some other text. Region will get everything. Otherwise if you are worry about getting photos OCR as bad data, you could just the paragraph and put it in a repeatable groups

    As for detecting logos, it can't do that. Basically you can detect the size but it can't distinguish between the other different type of photo.

    Autocorrection Function is in FlexiCapture. On the field, go to the properties and select Data Type. Autocorrection is the middle option. It will let you replace the characters you don't want with empty values.
    0
  • Avatar
    Permanently deleted user
    Thanks Sushi. I'll check these.
    0

サインインしてコメントを残してください。