Hi,
I have implemented a layout which searches for Id in the documents provided. I have a multi-page document
the documents are random (combinations- one search string and one additional page or multiple pages with search string and multiple annex pages)
I have enabled annex page in Assembly of Document Definition.
When I add document (multi-page, 4 pages for this example) for OCR, the document returned by Abbyy contains three pages. The pages where the search string was found is exported along with a blacked out page (which had data in it but no match for search string).
I need to hide the sensitive detail (ID) while exporting the rest of the document as such. Please best way to set this.
I have implemented a layout which searches for Id in the documents provided. I have a multi-page document
the documents are random (combinations- one search string and one additional page or multiple pages with search string and multiple annex pages)
I have enabled annex page in Assembly of Document Definition.
When I add document (multi-page, 4 pages for this example) for OCR, the document returned by Abbyy contains three pages. The pages where the search string was found is exported along with a blacked out page (which had data in it but no match for search string).
I need to hide the sensitive detail (ID) while exporting the rest of the document as such. Please best way to set this.
コメント
8件のコメント
Are you saying you want to hide the ID on the image you are saving? Sort of like redacting it from the page? If so you can easily do that. Just open up the DocDef. Go to export. In the there should be an option to Redact. Just select the ID field.
Thanks for the reply. I found out what I was doing wrong. In assembly Abbyy was expecting 1-10 file or document and annex 1-10. I have updated them to 0-10 both cases. As its not not always true that abbyy would find the data.
Abbyy was blacking out the entire page since it was not able to find the search string (constrain). Once I made it required this issue was resolved as well.
I am facing another issue. In the training -> Properties of search element -> character string-> alphabets, I have added number -100 (portion) and '-' 22(portion). Percentage of non-alphabet charater is set to 4, whole words true, and max space length to 50. But Abbyy is detecting alphabets (Eg: this is the search string) from the document. But I want to restrict this to only numbers and few special char ( - , / )
Why not set the percentage of non-alphabet characters to 0? This was, the word you're trying to pick up should only be composed of the character set from your alphabet only.
Yes, I could set it to 0 but I expecting few char that was not in my test documents. Eg I have added - and / .. The document might contain \ or : Hence I added a value of 4 to it for unexpected char.
Abbyy is removing a few words Eg:"this is a test message" from the document but the setting says to allow only 4 char (non -alphabets).
That 4 isn't 4 characters. Its 4 percent. If you are allowing those extra character why not include them in the DataSet?
Thanks for your reply. I have got it fixed. the setting were right. After setting the value (key already set) to required this problem was resolved.
Thanks Sushi, Much appreciated.
サインインしてコメントを残してください。