Community

Full Text Extraction Issues

Written by Permanently deleted user

May 27, 2021 13:55

Hi All,

I am trying to do a full-page extract. For this I am using a FlexiLayout (FL) for a semi-structured document. I have created a field with a rule script (a VB RegEx script that extracts the data). In testing this all works great: the text is extracted and I get what I need. However, when I push documents through the project, the Document Definition isn't applied to them, and therefore nothing gets extracted *cry*

Is there a way to run these through the project unattended, utilizing a script for exporting the original image file to a specific destination, and the exported data (hopefully .xlsx or .csv) to a different location?

Apologies if this topic has already been covered. I have checked the forums, but I haven't seen anything matching this issue.

Cheers

Comments

6 comments

Scott Chau

May 27, 2021 18:59
Lee,

If it's not matching your DocDef and you are using an FLS, the issue is a rule in your FLS. If you have any required elements and one of them is not on that page, it won't match, which means it won't run that FLS/DocDef. Also, if your elements are just not picking up the correct thing and the score gets too low, it won't match the FLS/DocDef. To troubleshoot this, run it in the DocDef and see where the issue is. A good FLS should have solid color circle and should end with a green solid circle.

As for your other questions, you can set it up to run unattended. It's a workflow setting in FlexiCapture Distributed: just set the workflow to unattended. As for export, in your DocDef you have the option to create multiple exports.
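
If the export profiles cannot be pointed at separate destinations directly, a small post-export script is one alternative. The following is only a minimal Python sketch, not a FlexiCapture feature: it assumes the images and the data files all land in one flat export folder, and the function name `route_exports`, the folder arguments, and the extension lists are hypothetical choices made for illustration.

```python
import shutil
from pathlib import Path

# Assumed extensions; adjust to whatever the export profiles actually produce.
IMAGE_EXTS = {".tif", ".tiff", ".pdf", ".jpg", ".jpeg", ".png"}
DATA_EXTS = {".csv", ".xlsx"}


def route_exports(export_dir, image_dest, data_dest):
    """Move image files and data files out of export_dir into separate folders.

    Returns a list of (source_name, destination_path) pairs for the moved files.
    """
    export_dir = Path(export_dir)
    image_dest = Path(image_dest)
    data_dest = Path(data_dest)
    image_dest.mkdir(parents=True, exist_ok=True)
    data_dest.mkdir(parents=True, exist_ok=True)

    moved = []
    # sorted() takes a snapshot of the folder before anything is moved.
    for path in sorted(export_dir.iterdir()):
        if not path.is_file():
            continue
        ext = path.suffix.lower()
        if ext in IMAGE_EXTS:
            target = image_dest / path.name
        elif ext in DATA_EXTS:
            target = data_dest / path.name
        else:
            continue  # leave unknown file types where they are
        shutil.move(str(path), str(target))
        moved.append((path.name, target))
    return moved
```

Calling `route_exports("export", "images", "data")` on a schedule would keep the two outputs apart; files with other extensions are left untouched so nothing is lost silently.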

Permanently deleted user

May 28, 2021 09:18
Hi Scott,

Thank you for your response. It was as you said: there was a required element on the field that I had not unchecked, and that seemed to be causing the issue. I have also now switched over to unattended, which seems to have sped up the processing time.

However, I am unsure what you mean by

" A good FLS should have solid color circle and should end with a green solid circle."

I have only been working seriously with FlexiCapture for about 6 months, so I am still fairly new to how it all works.

Many Thanks,

Lee

Permanently deleted user

May 28, 2021 12:49
So update!

This project applies the DocDef intermittently. When it applies, it works great. The problem is that when it doesn't, it causes an issue for the RPA process that picks the output up downstream. When watching the processing, it seems to skip the analysis on certain batches. Could this be because the CI is too low? If so, how do I fix it?

In theory this should just be a simple case of 'read document > extract all alphanumeric data > export to Excel file', job done. I don't see where I am going wrong.

Scott Chau

May 29, 2021 00:30
This is hard to troubleshoot without seeing the project. If you look in your Project Setup Station, are there any error messages for those problem batches?

Permanently deleted user

June 01, 2021 07:45

Edited
Hi Scott,

Yes, there is one batch that does not have a stage next to it. The error message there states: 'Unable to process a non-analyzed document'

There are other batches that have been processed but not exported (I am assuming that these have just not been matched to the DocDef and therefore there is nothing to export).
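
One way to stop these gaps from reaching the downstream RPA is to check, before it picks anything up, that every exported image has a matching data file. This is only a rough Python sketch under a loud assumption: that the export names each data file after its image (same base name, different extension). The function name `find_unexported` and the folder arguments are hypothetical and are not part of FlexiCapture.

```python
from pathlib import Path


def find_unexported(image_dir, data_dir, data_exts=(".csv", ".xlsx")):
    """Return sorted names of image files with no data file of the same base name."""
    # Assumption: image and data files share a base name, e.g. doc1.tif / doc1.csv.
    data_stems = {
        p.stem.lower()
        for p in Path(data_dir).iterdir()
        if p.is_file() and p.suffix.lower() in data_exts
    }
    return sorted(
        p.name
        for p in Path(image_dir).iterdir()
        if p.is_file() and p.stem.lower() not in data_stems
    )
```

Anything this returns is a document that came through without extracted data, so it can be held back or flagged for review instead of being handed to the RPA.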

Scott Chau

June 10, 2021 01:12
A non-analyzed document means it was not recognized. The question is why it didn't get recognized. You would need to look at the event log to track it down.
