Community

Full Text Extraction Issues

Hi All, 

I am trying to do a full page extract. for this i am using the FL (semi-structured document), I have created a field which has a rule script (VB RegEx script to extract the data), this all works great, text is extracted, I get what i need. however, when it comes to pushing documents through this project, the document definition isn't applied to the documents, and therefore nothing gets extracted *cry* 

Is there a way to run these through the project unattended, utilizing a script for exporting the original image file to a specific destination, and the exported data (hopefully .xlsx or .csv) to a different location? 

 

Apologies if this topic has already been covered, I have check the forums, but I haven't seen anything matching this issue. 

 

Cheers

0

Comments

6 comments

  • Avatar
    Scott Chau

    Lee,

    If its not matching your DocDef and you are using a FLS, the issue is the rule in your FLS.  So if you have any required elements and its not on that page it won't match, which means it won't run that FLS/DocDef.  Also if your elements are just not picking the correcting thing and the score gets too low, it won't match the FLS/DocDef.  So to trouble shoot this, run it in the DocDef and see where the issue is.  A good FLS should have solid color circle and should end with a green solid circle.

    As for your other questions, you can have it setup for unattended.  Its a workflow setting in FlexiCapture Distributed.  Just set the workflow to unattended.  As for export, in your DocDef, you have option to create multiple export.

    1
  • Avatar
    Lee Norwood-Green

    Hi Scott, 

    Thank you for your response, it was as you had said, there was a required element on the field that I had not unchecked, and that seemed to be causing the issue. I have also now switched over to unattended, which seems to have sped up the processing time. 

    However, I am unsure what you mean by

    " A good FLS should have solid color circle and should end with a green solid circle."

    I have only seriously been working with FlexiCapture for about 6 months, so i am still fairly new when it comes to how it fully works.

     

    Many Thanks, 

    Lee

    0
  • Avatar
    Lee Norwood-Green

    So update!

    this project applies the DocDef intermittently, when it applies it works great. the problem is when it doesn't it will cause an issue for RPA that is picking it up downstream. When watching the processing it seems to skip the analysis on certain batches? could this be because the CI is too low? if so how do I fix it? 

    In theory this should just be a simple case of 'read document>extract all alpha-numeric data>export to excel file job done. I don't see where I am going wrong.

    0
  • Avatar
    Scott Chau

    This is hard to trouble shoot without see the project.  If you look in your project setup station are there any error message for those problem batches?

    0
  • Avatar
    Lee Norwood-Green

    Hi Scott,

    yes there is one batch that does not have a stage next to it, the error message here states: 'Unable to process a non-analyzed document'

    there are other batches that have been processed but not exported (I am assuming that these have just not been matched to the DocDef and therefore there is nothing to export)

    0
  • Avatar
    Scott Chau

    non-analyzed document it was not recognized.  The questions is why it didn't get recognized.  You would need to look at the event log to track it down.

    0

Please sign in to leave a comment.