Community

Document section export options

Hello

A query on exporting a section within a document definition to XML:

One of the documents I am recognising has a repeating section that appears on the back of every page, the contents of it never changes. Is there any way to export this section just the once instead of having the same number of recognised occurences in the document exported into the XML file? Changing the number of occurences in the section makes sure only one is assigned to the recognised document but this leaves a large number of incorrect documents due to being incomplete in FlexiCapture that need deleting prior/after verification which is a pain.

Cheers

Was this article helpful?

0 out of 0 found this helpful

Comments

7 comments

  • Avatar
    Andrew Zyuzin
    Hello

    Do you export the whole document in one XML file? Do you use standard export to data files profile (or script)? Do you have fixed number of pages/sections at each document or random?
    0
  • Avatar
    Permanently deleted user
    Hello

    We export the whole document in one file using the standard export settings.
    The document is somewhat random. It can be made up of a number of sections of variable page lengths, with each section having its own FlexiLayout. This repeating section appears in every document but we only need to export one copy of it. Currently it exports however many instances are discovered.

    The version of Flexicapture we are using is 9.0.4.2510
    0
  • Avatar
    Andrew Zyuzin
    There is a way to replace the standard export with script export - I guess that just export all document fields to XML file is an extremly easy script. There you will be able to check section name and ignore its re-export.

    But if you use Distributed version of FlexiCapture the most effective way is to add one additional script stage between recognition and verification. Using Document processing script you can delete excess pages (=sections) from the document and remain just one instance of it. This will save your time during verification and you will have exactly what you need to export.
    Actually, I'm not sure that this script is available at FC9, it may require an upgrade to FlexiCapture 10
    0
  • Avatar
    Permanently deleted user

    Andrew Zyuzin

    Thanks for this info, can you please help me with sample script i am stuck with similar issue in Abbyy

    0
  • Avatar
    Andrew Zyuzin

    Hi Sirish

    Using FlexiCapture 12 this case may be resolved even more efficient:

    1) train Classifier to differentiate "a repeating section that appears on the back of every page" from other pages; 2) use a Scripting step in workflow with Batch Processing script (https://help.abbyy.com/en-us/flexicapture/12/developer/scripts_handling) to delete excess pages basing on classification result (Documents.Pages[*].ResultClassName)

    0
  • Avatar
    Andrew Zyuzin

    Or you may filter out excess pages at export, using export script (https://help.abbyy.com/en-us/flexicapture/12/developer/script_export): use Document.Sections to iterate document sections and save in JSON (or whatever file you create as a result) only fields from required sections. If you don't want this repeating pages exist in result image file (e.g. PDF), use Pages.ExcludedFromDocumentImage = true for the pages of undesired sections.

    0
  • Avatar
    Permanently deleted user

    Thank You Andrew Zyuzin in taking time and replying back to me.


    i went through some of the documentation with your guidance, i found this link https://support.abbyy.com/hc/en-us/articles/360017800999?input_string=document+assembly+scripting+help is useful and relevant for my use  case 

    am struggling to point pointers as mentioned in this document. if you feel this is relevant can you help me in sharing any example script like the one showed in documentation.

    0

Please sign in to leave a comment.