Hello
A query on exporting a section within a document definition to XML:
One of the documents I am recognising has a repeating section that appears on the back of every page, and its contents never change. Is there any way to export this section just once, instead of exporting as many copies to the XML file as there are recognised occurrences in the document? Limiting the number of occurrences in the section ensures that only one is assigned to the recognised document, but it leaves a large number of incomplete, and therefore incorrect, documents in FlexiCapture that need deleting before or after verification, which is a pain.
Cheers
Comments
Do you export the whole document in one XML file? Do you use the standard "export to data files" profile, or a script? Does each document have a fixed number of pages/sections, or does it vary?
We export the whole document in one file using the standard export settings.
The document is somewhat random. It can be made up of a number of sections of variable page lengths, with each section having its own FlexiLayout. This repeating section appears in every document but we only need to export one copy of it. Currently it exports however many instances are discovered.
The version of FlexiCapture we are using is 9.0.4.2510.
But if you use the Distributed version of FlexiCapture, the most effective way is to add one additional script stage between recognition and verification. Using a Document processing script, you can delete the excess pages (= sections) from the document and keep just one instance. This will save you time during verification, and you will have exactly what you need to export.
Actually, I'm not sure that this script is available in FC9; it may require an upgrade to FlexiCapture 10.
Andrew Zyuzin
Thanks for this info. Can you please help me with a sample script? I am stuck with a similar issue in ABBYY.
Hi Sirish
Using FlexiCapture 12, this case may be resolved even more efficiently:
1) train a Classifier to differentiate "a repeating section that appears on the back of every page" from other pages;
2) use a Scripting step in the workflow with a Batch Processing script (https://help.abbyy.com/en-us/flexicapture/12/developer/scripts_handling) to delete the excess pages based on the classification result (Documents.Pages[*].ResultClassName).
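To make the "delete excess pages" logic concrete, here is a minimal sketch in plain Python. It is not FlexiCapture script code and does not call the FlexiCapture API: the `Page` class, its `result_class_name` attribute, and the class name "BackPage" are all hypothetical stand-ins for a page and its classification result. It only shows the selection rule: keep the first page of the repeating class and mark every later one for deletion.

```python
from dataclasses import dataclass


@dataclass
class Page:
    """Hypothetical stand-in for a document page (not a FlexiCapture type)."""
    index: int
    result_class_name: str  # assumed to hold the classifier's result for the page


def excess_pages(pages, repeating_class):
    """Return every page of the repeating class except the first one found."""
    seen_first = False
    excess = []
    for page in pages:
        if page.result_class_name != repeating_class:
            continue  # pages of other classes are never deleted
        if seen_first:
            excess.append(page)  # second and later copies are excess
        else:
            seen_first = True  # keep the first copy
    return excess


# Example data: the back page repeats after every front page.
pages = [
    Page(0, "Invoice"),
    Page(1, "BackPage"),
    Page(2, "Invoice"),
    Page(3, "BackPage"),
    Page(4, "Invoice"),
    Page(5, "BackPage"),
]
to_delete = excess_pages(pages, "BackPage")
```

In a real Batch Processing script, the same rule would be applied to the actual pages, and the pages collected here would be the ones to delete.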
Or you may filter out the excess pages at export, using an export script (https://help.abbyy.com/en-us/flexicapture/12/developer/script_export): use Document.Sections to iterate over the document sections and save to JSON (or whatever file you create as a result) only the fields from the required sections. If you don't want these repeating pages to appear in the resulting image file (e.g. PDF), set Pages.ExcludedFromDocumentImage = true for the pages of the undesired sections.
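The export-time filter can be sketched the same way. Again, this is plain Python under stated assumptions rather than a FlexiCapture export script: the `Section` class, its `name` and `fields` attributes, and the section names are hypothetical. The idea is to iterate over the sections in order, write the repeating section only the first time it is seen, and write all other sections as usual.

```python
import json
from dataclasses import dataclass


@dataclass
class Section:
    """Hypothetical stand-in for a document section (not a FlexiCapture type)."""
    name: str
    fields: dict


def export_sections(sections, repeating_name):
    """Serialise sections to JSON, keeping only the first repeating section."""
    exported = []
    repeating_done = False
    for section in sections:
        if section.name == repeating_name:
            if repeating_done:
                continue  # skip later copies of the repeating section
            repeating_done = True
        exported.append({"section": section.name, "fields": section.fields})
    return json.dumps(exported, indent=2)


# Example data: "BackPage" appears twice but should be exported once.
sections = [
    Section("Header", {"Number": "A-1"}),
    Section("BackPage", {"Terms": "standard terms"}),
    Section("Lines", {"Total": "100.00"}),
    Section("BackPage", {"Terms": "standard terms"}),
]
output = export_sections(sections, "BackPage")
```

Filtering at export leaves the document itself untouched, so verification still sees every page; deleting pages in a script stage, as in the first option, also shortens verification.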
Thank you, Andrew Zyuzin, for taking the time to reply to me.
I went through some of the documentation with your guidance, and I found this link useful and relevant for my use case: https://support.abbyy.com/hc/en-us/articles/360017800999?input_string=document+assembly+scripting+help
I am struggling to apply the pointers mentioned in this document. If you feel this is relevant, can you help me by sharing an example script like the one shown in the documentation?