What is internal process of multiple semi-structured (based on FlexiLayout) Document Definitions' matching?

Question

When there are multiple semi-structured (based on FlexiLayout) Document Definitions in the FlexiCapture project, when importing a file, the FlexiCapture needs to decide what Document Definition to match with the imported file. Based on what factors, how, FlexiCapture decides what Document Definition to use?

Answer

The are multiple factors that affect semi-structured Document Definitions' matching.

  • Use first acceptable FlexiLayout option. This option can be found in the Document Definition list of the project. Checking every Document Definition could take time and resources, thus this option is turned on by default. If FlexiCapture finds the Document Definition that can be matched with the imported document, it matches it and does not check the quality of matching for other Document Definitions. If you are sure that each imported document can be matched with only one Document Definition it is recommended to leave this option on. Otherwise, turn it off, for FlexiCapture to check the quality of matching for other Document Definitions.
  • Quality of matching. Quality of matching is an internal metric that takes into account multiple matching factors. If the Use first acceptable FlexiLayout option is turned off, the FlexiCapture will check the imported document against each Document Definition and will choose the one with the highest matching quality.
  • Required Element. When creating an Element in FlexiLayout, you can choose the element to be Required, Optional, or Prohibited. Matching of Optional Elements affects only the quality of matching. If the Required Element of FlexiLayout was not found on the document, it means that the FlexiCapture will not match this FlexiLayout with said document.
  • Prohibited Element. Contrary to the Required Element, if the Prohibited Element was found on the imported document, it means that the FlexiCapture will not match this FlexiLayout with said document.

What to check when the Document Definition matching results in FlexiCapture are not as expected?

  • Usage of Required and Prohibited Elements in FlexiLayouts. If you need FlexiCapture to match Document Definitions based on some text or other Elements on the Document, make sure that the configuration of said elements is correct.
  • Use first acceptable FlexiLayout option. If you do not use the Required and Prohibited Elements in your FlexiLayouts, this option needs to be turned off, for FlexiCapture to be able to check each of the Document Definitions, compare the quality of matching, and choose the most suitable Document Definition.

Was this article helpful?

1 out of 1 found this helpful

Have more questions? Submit a request

Comments

0 comments

Please sign in to leave a comment.

Recently viewed