Community

Multi Columns Documents

Written by Permanently deleted user

April 09, 2014 00:51

How can I set up a FlexiLayout template to capture data from a document in which some pages organise the data in two columns? Each column contains table data that I need to capture, and the data has to be read left column first. A table can continue into the next column, which may be on the same page or on the next page. Thanks.


Comments

9 comments

Permanently deleted user

April 11, 2014 13:21
I have the same issue and am wondering whether anyone has any hints.

Aleksei Nikitin

April 11, 2014 14:54
I propose the following approach:
1. Identify the column regions in a repeating group. For example, search from left to right for vertical white gaps that span the whole page height. The areas between the white gaps are the columns.
2. Loop from the first column to the last and find the header and footer of each table.
3. Build the table search areas as RectArrays. For example, if the table header is in column N and the footer is in column N+2, the RectArray will contain three rectangles: the first runs from the header to the bottom of column N, the second is the whole of column N+1, and the third runs from the top of column N+2 to the footer.
4. Find the table instances in the corresponding areas.
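The rectangle construction in step 3 can be sketched outside FlexiLayout as follows. This is only an illustration in Python, not FlexiLayout script: the `Rect` type, the `table_search_rects` function and all parameter names are hypothetical, and the sketch assumes the columns are already available as a list in reading order with y coordinates growing downwards.

```python
from typing import List, NamedTuple


class Rect(NamedTuple):
    left: int
    top: int
    right: int
    bottom: int


def table_search_rects(columns: List[Rect], header_col: int, header_bottom: int,
                       footer_col: int, footer_top: int) -> List[Rect]:
    """Build the rectangles lying between a table header and its footer.

    columns       -- column regions in reading order (hypothetical input)
    header_col    -- index of the column that contains the header
    header_bottom -- y coordinate of the header's bottom edge
    footer_col    -- index of the column that contains the footer
    footer_top    -- y coordinate of the footer's top edge
    """
    if footer_col < header_col:
        raise ValueError("footer must not precede header in reading order")

    # Header and footer in the same column: a single rectangle is enough.
    if header_col == footer_col:
        col = columns[header_col]
        return [Rect(col.left, header_bottom, col.right, footer_top)]

    rects = []
    # First rectangle: from the header down to the bottom of its column.
    first = columns[header_col]
    rects.append(Rect(first.left, header_bottom, first.right, first.bottom))
    # Every column strictly between header and footer is taken whole.
    rects.extend(columns[header_col + 1:footer_col])
    # Last rectangle: from the top of the footer's column down to the footer.
    last = columns[footer_col]
    rects.append(Rect(last.left, last.top, last.right, footer_top))
    return rects
```

With three columns and a header in column 0 and a footer in column 2, the function returns three rectangles, matching the N / N+1 / N+2 example above. For tables that continue on another page, each rectangle would also need a page index, which this sketch leaves out.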

Permanently deleted user

April 11, 2014 18:01
Hi Aleks,

Thanks for your tips. I am quite new to ABBYY, especially on the scripting side. My document is divided into exactly two columns and I know the exact coordinates (area) of each column, so I do not need to detect the columns from white gaps. I have created a repeating group to detect the headers and footers, but I cannot force the search to follow the column order, and I have difficulty pairing the header and footer of the same table, especially when the table spans two columns (which may also be on different pages). This is probably what you were suggesting in step 2. How do I run a loop from the first to the second column for each page? Can I do this using the search area? Once the header and footer are detected, how can I create the RectArrays for searching the table? Thank you for your help.
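The ordering and pairing problem described here can be illustrated independently of FlexiLayout. The following Python sketch is not FlexiLayout script. It assumes that every detected header and footer is available with its page, column and vertical position. The `Hit` type and both function names are hypothetical. Sorting by (page, column, top) gives the reading order "left column, then right column, then next page", and each header is paired with the next footer in that order.

```python
from typing import List, NamedTuple, Optional, Tuple


class Hit(NamedTuple):
    kind: str    # "header" or "footer"
    page: int    # 0-based page index
    column: int  # 0 = left column, 1 = right column
    top: int     # y coordinate of the element's top edge


def reading_order_key(hit: Hit) -> Tuple[int, int, int]:
    # Page first, then column (left before right), then top to bottom.
    return (hit.page, hit.column, hit.top)


def pair_headers_with_footers(hits: List[Hit]) -> List[Tuple[Hit, Optional[Hit]]]:
    """Pair each header with the first footer that follows it in reading order."""
    pairs: List[Tuple[Hit, Optional[Hit]]] = []
    open_header: Optional[Hit] = None
    for hit in sorted(hits, key=reading_order_key):
        if hit.kind == "header":
            if open_header is not None:
                # Previous header never got a footer; record it as unclosed.
                pairs.append((open_header, None))
            open_header = hit
        elif hit.kind == "footer" and open_header is not None:
            pairs.append((open_header, hit))
            open_header = None
        # A footer with no open header is ignored in this sketch.
    if open_header is not None:
        pairs.append((open_header, None))
    return pairs
```

Each resulting pair gives the column and position of a table's start and end, which is the information needed to build the search rectangles for that table.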

Eugenia Posylnaya

April 15, 2014 13:20
Hey, Susanna!

Probably the easiest way in this situation is the following:
1. Create a Repeating Group element.
   Use “Page mode” and specify the number of repetitions per page.
   Use an absolute search area constraint and define the coordinates of the two regions of the page (the left and right halves, excluding the gap between them).
2. Create a Table element within the Repeating Group element, check “Look for header / footer” and uncheck the “Header / Footer is on each page” option.
3. When you describe the position of the Table block by specifying the source element (the Repeating Group element), check “has repeating instances” and choose “Left to right” as the instance sort order, so that data is read left column first.

Will that suit you?
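As a rough illustration of the two absolute regions mentioned in step 1, the rectangles can be derived from the page size and the position of the gap. This is a plain Python sketch rather than a FlexiLayout setting. The function name, the parameters and the `(left, top, right, bottom)` tuple layout are assumptions made for the example.

```python
from typing import Tuple

Region = Tuple[int, int, int, int]  # (left, top, right, bottom)


def column_regions(page_width: int, page_height: int,
                   gap_left: int, gap_right: int) -> Tuple[Region, Region]:
    """Return the left and right column regions, excluding the gap between them."""
    if not 0 < gap_left < gap_right < page_width:
        raise ValueError("gap must lie strictly inside the page")
    left_region = (0, 0, gap_left, page_height)
    right_region = (gap_right, 0, page_width, page_height)
    return left_region, right_region
```

The two returned tuples correspond to the two search areas that would be entered for the repeating group.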

Permanently deleted user

April 17, 2014 18:14
Hi Eugenia,

Thanks for your help. Unfortunately I have tried this, but it does not solve my problem. The number of instances per page is not known in advance, and tables can run across columns and pages. I would be glad to hear any other tips you might have. Thank you.

Permanently deleted user

April 19, 2014 01:56
Susanna,

Is there a sample image that you can provide so I can try on my end?

Permanently deleted user

April 19, 2014 12:15
Hi Sushi,

I can certainly provide you with the document. I have attached a scanned version of the 5 pages of the document we would like to capture data from. For each table you will see a table name (e.g. 1st table = Chiamate Locali) and a total number of rows (the table is a call list: Numero totale chiamate). At the end of each table there is a footer that gives the total duration of the calls (total of the 3rd column: Durata totale chiamate), the average duration (Durata media chiamate) and the total amount (total of column 5).

We have tried some scripting ourselves, but we have not managed to implement the sequencing logic between columns and pages (first the left column of page 1, then the right column of page 1, then the left column of page 2, and so on).
Thank you for any help you will be able to provide.

4c0240a4-7404-420a-aa92-a6ba00d73808_image-85.jpeg

b02781a6-3aaa-4c12-8541-a6ba00d73840_image-86.jpeg

989f0f17-1c45-4556-b5c9-a6ba00d73880_image-87.jpeg

3757b4d6-ba99-4d1a-9f6a-a6ba00d738d3_image-88.jpeg

29984ae8-a12a-4e4a-a52c-a6ba00d7391e_image-89.jpeg
Permanently deleted user

April 22, 2014 14:14
On the issue of multipage Document Definitions: I tried to follow the guidelines in the tutorial publication, but I keep getting the error "Several fields are exported to same 'ROW_INDEX'". I defined sections in the document and defined the footer as the telltale sign of the end of the document. I assembled the sections using a key field that appears on all pages. I can export to the desktop, but not to the database, which is where it really matters.

Permanently deleted user

April 28, 2014 18:56
Susanna,

Thank you for the images. I had training last week, so I couldn't look at them much. Looking at these images, I would definitely approach this with a repeatable group. I would probably create one repeatable group for the left side and another for the right side. On the FlexiCapture side, I would then write a script to merge the exports together. The hard part is that a table can jump from the left side to the right side. I suspect I would have to create some non-recognized repeatable group and remove the labels I don't want from the search area.
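The merge step could look roughly like the sketch below. It is written in Python for illustration and is not a FlexiCapture export script. It assumes the rows captured from the left and right groups are already available per page as lists of dictionaries, and `merge_column_exports` is a hypothetical name. Rows are emitted in the order left column of page 1, right column of page 1, left column of page 2, and so on.

```python
from typing import Dict, List

Row = Dict[str, str]


def merge_column_exports(left_by_page: List[List[Row]],
                         right_by_page: List[List[Row]]) -> List[Row]:
    """Interleave per-page rows: left column first, then right column, page by page."""
    merged: List[Row] = []
    page_count = max(len(left_by_page), len(right_by_page))
    for page in range(page_count):
        if page < len(left_by_page):
            merged.extend(left_by_page[page])
        if page < len(right_by_page):
            merged.extend(right_by_page[page])
    return merged
```

This sketch only restores the reading order. Splitting the merged rows into separate tables would still rely on the detected headers and footers.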
