Community

How to merge two Batches into one task

Hi,

I am an user of ABBYY Flexicapture 11 Distributed.
In Image Import Profile, I have selected an option of load one file for one batch because I don't know the number of docs/pages in input PDF file in advance. So it creates separate batch for every input file. This option is useful only when the file contains more then say 25 docs so I have set a limit of 25 docs per task. But if the file has only one doc in it then it becomes very time consuming for verification operator to open every task and verify doc.
So my question is : is it possible to merge two or more batches into one task programatically on the basis of some criteria say if 15 batches have only 1 doc each then merge those docs into one task for verification operator.

Thanks in Advance.
Rahul

Was this article helpful?

0 out of 0 found this helpful

Comments

6 comments

  • Avatar
    Permanently deleted user
    Hello,

    Could you please describe why cant you load all files at once, or load several files?

    Regards
    Tim





    0
  • Avatar
    Permanently deleted user
    Hi,

    I can not load all files at once because of following scenario:
    If I receive suppose 20 files at a time in Hot Folder, out of which 18 files contains 1 doc each, and 2 files contains 100 docs each then If I load all files in single batch then it will become confusing for Operator that which file is from which client(file name contains client name). I want it to be distributed(in systematic way) as suppose 8 files in one task because it is from same client then other 10 files in one another task, and last 2 files(which contains 100 docs each) I want to make two separate batches of it(event if it is from same client), So as soon as Operator completes task(and finally a batch) then we can send it to next stage. Currently We can not send part of batch to next stage because then there will be multiple tasks will get created on next stage(Currently I have 2 Verification Stages). And currently We are picking the Output Files from Output Folder on the basis of batch, so we don't want to create big batches to avoid a delay in getting output files. As well as I want to set priority of batches(on the basis of a file name criteria or by calling one outside function to get priority of file of particular client etc.) so the batches should be as separate as possible but not as small as one doc per batch.

    Regards,
    Rahul
    0
  • Avatar
    Permanently deleted user
    Hello,

    The problem is that FlexiCapture offers two types of stages for processing:batch processing stage and document processing stage. You can only access one batch in a batch processing stage.From withing FlexiCapture you cant merge two batches by using workflow settings since you only have access to one batch.

    There are several solutions from my point of view:

    1.Use scanning stations
    You can group documents how ever you want

    2.Create separate batch types for each client with separate hotfolders

    3.Import images from hot folders by means of description files
    Descrioption files allow you to specify images that should be imported to a new batch, as well as batch type to use and additional parameters
    This approach is more complex since you will need to create an application for generating these description files. You can find information about description files in FlexiCapture help file.


    Merging two batches in one in an automatic manner is complex, you will need to create an external windows service to monitor your batches and use web-api to merge batches. Check if above options can be used in your scenario. Creating a solution to merge batches may be time consuming and lower overall reliability of your solution.

    Regards
    Tim







    0
  • Avatar
    Permanently deleted user
    Hi Timur,
    Thanks for the solutions. I have done some initial study on those solutions, I have some queries on it as follows:
    1. Scanning Stations: Can we use Scanning station in fully automated manner, means there should not be any requriement of manual intervention to send files to ABBYY OR to Hot folder. It should work like- it will pick files from one Input Folder then will make sets of files on the basis of 2 criteria (1. File name 2. Number of pages in file) and then load to ABBYY. Is it possible? If it is possible then I will request a Scanning station to my vendor.

    2. Separate Batch Type: How to identify that how many pages are there in a single file, because this is also one of the criteria on the basis of which I want to make separate batches. Is it possible if I use separate batch type?

    3. I have read help of Description File, this seems usefull but still not sure how to apply criteria of page count of file. Can you please give some sample Description files...

    For your information I am giving the following scenario and telling how the batch formation need to be happened:
    Following 10 files received in Hot Folder at a time:
    1. Client=ABC_Subclient=A1_File1_01.PDF (1 Pages)
    2. Client=ABC_Subclient=A1_File2_02.PDF (1 Pages)
    3. Client=ABC_Subclient=A5_File3_03.PDF (6 Pages)
    4. Client=ABC_Subclient=A5_File4_04.PDF (85 Pages)
    5. Client=ABC_Subclient=A5_File5_05.PDF (4 Pages)
    6. Client=PQR_Subclient=P1_File6_06.PDF (22 Pages)
    7. Client=PQR_Subclient=P1_File7_07.PDF (8 Pages)
    8. Client=PQR_Subclient=P1_File8_08.PDF (100 Pages)
    9. Client=PQR_Subclient=P1_File9_09.PDF (12 Pages)
    10. Client=PQR_Subclient=P1_File10_10.PDF (1 Pages)

    The Batches should be made as follows:
    Batch1: (File1, File2)
    Batch2: (File3, File5)
    Batch3: (File4)
    Batch4: (File6, File10)
    Batch5: (File7, File9)
    Batch6: (File 8)

    Pages or Docs per Task limit is 25. Batches should be formed on the basis of Client and Subclient name And maximum page count for a batch should not exceed 25 if files to be loaded are more than 1. And if single file contains more then 25 pages then it should make one batch for that file(finally the tasks will have 25 pages or docs per task)


    Regards,
    Rahul
    0
  • Avatar
    Permanently deleted user
    Hello,

    It will not be possible to use scanning station in fully automated manner, you will still need to at least open the scanning station each time you need to export images to FlexiCapture. Scanning station doesn't support any monitoring functionality it is for manual import mostly.

    In order to import images using such a scenario, you probably need a custom solution. You can try using the following approach:
    1.Use image import according to description file
    2.Create a windows service using .net framework to monitor hotfolder, check image names and sizes and create description files according to your criteria

    Regards
    Tim





    0
  • Avatar
    Permanently deleted user
    Hi Timur,

    Thanks for the information and solutions.

    I have tested with DAT description files, it is working but one problem is if some new files comes in Hot Folder which are alphabetically higher, then those files will get loaded in the first batch which is planned by creating first dat file.
    I think XML Description file will work better as we have to specify names of files in the XML file.
    I have some queries on it: If I update XML file by using Windows Service/Utility then will it work? can you please tell me where can I get some example XML files?

    Regards,
    Rahul
    0

Please sign in to leave a comment.