I'm responsible for setting up the image import. From my review of the Abbyy Flexicapture system, it doesn't seem that what I need is easily accomplished. Please let me know if I'm overlooking something.
We plan to use hotfolders for all of our image importing. Some of our image files contain barcodes, and some contain none. For example, one file may hold several invoices, each separated by a barcode, while the next file may consist of a single invoice with no barcode. Because of this, I need our import process to create a new document for each file that is imported, and also to create a new document whenever it recognizes a barcode. Currently, it seems the only options are to EITHER create a new document for each file OR create new documents when a barcode is found. The two cannot be combined unless I import 1 file at a time. If my hotfolder sweeps 1 file at a time, I can set the separation settings to separate by barcode. That accomplishes what I need, but then either each batch consists of only 1 file, or I have to keep 1 continuous batch open that keeps having documents created in it. Neither of those options is ideal.
I want to be able to import 10 files into a new batch, creating a new document for each of the 10 files, and a new document when any barcode is found within any of those 10 files. The reason this does not work is that if a file does not have a barcode on its first page, that file's pages get added to the previous document.
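To make the rule concrete, here is a minimal Python sketch of the separation logic described above. It assumes each imported page can be reduced to a source filename and a flag saying whether a separator barcode was detected, and that the barcode page stays in the document as its first page. `Page` and `separate` are hypothetical names used for illustration only; none of this is the FlexiCapture API.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Page:
    source: str        # image file the page came from (hypothetical field)
    has_barcode: bool  # True if a separator barcode was detected on the page


def separate(pages: List[Page], split_on_new_file: bool) -> List[List[Page]]:
    """Group a stream of pages into documents.

    A barcode page always starts a new document. When split_on_new_file is
    True, the first page of each file starts a new document as well.
    """
    documents: List[List[Page]] = []
    previous_source = None
    for page in pages:
        new_file = page.source != previous_source
        if not documents or page.has_barcode or (split_on_new_file and new_file):
            documents.append([])
        documents[-1].append(page)
        previous_source = page.source
    return documents


pages = [
    Page("a.tif", True), Page("a.tif", False),   # invoice 1 in a.tif
    Page("a.tif", True), Page("a.tif", False),   # invoice 2 in a.tif
    Page("b.tif", False), Page("b.tif", False),  # single invoice, no barcode
]

barcode_only = separate(pages, split_on_new_file=False)
combined = separate(pages, split_on_new_file=True)
print(len(barcode_only), len(combined))  # prints "2 3"
```

With barcode-only separation, the two pages of `b.tif` end up appended to the second invoice from `a.tif`. With the combined rule, `b.tif` becomes its own document, which is the behavior I'm after.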
The knowledgebase had some barcode identifier script examples, but I believe they are old and no longer work. I'm not finding any available methods to identify a barcode. My original thought was to set the import setting to create a new document for each file, then create an On Page Create event handler that checks whether a barcode is found and, if so, creates a new document. That didn't work out, so I then set the import setting to create a new document per barcode. In the On Page Create event handler I store the image filename as a batch parameter, so that on each page creation I can check whether a new filename is being imported and, if so, create a new document. The problem is that it doesn't appear that I can modify the batch structure within this processing event handler; I'm being thrown errors. From what I've researched, those types of changes can only be made in a custom stage set up for batch scripting. Is the solution therefore to set up a custom stage that loops through every document and moves pages into new documents whenever the source of a page does not equal the source of the previous page in the same document?
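The custom-stage pass asked about above could look roughly like the following Python sketch. It only models the logic: pages are plain `(source_file, page_number)` tuples and documents are lists of pages, which is an assumption for illustration. `split_on_source_change` is a hypothetical name, and the real stage script would have to use whatever objects the FlexiCapture scripting interface actually exposes.

```python
from typing import List, Tuple

# Each page is modelled as (source_file, page_number); a document is a list of pages.
Page = Tuple[str, int]
Document = List[Page]


def split_on_source_change(documents: List[Document]) -> List[Document]:
    """Return new documents, starting a new one whenever the source file
    changes between consecutive pages of the same original document."""
    result: List[Document] = []
    for document in documents:
        current: Document = []
        for page in document:
            # A different source than the previous page closes the current document.
            if current and page[0] != current[-1][0]:
                result.append(current)
                current = []
            current.append(page)
        if current:
            result.append(current)
    return result


batch = [
    [("a.tif", 1), ("a.tif", 2)],
    [("a.tif", 3), ("b.tif", 1), ("b.tif", 2)],  # b.tif was appended to the previous document
]
fixed = split_on_source_change(batch)
print(fixed)
```

In this example the second document is split in two, so `b.tif` ends up as a document of its own while the barcode-based boundaries from the import are left untouched.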
I know there is the option to set the documents to be separated in recognition, but this seems like it could be a nightmare for us.
Any thoughts on this, or am I way off track? We process 100-120k invoices per month, with most invoice files coming directly from the vendors, which is why the files vary in their use of barcodes. I need to find a way to make this work well. I may try to find an outside tool to separate the files by barcode prior to importing into Abbyy.
Thanks in advance for any assistance
-Keith
Comments
Hi Keith.
Have you tried the setting Hot Folder: Document Separation > Create New Document > For every image file?
I assume each of your image files already contains a single document, and you want each file to be processed as 1 document.
Per my post, we cannot assume each image is 1 document. Most of our images contain many documents, each separated by a barcode. Some image files contain only 1 document with no barcodes. I need for the image import to create a new document for each image file, and for each barcode.
The above allows me to import multi-document files separated by patch codes together with single-document files containing no patch code separators, all within the same import task.
I have created a hotfolder and configured all the settings required for it, for example:
1. Enabled from the processing server
2. Set a 20 sec interval
3. Create batch for imported images automatically
4. Number of files to select: from subfolder
But when I import images, only one folder is imported, and no further folders are imported even after the interval has elapsed.
Can anyone guide me?
Check to see if hotfolders are enabled for your project (you can check this at the processing server). Keep in mind that if you have a 3-level folder structure (e.g. A/B/C1, A/B/C2), importing from subfolder A will create only one batch containing all images of subfolders B/C1 and B/C2. Importing from A/B, however, will create two batches: one for the images under C1 and another for the images under C2.
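As a rough illustration of that folder rule, the Python sketch below groups image paths into one batch per first-level subfolder of the hotfolder. It only models the behavior as described in this comment; `group_into_batches` is a hypothetical helper and not something FlexiCapture provides.

```python
from collections import defaultdict
from pathlib import PurePosixPath
from typing import Dict, List


def group_into_batches(hot_folder: str, image_paths: List[str]) -> Dict[str, List[str]]:
    """Group images by the first subfolder below hot_folder (one batch each).

    Assumes every image sits inside some subfolder of hot_folder.
    """
    root = PurePosixPath(hot_folder)
    batches: Dict[str, List[str]] = defaultdict(list)
    for path in image_paths:
        relative = PurePosixPath(path).relative_to(root)
        batches[relative.parts[0]].append(path)
    return dict(batches)


images = ["A/B/C1/1.tif", "A/B/C1/2.tif", "A/B/C2/3.tif"]
from_a = group_into_batches("A", images)      # one batch, keyed "B"
from_a_b = group_into_batches("A/B", images)  # two batches, keyed "C1" and "C2"
print(sorted(from_a), sorted(from_a_b))
```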
Hope this helps.
Kind regards,
Alberto
@... can you please share the script? We are also facing the same problem.
Thanks,
Sarath.