Community

check for duplicate PDF files in a collection of files

Is it possible to have an ABBYY product scan my collection of PDF files automatically and report duplicate files?

Background:

I have a lot of PDF files and some of these files in the collection are (more or less) the same. If you look at the contents (OCR). I would to scan all the PDF files automatically and get a report of files which are (for example min. 90% the same, based on contents after OCR scan).

Basically something like Anti-Twin does for .jpg files, but now for PDF files.

Thanks in advance.

0

Comments

1 comment

Please sign in to leave a comment.