I need to extract raw text from a lot of random PDFs.
The problem is tables, which either consist of large amounts of numbers or simply don't OCR properly, and pictures.
I was wondering if there is a way to find tables and pictures automatically so I can quickly delete or fix them without manually scrolling through hundreds of pages.
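
To make the idea concrete, here is a rough sketch of the kind of check I have in mind, not a tested solution. Since the problem tables are mostly numbers, a page could be flagged when digits make up a large share of its extracted text. The names `digit_ratio` and `flag_numeric_pages` and the 0.3 threshold are made up for illustration, and the sketch assumes the text has already been extracted into one string per page. It does nothing for pictures.

```python
def digit_ratio(text: str) -> float:
    """Share of non-whitespace characters that are digits."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    return sum(c.isdigit() for c in chars) / len(chars)


def flag_numeric_pages(pages, threshold=0.3):
    """Return 1-based numbers of pages whose text is digit-heavy.

    `pages` is a list of strings, one per page (assumed input format).
    `threshold` is an arbitrary starting value and would need tuning.
    """
    return [
        number
        for number, text in enumerate(pages, start=1)
        if digit_ratio(text) >= threshold
    ]


if __name__ == "__main__":
    sample_pages = [
        "This is ordinary prose with one number: 7.",
        "12.5 13.0 14.2\n15.1 16.8 17.3\n2019 2020 2021",
    ]
    print(flag_numeric_pages(sample_pages))  # prints [2]
```

Something like this would at least give a short list of pages to review instead of scrolling through everything, though pages with many dates or reference numbers would probably be flagged too.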