Hi,
Here is a free copy of a book in the public domain: http://www.gutenberg.org/ebooks/1497.
Converting the available files to PDF yields a perfect PDF.
Each time I try to convert to Word the file has multiple flaws: broken paragraphs. Multiple fonts. Differeing line heights.
What do you think is happening?
Sincerely,
OT
+++
FOLLOW UP
Tried to OCR the file and send to Word. Successfully. However, errors recurred in the same places as the regular conversion.
text-breaks-to-new-page-pagination-lost
PAGINATION BREAKS, FOPNT HEIGHT
Comments
3 comments
Hi OT,
None of the formats available by the link you provided can be directly converted to editabe by FineReader PDF, unfortunately. As to conversion of the PDF that you created from them to Word and page numeration, it may happen for multiple reasons, it's hard to say, especially not having the PDF that you're converting. If the text on the pages of the initial PDF doens't fit the pages in the converted Word document, maybe setting bigger page size for converted document can help. In general, OCR, as any technology, has it's own boundaries, and it's a kind of technology that can't be 100% accurate in 100% of the cases.
Regards, Yuriy
Hi Yuriy,
You wrote:
maybe setting bigger page size for converted document can help
How would I do that. It seems the program auto-launches to Word once it is ready? How can I tell ABBYY what size pages to use? Or, alternately, let Word know that newly created files arriving from ABBYY should use a specific paper size?
Sincerely,
OT
Hi OT, it's here:
Please sign in to leave a comment.