Tips for converting a PDF file to HTML

2024年09月03日 18:33
1

I think I've found a trick to preserve the formatting of a PDF file as well as possible, when it is converted to HTML by ABBY fine Reader 16, especially when it contains many tables. It is to first go not through the PDF editor but through the OCR editor, by re-saving the original PDF file, in exact copy, after passing OCR, but without using MRC compression and only then to convert this new file to html, through the PDF editor. Because it seems that MRC compression causes some data to be lost, and probably some concerning the arrays. That is what I think I have observed. Am I right?

1件のコメント

Victoria Dvornikova

2024年09月09日 11:19
Hello,

The issue you described can be caused by a specific PDF file or its content (e.g. existing text layer). To investigate the cause I've created a support ticket from your post as we would like to check the original file to find out the reason for such an issue with conversion.

0

サインインしてコメントを残してください。

コミュニティ

Tips for converting a PDF file to HTML

この記事は役に立ちましたか？

コメント

お探しのものを見つけられませんでしたか？