コミュニティ

Tips for converting a PDF file to HTML

Tips for converting a PDF file to HTML

I think I've found a trick to preserve the formatting of a PDF file as well as possible, when it is converted to HTML by ABBY fine Reader 16, especially when it contains many tables. It is to first go not through the PDF editor but through the OCR editor, by re-saving the original PDF file, in exact copy, after passing OCR, but without using MRC compression and only then to convert this new file to html, through the PDF editor. Because it seems that MRC compression causes some data to be lost, and probably some concerning the arrays. That is what I think I have observed. Am I right?

この記事は役に立ちましたか?

0人中0人がこの記事が役に立ったと言っています

コメント

1件のコメント

  • Avatar
    Victoria Dvornikova

    Hello,

    The issue you described can be caused by a specific PDF file or its content (e.g. existing text layer). To investigate the cause I've created a support ticket from your post as we would like to check the original file to find out the reason for such an issue with conversion.

    0

サインインしてコメントを残してください。