Community

Remove "low confidence character" highlighting in DOCX files with Microsoft Word

I would like to know how to remove "low confidence character" highlighting in DOCX files with Microsoft Word. When I try to do this the regular way within MS-Word, nothing happens, the highlighting remains in the Word DOCX file.  This is frustrating. There seems to be something special about the way that Finereader adds this highlighting that MS-Word cannot handle. (I like to save my translated documents with this feature.)  I use the default color of light blue/teal, one of the "Basic colors". Should I switch to another color like yellow? Are there any workarounds to make removing this highlighting more manageable in MS-Word?

Was this article helpful?

0 out of 0 found this helpful

Comments

6 comments

  • Avatar
    Yuriy Korotkevych

    Hello Ron,

    You can use the Format Painter tool, or Clear Formatting from the Styles menu, or apply necessary style to the text highlighted by FineReader PDF. 

    We'll check why usual Highlight tool doesn't work in this case, and if there's any useful information I'll get back to you here.

    Best regards,

    Yuriy

    0
  • Avatar
    Ron W Lah

    Thanks for the response Yuriy!

    I see that Format Painter tool works but only for single cases, but it is not useful for global use where there is much formatting. My specific situation is that I have output with hundreds of pages, and even thousands of text frames produced by the "Exact text" output. Below is a typical page of a scanned old book I am currently working on, multiple columns, font changes, variation in formatting across each page. 

    I want to highlight the text in the entire document across all pages, use the Text Highlight Color tool to make all text highlighting "No color".  Is there some way to actually do this?  If I "clear formatting", I lose all my formatting, not just the unwanted highlighting. 

    Looking forward to any help on this. 

    Ron

    0
  • Avatar
    Margarita Nefedova

    Hello Ron,


    I created a support ticket regarding this situation, and we will continue the investigation there. Please await a reply from our Support team.


    Kind regards,
    Margarita

    0
  • Avatar
    William Owen Burgess

    I'm having the same issue… Three years after you all. I found a (not great) work around. You can select all the text in your document, create a new document, and paste the text into the new document. Choose "merge formatting" and the resulting text should not have the annoying highlights (I do admit that I like using them to find errors) left in it. I did find that it destroyed my heading structure, but thankfully it left my numbered lists and in-document links intact.

    0
  • Avatar
    Maiia Chenchyk

    Hi William,

    Thank you for sharing your workaround! I wanted to suggest another method for removing the highlights of low-confidence characters in an exported Microsoft Word document. You can use the Shading option to eliminate the highlighted text. Simply select the highlighted text in the Word document and choose Shading > No Color. Please note that this option cannot be applied to the entire document at once, you'll need to select the highlighted text in the paragraph manually.

    Additionally, you can disable the "Highlight low-confidence characters" feature in the FineReader PDF settings by navigating to the Tools menu > Options… > Format Settings > DOC(X)/RTF/ODT and turning off this option. With this adjustment, the highlighted symbols will still appear in the right pane Text of the OCR Editor, but they won't show up in the exported file.

    Kind regards,
    Maiia

    0
  • Avatar
    Ron W Lah

    I think the best workaround that I found since my original posting was to open the DOCX file in either Google Docs or in LibreOffice, both of which allow one to select the entire document and remove all highlighting/highlighting, no restrictions as in MS-Word.     

    0

Please sign in to leave a comment.