
Is there a way to eliminate separate text boxes completely?

We switched from Adobe OCR to ABBYY FineReader because of its greater accuracy in character recognition.

However, we just discovered that something ABBYY does when creating Word and RTF documents makes it incompatible with our workflow.

We have many archival 'documents of documents' that need to be OCR'd and separated out into discrete documents.

Adobe OCR emits contiguous text, and only creates boxes for images.

ABBYY, even though I deselected the header and footer options in the OCR and formatting settings, insists on creating separate boxes for page elements, including headers and footers.

Unless I can override this behavior, we won't be able to use the software, because it makes selecting the document text we need to separate out far too painful.

Any ideas or suggestions are appreciated.

 


Comments


  • Victoria Dvornikova

    Hi Randall,

    If you save the recognized text with the Editable copy formatting (an option on the main toolbar in the OCR Editor, or under Tools > Options > Format settings > Document layout), the text areas shouldn't be placed in boxes. To double-check, I'll create a support ticket so that a Customer Support agent can analyze your exact file and provide additional recommendations.

  • Randall Perry

    'Editable Copy' was the default.

    When I switched it to 'Formatted Text' all the boxes (except picture boxes) went away.

    This works.

