Output document as .txt with one .txt per page

I am using Recognition Server 4 for the first time and am creating a workflow for a shared folder that will receive multi-page .pdf and .djvu files as input. For each input file, I would like the output to consist of one .txt file per page of the input. For example, if an input file test.djvu has 100 pages, I would like the output to be 100 .txt files (test - 0001.txt, test - 0002.txt, etc.), each containing the recognized text for the corresponding page of test.djvu. In the Verification Station, though, I want to verify the 100-page .djvu as a single job rather than as 100 separate jobs. How can I do this in Recognition Server?

I used FineReader previously and, when saving a document as text (.txt), I could select "create a separate file for each page," so that a 100-page document would be saved as 100 .txt files.

Thanks very much.

Comments

  • Permanently deleted user

    Hello,

    Sorry, this feature is not supported.

    However, the “Tab of Output Format Settings Dialog Box” article in the RS4 Help describes some properties that may help, for example “Insert page break character (#12) to separate pages”.

  • Permanently deleted user

    Thank you for the quick reply. I used your suggestion to "insert page break character" and then, outside of Recognition Server, split the output file into one file per page.

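The splitting step described in the last comment could be sketched roughly as follows in Python. This is only an illustration, not the script that was actually used. It assumes that the "#12" page break character is the form feed (ASCII 12, "\x0c"), that Recognition Server writes one such character between pages, and that the output .txt is UTF-8 encoded (adjust the encoding to whatever the workflow is configured to produce). The function names split_pages and write_pages are hypothetical.

```python
from pathlib import Path

# Assumption: the "#12" page break character is the form feed, ASCII 12.
FORM_FEED = "\x0c"


def split_pages(text: str) -> list[str]:
    """Split recognized text into pages at each form feed character."""
    pages = text.split(FORM_FEED)
    # A page break after the last page leaves an empty final chunk; drop it.
    if pages and pages[-1].strip() == "":
        pages.pop()
    return pages


def write_pages(txt_path: Path, out_dir: Path, encoding: str = "utf-8") -> list[Path]:
    """Write one file per page, named like 'test - 0001.txt'."""
    text = txt_path.read_text(encoding=encoding)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for number, page in enumerate(split_pages(text), start=1):
        # Blank pages in the middle are kept as empty files so that
        # file numbers stay aligned with the page numbers of the input.
        target = out_dir / f"{txt_path.stem} - {number:04d}.txt"
        target.write_text(page, encoding=encoding)
        written.append(target)
    return written
```

For example, calling write_pages(Path("test.txt"), Path("pages")) on the single .txt produced for test.djvu would create pages/test - 0001.txt, pages/test - 0002.txt, and so on, while the document itself can still be verified as one job in the Verification Station.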
