How to identify a page ending in a Word document using ABBYY FineReader

Is it possible in ABBYY FineReader to detect where a page ends in a Word document and mark it with additional text such as PAGEEND_<>?

Comments

3 comments

  • Daria Zvereva

    Hello!

    Please specify the following additional information:

    1. The product you are using

    2. Your recognition scenario and processing settings

    If you are using FineReader Engine and exporting to TXT format, you can use the UsePageBreaks property of the TextExportParams object. Please see the article in the Developer's Help: API Reference → Parameter Object → Export Parameters → TextExportParams.

    I hope this information is useful!

  • Viji

    Hi Daria,

    I am using the Cloud OCR application.

    Below is a piece of my code:

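        # Assumes the third-party "ABBYY" Python wrapper for Cloud OCR SDK.
        from ABBYY import CloudOCR

        def process(self, path, export_format, applicationid, application_password):
            ocr_engine = CloudOCR(application_id=applicationid, password=application_password)
            output_paths = []
            # Open the input in a with-block so it is closed after processing.
            with open(path, 'rb') as pdf:
                result = ocr_engine.process_and_download({pdf.name: pdf}, exportFormat=export_format, language='English')
            # result maps each export format to a file-like object holding the output.
            for fmt, content in result.items():
                # Derive the output name from the input name (output_path was undefined before).
                output_path = '{}.{}'.format(path.rsplit('.', 1)[0], fmt)
                with open(output_path, 'wb') as output_file:
                    output_file.write(content.read())
                output_paths.append(output_path)
            # Return the written paths rather than an already closed file handle.
            return output_paths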

    Kindly let me know if there is any way to insert page breaks.

  • Oksana Serdyuk

    ABBYY Cloud OCR SDK has no features for changing or modifying the recognition result before export; you receive ready output in the specified export format. ABBYY FineReader Engine does provide such capabilities.
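
Since the Cloud OCR SDK returns finished output, any page markers would have to be added on the client after download. The following is only a minimal sketch of that idea. It assumes the downloaded plain text separates pages with a form feed character ("\f"), which is not confirmed here for Cloud OCR SDK output, and the helper name mark_page_ends is hypothetical.

```python
def mark_page_ends(text, separator="\f", marker="PAGEEND_{n}"):
    """Replace each page separator with a numbered end-of-page marker.

    Assumption: pages in `text` are delimited by `separator` (form feed by
    default). Check the actual export output before relying on this.
    """
    pages = text.split(separator)
    # A trailing separator leaves an empty last chunk; drop it so it is
    # not counted as an extra page.
    if pages and pages[-1] == "":
        pages.pop()
    marked = []
    for n, page in enumerate(pages, start=1):
        # End every page with its own marker line.
        marked.append(page.rstrip("\n") + "\n" + marker.format(n=n) + "\n")
    return "".join(marked)
```

For example, the bytes downloaded for a text export could be decoded and passed through mark_page_ends before being written to disk. If the output uses a different page delimiter, pass it as the separator argument.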
