Community

PDF to XML

Hi Team,

I am trying to convert pdf file to xml.

getting output as xml but not in well format.

can you please help me.

Code Here :

ocr_engine = CloudOCR(application_id='XX', password='XX')

pdf = open('file.pdf', 'rb')

result = ocr_engine.process_and_download(file, exportFormat='xml', language='English')

for format, content in result.items():

    with open('final_xml_file13.xml', 'wb') as output_file:

        output_file.write(content.read())

 

Attached output XML in short

 

Was this article helpful?

0 out of 0 found this helpful

Comments

2 comments

  • Avatar
    Permanently deleted user

    Hi Team,

    I am trying to convert pdf file to xml.

    getting output as xml but not in well format.

    can you please help me.

    Code Here :

    ocr_engine = CloudOCR(application_id='XX', password='XX')

    pdf = open('file.pdf', 'rb')

    result = ocr_engine.process_and_download(file, exportFormat='xml', language='English')

    for format, content in result.items():

        with open('final_xml_file13.xml', 'wb') as output_file:

            output_file.write(content.read())

     

    Attached output XML in short

     

    It is very little available on the net about PDF to XML, and usually its too hard to find some good way to convert the PDF to Excel. Normally you can find any of the software that can convert any of the formats into the PDF but from PDF into XML, you will have to search a lot on the net. I 've searched and I tried most of the result I got in Google for pdf to XML, and among them I found one resource to be worth read and use.

     

    0
  • Avatar
    Permanently deleted user

     You can use online PDF to XML converter.

    0

Please sign in to leave a comment.