Community

PDF to XML

Hi Team,

I am trying to convert pdf file to xml.

getting output as xml but not in well format.

can you please help me.

Code Here :

ocr_engine = CloudOCR(application_id='XX', password='XX')

pdf = open('file.pdf', 'rb')

result = ocr_engine.process_and_download(file, exportFormat='xml', language='English')

for format, content in result.items():

    with open('final_xml_file13.xml', 'wb') as output_file:

        output_file.write(content.read())

 

Attached output XML in short

 

0

Comments

2 comments

  • Avatar
    Bond James

    Hi Team,

    I am trying to convert pdf file to xml.

    getting output as xml but not in well format.

    can you please help me.

    Code Here :

    ocr_engine = CloudOCR(application_id='XX', password='XX')

    pdf = open('file.pdf', 'rb')

    result = ocr_engine.process_and_download(file, exportFormat='xml', language='English')

    for format, content in result.items():

        with open('final_xml_file13.xml', 'wb') as output_file:

            output_file.write(content.read())

     

    Attached output XML in short

     

    It is very little available on the net about PDF to XML, and usually its too hard to find some good way to convert the PDF to Excel. Normally you can find any of the software that can convert any of the formats into the PDF but from PDF into XML, you will have to search a lot on the net. I 've searched and I tried most of the result I got in Google for pdf to XML, and among them I found one resource to be worth read and use.

     

    0
    Comment actions Permalink
  • Avatar
    JonsonSmith

     You can use online PDF to XML converter.

    0
    Comment actions Permalink

Please sign in to leave a comment.