Community

Converting pdf - xml and getting character not word coordinates in the xml

HI

So im trying to convert my pdf to xml and its working great but it gives me an xml with the charparams data.

how do i get the same data but word wise not character wise? 

 

                                                       

Was this article helpful?

0 out of 0 found this helpful

Comments

6 comments

  • Avatar
    Denis Gusak

    Hi!

    Try to set XMLExportParams::WriteCharAttributes = XMLCharAttributesEnum.XCA_None

    Please check [Developer's Help → API Reference → Parameter Objects → Export Parameters → XMLExportParams] for more information.

    -1
  • Avatar
    sid9394

    Try to set XMLExportParams::WriteCharAttributes = XMLCharAttributesEnum.XCA_None

    Where do i make this change?

    I am coding in python 

    0
  • Avatar
    Aleksandra Zendrikova

    Hi!

    Sorry, the previous answer refers to another product (FineReader Engine).

    In Cloud OCR SDK export to XML is performed in other mode (IXMLExportParams::WriteCharAttributes=XCA_Basic). This parameter cannot be changed by means of Cloud OCR SDK.

    0
  • Avatar
    sid9394

    Oh alright

    Thank you

    Edit: I also have FineReader 14 enterprise. Can it be done through that?

    0
  • Avatar
    Aleksandra Zendrikova
    No, only SDK products support export to XML.  
    0
  • Avatar
    joshi.pankaj112

    I am not getting where do you have make change. can you please explain me again? i am also facing same problem 

     

    0

Please sign in to leave a comment.