Converting PDF to XML: getting character coordinates instead of word coordinates in the XML

Hi,

I'm trying to convert my PDF to XML, and it's working great, but it gives me an XML with the charParams data (per-character coordinates).

How do I get the same data word by word rather than character by character?

Comments

6 comments

  • Denis Gusak

    Hi!

    Try to set XMLExportParams::WriteCharAttributes = XMLCharAttributesEnum.XCA_None

    Please check [Developer's Help → API Reference → Parameter Objects → Export Parameters → XMLExportParams] for more information.

  • sid9394

    Try to set XMLExportParams::WriteCharAttributes = XMLCharAttributesEnum.XCA_None

    Where do I make this change?

    I am coding in Python.

  • Aleksandra Zendrikova

    Hi!

    Sorry, the previous answer refers to another product (FineReader Engine).

    In Cloud OCR SDK, export to XML is performed in a different mode (IXMLExportParams::WriteCharAttributes = XCA_Basic). This parameter cannot be changed through Cloud OCR SDK.

  • sid9394

    Oh alright

    Thank you

    Edit: I also have FineReader 14 Enterprise. Can it be done through that?

  • Aleksandra Zendrikova

    No, only SDK products support export to XML.
  • joshi.pankaj112

    I don't understand where to make this change. Could you please explain again? I am facing the same problem.
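Since the thread says the export mode cannot be changed in Cloud OCR SDK, one possible workaround is to rebuild word boxes from the character data on the client side. Below is a minimal Python sketch, not an official method. It assumes that each character arrives as a charParams element carrying l, t, r, b pixel attributes, that characters are grouped under line elements, and that a whitespace character separates words. The sample XML and its namespace are made up for illustration; check these assumptions against your own output file.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample shaped like the assumed export; the namespace is a placeholder.
SAMPLE = """<document xmlns="http://example.com/hypothetical-schema">
  <page><block><text><par>
    <line l="10" t="20" r="64" b="41"><formatting>
      <charParams l="10" t="20" r="18" b="40">H</charParams>
      <charParams l="19" t="22" r="26" b="40">i</charParams>
      <charParams l="27" t="20" r="34" b="40"> </charParams>
      <charParams l="35" t="21" r="44" b="41">y</charParams>
      <charParams l="45" t="21" r="54" b="40">o</charParams>
      <charParams l="55" t="21" r="64" b="40">u</charParams>
    </formatting></line>
  </par></text></block></page>
</document>"""


def local_name(tag):
    """Strip any '{namespace}' prefix from an element tag."""
    return tag.rsplit("}", 1)[-1]


def words_from_line(line_elem):
    """Group the characters of one line into (text, l, t, r, b) tuples."""
    words, chars = [], []

    def flush():
        # Merge the collected characters into one word with a union bounding box.
        if chars:
            text = "".join(ch for ch, _ in chars)
            boxes = [box for _, box in chars]
            words.append((
                text,
                min(b[0] for b in boxes), min(b[1] for b in boxes),
                max(b[2] for b in boxes), max(b[3] for b in boxes),
            ))
            chars.clear()

    for elem in line_elem.iter():
        if local_name(elem.tag) != "charParams":
            continue
        ch = elem.text or ""
        if not ch.strip():  # whitespace (or empty) character ends the current word
            flush()
            continue
        box = tuple(int(elem.get(key)) for key in ("l", "t", "r", "b"))
        chars.append((ch, box))
    flush()  # the end of a line also ends the current word
    return words


def words_from_xml(xml_text):
    """Return word tuples for every line element in the document."""
    root = ET.fromstring(xml_text)
    result = []
    for elem in root.iter():
        if local_name(elem.tag) == "line":
            result.extend(words_from_line(elem))
    return result


if __name__ == "__main__":
    for word in words_from_xml(SAMPLE):
        print(word)
```

To use it on a real file, read the file contents and pass them to words_from_xml. Words hyphenated across two lines come out as two separate words, because each line is processed on its own.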