
Converting PDF to XML and getting character coordinates, not word coordinates, in the XML

Written by Permanently deleted user

March 15, 2019 04:38

Hi,

I'm trying to convert my PDF to XML. The conversion works, but the resulting XML contains per-character charParams data.

How do I get the same coordinate data per word instead of per character?

1bcdaa8c-8bd1-40b9-b48c-aa11004cba85_cmc-steel.xml


Comments

6 comments

Permanently deleted user

March 18, 2019 10:36
Hi!

Try to set XMLExportParams::WriteCharAttributes = XMLCharAttributesEnum.XCA_None

Please check [Developer's Help → API Reference → Parameter Objects → Export Parameters → XMLExportParams] for more information.

-1
Permanently deleted user

March 19, 2019 10:06
Try to set XMLExportParams::WriteCharAttributes = XMLCharAttributesEnum.XCA_None

Where do I make this change?

I am coding in Python.

0
Aleksandra Zendrikova

March 20, 2019 16:48
Hi!

Sorry, the previous answer refers to another product (FineReader Engine).

In Cloud OCR SDK, export to XML always runs in a fixed mode (IXMLExportParams::WriteCharAttributes = XCA_Basic). This parameter cannot be changed through Cloud OCR SDK.
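
Since the export mode is fixed, one possible workaround is to merge the character rectangles into word rectangles on the client side after downloading the XML. The Python sketch below is only an illustration under some assumptions that should be checked against the actual file: that each character is a charParams element with integer l, t, r and b attributes, that characters are grouped under line elements, and that a space between words is exported as its own charParams element. The helper names (local_name, merge_chars, words_in_line, extract_words) are made up for this example and are not part of any ABBYY API.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix so the code does not depend on the schema URL."""
    return tag.rsplit("}", 1)[-1]


def merge_chars(chars):
    """Combine a run of charParams elements into one word with a bounding box."""
    return {
        "text": "".join(c.text for c in chars),
        "l": min(int(c.get("l")) for c in chars),
        "t": min(int(c.get("t")) for c in chars),
        "r": max(int(c.get("r")) for c in chars),
        "b": max(int(c.get("b")) for c in chars),
    }


def words_in_line(line_elem):
    """Split the characters of one line element into words at whitespace characters."""
    words, current = [], []
    for elem in line_elem.iter():
        if local_name(elem.tag) != "charParams":
            continue
        if (elem.text or "").strip() == "":
            # Assumption: a whitespace charParams marks a word boundary.
            if current:
                words.append(merge_chars(current))
                current = []
        else:
            current.append(elem)
    if current:
        words.append(merge_chars(current))
    return words


def extract_words(root):
    """Return a list of word dictionaries for every line element under root."""
    result = []
    for elem in root.iter():
        if local_name(elem.tag) == "line":
            result.extend(words_in_line(elem))
    return result
```

For a downloaded result file, this could be called as extract_words(ET.parse("result.xml").getroot()), where result.xml is a placeholder file name. Words are never merged across lines, so a hyphenated word split over two lines comes back as two entries.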

0
Permanently deleted user

March 21, 2019 03:37
Oh, all right.

Thank you.

Edit: I also have FineReader 14 Enterprise. Can it be done with that?

0
Aleksandra Zendrikova

March 21, 2019 14:21
No, only SDK products support export to XML.

0
Permanently deleted user

January 07, 2020 10:43
I don't understand where this change has to be made. Could you please explain it again? I am facing the same problem.

0
