Hi Team,
I have a requirement of converting a PDF to XML i.e contents of PDF to XML
I have tried using TaggedPdfReaderTool but I get the following exception
Exception in thread "main" java.io.IOException: No StructTreeRoot found, this
probably isn't a tagged PDF document!
I understand that PDF is unstructured(no tags to identify headings, title,
table, image etc) and so it cannot covert document to xml.
Please confirm my understanding.
I have tried using PDFReader class which helps me get entire content of pdf but
I am not able to find out which is the heading , title, table in the pdf
content. My requirement is to create an XML doc with heading in pdf as tags and
content in pdf as tag-element contents.
Please let me know how this can be achieved using iText. Its urgent.
Thanks in advance
Regards,
Neha
DISCLAIMER
==========
This e-mail may contain privileged and confidential information which is the
property of Persistent Systems Ltd. It is intended only for the use of the
individual or entity to which it is addressed. If you are not the intended
recipient, you are not authorized to read, retain, copy, print, distribute or
use this message. If you have received this communication in error, please
notify the sender and delete all copies of this message. Persistent Systems
Ltd. does not accept any liability for virus infected mails.
------------------------------------------------------------------------------
Rapidly troubleshoot problems before they affect your business. Most IT
organizations don't have a clear picture of how application performance
affects their revenue. With AppDynamics, you get 100% visibility into your
Java,.NET, & PHP application. Start your 15-day FREE TRIAL of AppDynamics Pro!
http://pubads.g.doubleclick.net/gampad/clk?id=84349351&iu=/4140/ostg.clktrk
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions
iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples:
http://itextpdf.com/themes/keywords.php