Daniel, I have been playing with PDFText2HTML I will have to add a lot of code to meet my requirements, but I don't think there is any other option at this time. I have been searching in internet for a while, the only think I found that works well was a C/C++ program called pdf2html.
thanks! On Fri, Apr 24, 2009 at 1:37 PM, Daniel Wilson < [email protected]> wrote: > have a look at the class PDFText2HTML > The description in the comments is: > " * Wrap stripped text in simple HTML, trying to form HTML paragraphs. > Paragraphs > * broken by pages, columns, or figures are not mended." > > As far as PDF -> fully formatted HTML ... I don't think so. > > Daniel Wilson > > 2009/4/24 César Fernando Henriques <[email protected]> > >> Hi guys, anyone knows if there are some Java code that use pdfbox that >> convert pdf files to html? >> >> >> thanks! >> > >
