Hi, On Thu, Aug 25, 2011 at 10:21 AM, mavdewan <[email protected]> wrote: > This doesn't help out much. Any idea if i can get the proper html output from > the document? > > This is because the xhtml doesn't extract the tabular formats etc...
Yep, as I said, the XHTML output from Tika is not designed to preserve the full formatting of the original document. For a more accurate document preview feature you'll need to look for solutions beyond Tika. BR, Jukka Zitting
