david.stu...@progressivealliance.co.uk
Thu, 03 Dec 2009 12:04:24 -0800
K I added the advice from Alex with still no luck BodyContentHandler handler = new BodyContentHandler(); AutoDetectParser parser = new AutoDetectParser(); // System.out.println(input.toString()); ParseContext context = new ParseContext(); context.set(Parser.class, parser); parser.parse(input, handler, metadata, context)
I compile is javac TikaParseHtml.java then when I run the command java -classpath [path to jar dir]/tika-core-0.5.jar TikaParseHtml i.html should I add other tika jars? Regards On 03 December 2009 at 20:13 Jukka Zitting <jukka.zitt...@gmail.com> wrote: > Hi, > > On Thu, Dec 3, 2009 at 8:00 PM, david.stu...@progressivealliance.co.uk > <david.stu...@progressivealliance.co.uk> wrote: > > I am trying to implement a really simple example of the tika html to xhtml > > parser here is what I have so far. I have tried two different > > implementations but neiter seem to produce any results. > > The most likely reason is that you don't have all the required > libraries in your classpath. > > BR, > > Jukka Zitting