tika-app (the gui) gives me back the xhtml just fine.. not sure what is going on here.. maybe it is not stored properly in the documentfragment upon parsing?
-- View this message in context: http://lucene.472066.n3.nabble.com/Cached-page-like-google-with-hits-highlighted-tp4001374p4001449.html Sent from the Nutch - User mailing list archive at Nabble.com.