Hi,

On Thu, Apr 29, 2010 at 5:26 PM, Jawad Bokhari <[email protected]> wrote:
> Caused by: java.nio.charset.IllegalCharsetNameException:

It looks like the HTML documents you have use some character encoding
that's not supported by the underlying Java platform.

Can you file a bug about this in
https://issues.apache.org/jira/browse/TIKA for the Tika project that
Jackrabbit nowadays uses for full text extraction? It would be great
if you could also attach a troublesome HTML file to the bug report.

BR,

Jukka Zitting

Reply via email to