Hi everyone,

Have you ever run into a StringIndexOutOfBoundsException while parsing MS Word files?
When the .doc file is entirely in English, parsing works fine, even if the file is very large.
But if the file contains any Chinese characters, a StringIndexOutOfBoundsException is thrown, even when the file is no more than a page long.

Maybe this should be posted to the POI mailing list, but I guess someone here might know why this happens and how to fix it, since the Word parser is also part of Nutch.

Any help would be appreciated. Thanks in advance.


TKDD

