Hi,

I was trying to parse HTML files and it throwed this error for one
particular HTML file. I guessed we are using tagsoup for parsing and
someone already fixed this in the tika code. (
https://github.com/jukka/tagsoup/commit/9cfe7b48745173faafa419f540538a0b6309b699
)

Can someone tell me if this revision is included in the tika that we have
with Nutch-2.x ? Should i use latest tika-dev to have this included and
change libraries in ivy.xml ?

Thank you,

-- 
Kiran Chitturi

Reply via email to