On Tue, 21 Nov 2017, Jim Idle wrote:
Following up on this, I will try cancelling my thread-based tasks after a preset time limit. That is only going to work if Tika and the underlying parsers respond correctly to thread interruption (InterruptedException). Has anyone had any success with that? I am mainly looking at Office, PDF and HTML right now. I will try it myself of course, but perhaps someone has already been down this path?

Have you tried ForkParser? Because it runs the parse in a separate JVM process, it would also protect you against other kinds of failure, such as OOM.
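
For comparison, the in-process time limit described in the quoted message could look roughly like the sketch below. It uses only java.util.concurrent; the class name TimedTask, the method runWithTimeout and the sleeping task are hypothetical stand-ins for a real Tika parse call, not Tika API. Note that cancel(true) only delivers an interrupt, so the worker stops only if the code it is running checks for interruption, which is exactly the open question about the parsers.

```java
import java.util.Optional;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class TimedTask {

    // Runs the task on a worker thread and waits at most limitMillis for it.
    // Returns an empty Optional if the limit is exceeded (or the task returns null).
    static <T> Optional<T> runWithTimeout(Callable<T> task, long limitMillis)
            throws InterruptedException, ExecutionException {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        Future<T> future = pool.submit(task);
        try {
            return Optional.ofNullable(future.get(limitMillis, TimeUnit.MILLISECONDS));
        } catch (TimeoutException e) {
            // Only requests an interrupt; a task that ignores interruption keeps running.
            future.cancel(true);
            return Optional.empty();
        } finally {
            // Also interrupts the worker and lets the executor's thread exit.
            pool.shutdownNow();
        }
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical stand-in for a slow parse: sleeps, and so honours interruption.
        Optional<String> slow = runWithTimeout(() -> {
            Thread.sleep(5_000);
            return "parsed";
        }, 200);
        // Hypothetical stand-in for a parse that finishes well inside the limit.
        Optional<String> fast = runWithTimeout(() -> "parsed", 200);

        System.out.println("slow task finished in time: " + slow.isPresent());
        System.out.println("fast task finished in time: " + fast.isPresent());
    }
}
```

A parser stuck in a tight loop or in blocking native code would not notice the interrupt, and an OutOfMemoryError on the worker can still take down the whole JVM, which is where a separate process has the advantage.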

Nick
