Re: Is Tika (especially CharsetDetector) considered thread-safe?

2016-07-26 Thread c.leitin...@lirum.at
Hi, On Tue, Jul 26, 2016 at 03:07:59AM +, Allison, Timothy B. wrote: > If you could open an account on JIRA, it would be helpful for > discussion on this issue. Done. Thanks! Best regards, Christian

Re: Is Tika (especially CharsetDetector) considered thread-safe?

2016-07-26 Thread Christian
Hi, On Tue, Jul 26, 2016 at 02:17:13AM +, Allison, Timothy B. wrote: > Exactly what code are you using? How are you doing detection? I see that you already have something working on TIKA-2041. But for completeness' sake: Our code is a bit convoluted. It boils down to running the following p

RE: Is Tika (especially CharsetDetector) considered thread-safe?

2016-07-26 Thread Nick Burch
On Tue, 26 Jul 2016, Allison, Timothy B. wrote: Charset detection _should_ be thread safe. If you can help us track down the problem (unit test?), we need to fix this. On the whole, I think Tika is following the POI model on thread-safety as a minimum. That is, two threads working on two diff

RE: Is Tika (especially CharsetDetector) considered thread-safe?

2016-07-25 Thread Allison, Timothy B.
Christian, If you could open an account on JIRA, it would be helpful for discussion on this issue. Thank you, again. Best, Tim -Original Message- From: c.leitin...@lirum.at [mailto:c.leitin...@lirum.at] Sent: Monday, July 25, 2016 6:01 PM To: user@tik

RE: Is Tika (especially CharsetDetector) considered thread-safe?

2016-07-25 Thread Allison, Timothy B.
Y, we have a problem. Thank you for raising this. https://issues.apache.org/jira/browse/TIKA-2041

RE: Is Tika (especially CharsetDetector) considered thread-safe?

2016-07-25 Thread Allison, Timothy B.
s); String val = reader.getCharset().toString(); if (val == null) { return "NULL"; } else { return val; } } } -Original Message----- From: Allison, Timothy B. [mailto:talli...@mitre.org] Sent: Monday, July 2

RE: Is Tika (especially CharsetDetector) considered thread-safe?

2016-07-25 Thread Allison, Timothy B.
return val; } } } -Original Message- From: Allison, Timothy B. [mailto:talli...@mitre.org] Sent: Monday, July 25, 2016 9:21 PM To: user@tika.apache.org Subject: RE: Is Tika (especially CharsetDetector) considered thread-safe? Charset detection _should_

RE: Is Tika (especially CharsetDetector) considered thread-safe?

2016-07-25 Thread Allison, Timothy B.
Charset detection _should_ be thread safe. If you can help us track down the problem (unit test?), we need to fix this. Thank you for raising this. Best, Tim -Original Message- From: c.leitin...@lirum.at [mailto:c.leitin...@lirum.at] Sent: Monday, July 25, 2016 6:01 PM To: u