Re: Thread-safe versions of some of the tools

Thilo Goetz Wed, 11 Jan 2017 05:38:33 -0800

Correct me if I'm wrong, but that approach only works if you control thethread creation yourself. In my case, for example, I was using Scala'sparallel collection API, and had no control over the threading. I willusually want to create one service that does tokenization or POS taggingor whatever, which can be accessed by many threads. I don't want to haveto mess around with an object pool, or thread locals, or anything likethat. Especially since there is really no good reason IMHO. You couldvery easily just return the probabilities together with the spans, andwhoever doesn't need them can ignore them. Or have two methods, one withprobabilities, one without. Maybe it's just where I'm coming from, but Ifail to see the advantages of the current approach.


--Thilo



On 11/01/2017 13:58, Joern Kottmann wrote:

Hello Thilo,

I am interested in your opinion about how this is done currently.
We say: "Share the model between threads and create one instance of the
component per thread".

Wouldn't that work well in your use case?

Jörn



On Wed, Jan 11, 2017 at 11:05 AM, Thilo Goetz <[email protected]> wrote:

Hi,

in a recent project, I was using SentenceDetectorME, TokenizerME and
POSTaggerME. It turns out that none of those is thread safe. This is
because the classification probabilities for the last tag() call (for
example) are stored in a member variable and can be retrieved by a separate
API call.

I'm planning to build thread safe versions for myself, and I'd be happy to
contribute a patch if there is interest. This could be done as a
conservative extension with an additional method such as tagReentrant,
where the old API calls would continue to work as before and would still
not be thread safe. Alternatively, one could remodel the API so that
everything was thread safe, but that would break backwards compatibility.

Final question: if I do this for the classes mentioned above, are there
other tools that should be made thread safe while we're at it?

Opinions?

--Thilo

Re: Thread-safe versions of some of the tools

Reply via email to