[
https://issues.apache.org/jira/browse/NUTCH-496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501266
]
Sami Siren commented on NUTCH-496:
----------------------------------
I believe the problem is even more severe. Now several threads share the
NgramProfile what is used to identify a piece of text, if parllel threads have
access to same object the reults are more or less random.
This could be fixed by changing the NGramProfile (what currently is a field
"suspect" in LanguageIdentifier) to be a thread local.
> ConcurrentModificationException can be thrown when getSorted() is called.
> -------------------------------------------------------------------------
>
> Key: NUTCH-496
> URL: https://issues.apache.org/jira/browse/NUTCH-496
> Project: Nutch
> Issue Type: Bug
> Components: fetcher
> Affects Versions: 0.9.0
> Environment: Nutch application, during fetch.
> Reporter: Briggs
> Attachments: language_analyzer_ngram.patch
>
>
> NGramProfile (within the org.apache.nutch.analysis.lang) package is not
> thread-safe due to a ConcurrentModificationException that can occur if during
> iteration of the resultant List from getSorted() and another call to
> getSorted() is invoked from within another thread.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers