[ https://issues.apache.org/jira/browse/NUTCH-496?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12501266 ]
Sami Siren commented on NUTCH-496: ---------------------------------- I believe the problem is even more severe. Now several threads share the NgramProfile what is used to identify a piece of text, if parllel threads have access to same object the reults are more or less random. This could be fixed by changing the NGramProfile (what currently is a field "suspect" in LanguageIdentifier) to be a thread local. > ConcurrentModificationException can be thrown when getSorted() is called. > ------------------------------------------------------------------------- > > Key: NUTCH-496 > URL: https://issues.apache.org/jira/browse/NUTCH-496 > Project: Nutch > Issue Type: Bug > Components: fetcher > Affects Versions: 0.9.0 > Environment: Nutch application, during fetch. > Reporter: Briggs > Attachments: language_analyzer_ngram.patch > > > NGramProfile (within the org.apache.nutch.analysis.lang) package is not > thread-safe due to a ConcurrentModificationException that can occur if during > iteration of the resultant List from getSorted() and another call to > getSorted() is invoked from within another thread. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-developers mailing list Nutch-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nutch-developers