[ http://issues.apache.org/jira/browse/NUTCH-60?page=comments#action_12312863 ]
Jerome Charron commented on NUTCH-60:
-------------------------------------

Committers, don't apply these patches, there is a loss of precision on identification.
I have identified the problem and have just quantified it.
I'm currently working on a new patch version solving this issue.

> Bad language identifier plugin performances
> -------------------------------------------
>
>          Key: NUTCH-60
>          URL: http://issues.apache.org/jira/browse/NUTCH-60
>      Project: Nutch
>         Type: Improvement
>   Components: indexer
>     Reporter: Jerome Charron
>     Priority: Minor
>  Attachments: NUTCH-60-050526.patch, NUTCH-60-050605.patch
>
> As reported by Stefan Groschupf
> (http://www.mail-archive.com/[email protected]/msg04090.html)
> the language identifier plugin consumes a lot of processing time.
> Some optimizations and/or configuration options are required.

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira
