[ http://issues.apache.org/jira/browse/NUTCH-60?page=all ]
Jerome Charron updated NUTCH-60:
--------------------------------
Attachment: NUTCH-60-050607.patch
Here it is: the final (?) patch. It improves performance by around 25% and
increases the identification precision. More details are available at
http://wiki.apache.org/nutch/LanguageIdentifierBenchs
> Bad language identifier plugin performances
> -------------------------------------------
>
> Key: NUTCH-60
> URL: http://issues.apache.org/jira/browse/NUTCH-60
> Project: Nutch
> Type: Improvement
> Components: indexer
> Reporter: Jerome Charron
> Priority: Minor
> Attachments: NUTCH-60-050526.patch, NUTCH-60-050605.patch,
> NUTCH-60-050607.patch
>
> As reported by Stefan Groschupf
> (http://www.mail-archive.com/[email protected]/msg04090.html)
> the language identifier plugin consumes a lot of processing time.
> Some optimizations and/or configuration options are required.