Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by JeromeCharron: http://wiki.apache.org/nutch/LanguageIdentifierBenchs The comment on the change is: Add some explanation ------------------------------------------------------------------------------ + == Introduction == + + This page provides some benchmarks of the LanguageIdentifierPlugin between the ''old'' (previous) version and the ''new'' (configurable) version (see NewLanguageIdentifier for more details). + + These data can be usefull if you want to contribute in increasing the LanguageIdentifierPlugin performances, or if you want to tune precisely your ["Nutch"] configuration. + + == Data set == + + These benchmarks were produced by testing the LanguageIdentifierPlugin on a set of 492 french files representing a total size of 171,3 Mo. These files were extracted from the ''[http://people.csail.mit.edu/koehn/publications/europarl/ European Parliament Proceedings Parallel Corpus 1996-2003 Release v2]''. + + == Raw results == + ||'''Data Size'''||'''P.V.'''||'''[1-4]'''||'''[2-2]'''||'''[3-3]'''||'''[4-4]'''||'''[2-3]'''||'''[3-4]'''||'''[2-4]'''|| ||'''128'''||8314||5124||1627||2245||1393||3073||2996||4243|| ||'''256'''||7660||4950||1408||1604||1425||3033||2809||3983|| ------------------------------------------------------- This SF.Net email is sponsored by Yahoo. Introducing Yahoo! Search Developer Network - Create apps using Yahoo! Search APIs Find out how you can build Yahoo! directly into your own Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005 _______________________________________________ Nutch-cvs mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-cvs
