[
https://issues.apache.org/jira/browse/NUTCH-657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13066426#comment-13066426
]
Lewis John McGibbney commented on NUTCH-657:
--------------------------------------------
I have been unsuccessful in submitting a patch for a file name change as oppose
to content changes within the file... any pointers please? I am not familiar
with submitting patches for file name changes.
Yes Markus, non of these files exist within trunk... strange. From doing some
background reading into the classes I can see that two authors are Sami Siren
and Jerome Charron. Is there anyone on board that has experience working with
the language identifier code? This is really the first time I have looked over
it...
> Estonian N-gram profile has wrong name
> --------------------------------------
>
> Key: NUTCH-657
> URL: https://issues.apache.org/jira/browse/NUTCH-657
> Project: Nutch
> Issue Type: Bug
> Affects Versions: 0.8.1, 0.9.0
> Reporter: Jonathan Young
> Priority: Trivial
>
> The Nutch language identifier plugin contains an ngram profile, ee.ngp, in
> src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang . "ee"
> is the ISO-3166-1-alpha-2 code for Estonia (see
> http://www.iso.org/iso/country_codes/iso_3166_code_lists/english_country_names_and_code_elements.htm),
> but it is the ISO-639-2 code for Ewe (see
> http://www.loc.gov/standards/iso639-2/php/English_list.php). "et" is the
> ISO-639-2 code for Estonian, and the language profile in ee.ngp is clearly
> Estonian.
> Proposed solution: rename ee.ngp to et.ngp .
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira