[
https://issues.apache.org/jira/browse/LUCENE-6366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14367629#comment-14367629
]
Robert Muir commented on LUCENE-6366:
-------------------------------------
Its not exactly obvious what they should be. keep in mind some languages don't
have ISO-639-1 or ISO-639-* at all (e.g. brazilian portuguese, sorani kurdish)
so adhering to that will just not work. Language tags (e.g. pt-BR) mean
packages would have to have underscores, hyphen is not even allowed in the
package name.
> "cz" (vs ISO langauge code "cs") for Czech analysis package?
> ------------------------------------------------------------
>
> Key: LUCENE-6366
> URL: https://issues.apache.org/jira/browse/LUCENE-6366
> Project: Lucene - Core
> Issue Type: Bug
> Reporter: Hoss Man
>
> As noted by Eduard Moraru on the solr-user mailing list, the sample
> fieldtypes Solr provides for dealing with the Czech use "cz" as a fieldType,
> dynamicField, and stopwords file naming convention -- but "cz" isn't the
> language code for Czech -- the correct langauge code is "cs".
> Solr's naming convention here comes directly from the lucene analysis package
> name for the Czech analysis classes: {{org.apache.lucene.analysis.cz;}} so
> before making any changes in the Solr sample configs (SOLR-7267), we should
> probably clarify if/why the lucene package name is like this.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]