[ 
https://issues.apache.org/jira/browse/LUCENE-6366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14367629#comment-14367629
 ] 

Robert Muir commented on LUCENE-6366:
-------------------------------------

Its not exactly obvious what they should be. keep in mind some languages don't 
have ISO-639-1 or ISO-639-* at all (e.g. brazilian portuguese, sorani kurdish) 
so adhering to that will just not work. Language tags (e.g. pt-BR) mean 
packages would have to have underscores, hyphen is not even allowed in the 
package name.




> "cz" (vs ISO langauge code "cs") for Czech analysis package?
> ------------------------------------------------------------
>
>                 Key: LUCENE-6366
>                 URL: https://issues.apache.org/jira/browse/LUCENE-6366
>             Project: Lucene - Core
>          Issue Type: Bug
>            Reporter: Hoss Man
>
> As noted by Eduard Moraru on the solr-user mailing list, the sample 
> fieldtypes Solr provides for dealing with the Czech use "cz" as a fieldType, 
> dynamicField, and stopwords file naming convention -- but "cz" isn't the 
> language code for Czech -- the correct langauge code is "cs".
> Solr's naming convention here comes directly from the lucene analysis package 
> name for the Czech analysis classes: {{org.apache.lucene.analysis.cz;}} so 
> before making any changes in the Solr sample configs (SOLR-7267), we should 
> probably clarify if/why the lucene package name is like this.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to