[
https://issues.apache.org/jira/browse/LUCENE-6875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14985724#comment-14985724
]
Nikola Smolenski commented on LUCENE-6875:
------------------------------------------
I'm not sure what do you mean by "normalized". There are the two alphabets, and
this is the conversion between them. This is the common conversion, not
something I came up with. Regarding the letters you mentioned, {{ж}} is
transliterated as {{ž}}, but {{џ}} is transliterated as {{dž}}.
> New Serbian Filter
> ------------------
>
> Key: LUCENE-6875
> URL: https://issues.apache.org/jira/browse/LUCENE-6875
> Project: Lucene - Core
> Issue Type: Improvement
> Components: modules/analysis
> Reporter: Nikola Smolenski
> Priority: Minor
> Attachments: Lucene-Serbian-Regular.patch
>
>
> This is a new Serbian filter that works with regular Latin text (the current
> filter works with "bald" Latin). I described in detail what does it do and
> why is it necessary at the wiki.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]