[
https://issues.apache.org/jira/browse/LUCENE-6875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14985944#comment-14985944
]
Dawid Weiss commented on LUCENE-6875:
-------------------------------------
By normalized I meant some kind of standard that defines this transliteration.
I'm (fairly) confident there is a transliteration guide for doing cyrillic ->
Polish (latin and diacritics), but I'd have to look for exact reference.
I was just curious, it wasn't meant to be a negative remark :)
bq. Regarding the letters you mentioned, ж is transliterated as ž, but џ is
transliterated as dž.
Oops, sorry -- I only looked at the patch and the aligned conversion strings
being compared, my bad.
> New Serbian Filter
> ------------------
>
> Key: LUCENE-6875
> URL: https://issues.apache.org/jira/browse/LUCENE-6875
> Project: Lucene - Core
> Issue Type: Improvement
> Components: modules/analysis
> Reporter: Nikola Smolenski
> Priority: Minor
> Attachments: Lucene-Serbian-Regular.patch
>
>
> This is a new Serbian filter that works with regular Latin text (the current
> filter works with "bald" Latin). I described in detail what does it do and
> why is it necessary at the wiki.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]