ICU looks promising: Μῆνιν ἄειδε, θεὰ, Πηληϊάδεω Ἀχιλλῆος ->
1.μηνιν 2.αειδε 3.θεα 4.πηληιαδεω 5.αχιλληοσ -----Original Message----- From: Alexandre Rafalovitch [mailto:arafa...@gmail.com] Sent: Friday, November 21, 2014 3:08 PM To: dev@lucene.apache.org Subject: Re: Lucene ancient greek normalization Are you sure that's not something that's already addressed by the ICU Filter? http://www.solr-start.com/javadoc/solr-lucene/org/apache/lucene/analysis/icu/ICUTransformFilterFactory.html If you follow the links to what's possible, the page talks about Greek, though not ancient: http://userguide.icu-project.org/transforms/general#TOC-Greek There was also some discussion on: https://issues.apache.org/jira/browse/LUCENE-1343 Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://www.linkedin.com/groups?gid=6713853 On 21 November 2014 14:14, paolo anghileri <paolo.anghil...@codegeneration.it> wrote: > For development purposes I need the ability in lucene to normalize ancient > greek characters for al the cases of grammatical details such as accents, > diacritics and so on. > > My need is to retrieve ancient greek words with accents and other > grammatical details by the input of the string without accents. > > For example the input of οργανον (organon) should to retrieve also Ὄργανον, > > > I am not a lucene commiter and I a new to this so my question is about the > best practice to implement this in Lucene, and possibile submit a commit > proposal to Lucene A project management committee. > > I have made some searches and found this file in Lucene-soir: > > > It contains normalization for some chars. > My thought would be to add extra normalization here, including all unicode > ancient greek chars with all grammatical details. > I already have all the unicode values for that chars so It should not be > difficult for me to include them > > If my understanding is correct, this should add to lucene the features > described above. > > > As I am new to this, my needs are: > > To be sure that this is the correct place in Lucene for doing normalization > How to post commit proposal > > > Any help appreciated > > Kind regards > > Paolo --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org