Re: Persian Implementation

Patrick Estarian Mon, 18 Jul 2011 20:38:50 -0700

Robert,

Thank you very much for the reply.


If I understand it correctly, you have a main project and a contrib section.
It is very important for us to have this Persian search work "correctly"
rather than efficient. Would this be a good candidate for the contrib
section if I wrote some codes for the Persian stemmer?

Thanks,
-Patrick



On Mon, Jul 18, 2011 at 6:49 PM, Robert Muir <rcm...@gmail.com> wrote:

> On Mon, Jul 18, 2011 at 6:24 PM, Patrick Estarian
> <patrick.estar...@gmail.com> wrote:
> > Hi,
> >
> > I am trying to get the Persian part of Lucene to work but apparently the
> > current implementation is just a simple version of sopt word tokenizer
> and
> > no stemmer, etc. I was trying to find the contact of the person who had
> done
> > this but couldn't find it any where in the code.
> >
>
> There is no stemmer intentionally, as my findings (and others) seem to
> correspond with this statement:
>
> Our various experiments clearly show that a stemming
> procedure decreases retrieval effectiveness when applied
> to the Persian language.
>
> http://portal.acm.org/citation.cfm?id=1674748
>
> But YMMV,
>
> --
> lucidimagination.com
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
> For additional commands, e-mail: dev-h...@lucene.apache.org
>
>

Re: Persian Implementation

Reply via email to