Hello Robert,
You are so right: plurals based on prefixes and suffixes are working.
Plurals based on an inserted و are not (باب and ابواب).
The few words I had tested were all of the insert type and not the
prefix/suffix type.
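
To illustrate why the two kinds of plural behave differently, here is a minimal sketch of affix-only ("light") stemming. It is an assumption-laden toy, not the code in Lucene's Arabic analyzer: the class name, the affix lists, and the minimum stem length are all hypothetical. Arabic letters are written as Unicode escapes so the file compiles under any source encoding.

```java
// Hypothetical illustration only; this is NOT Lucene's Arabic stemmer.
final class LightStemSketch {
    // Assumed affix lists: the definite article and three sound-plural endings.
    private static final String[] PREFIXES = { "\u0627\u0644" };          // al-
    private static final String[] SUFFIXES = {
        "\u0648\u0646",  // -un  (sound masculine plural, nominative)
        "\u064A\u0646",  // -in  (sound masculine plural, oblique)
        "\u0627\u062A"   // -at  (sound feminine plural)
    };
    private static final int MIN_STEM = 3; // assumed lower bound on stem length

    static String stem(String word) {
        String s = word;
        for (String p : PREFIXES) {
            if (s.startsWith(p) && s.length() - p.length() >= MIN_STEM) {
                s = s.substring(p.length());
                break;
            }
        }
        for (String x : SUFFIXES) {
            if (s.endsWith(x) && s.length() - x.length() >= MIN_STEM) {
                s = s.substring(0, s.length() - x.length());
                break;
            }
        }
        return s;
    }

    public static void main(String[] args) {
        String teacher = "\u0645\u0639\u0644\u0645";         // mu'allim
        String teachers = teacher + "\u0648\u0646";          // mu'allimun
        String door = "\u0628\u0627\u0628";                  // bab
        String doors = "\u0627\u0628\u0648\u0627\u0628";     // abwab (broken plural)

        // Suffix plural: stripping the ending recovers the singular.
        System.out.println(stem(teachers).equals(stem(teacher))); // true
        // Broken plural: the change is inside the word, so nothing is stripped.
        System.out.println(stem(doors).equals(stem(door)));       // false
    }
}
```

Because the sketch only looks at the edges of a word, a plural formed by changing letters inside the stem can never be mapped back to its singular this way; that would need a dictionary or a morphological analyzer.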
thank you :)
-walid
On Sun, 2009-08-02 at 15:08 -0400, Robert Muir wrote:
Walid, thanks for your feedback.
FYI, I created an issue with some minor improvements (such as the lam-lam
prefix) to the Arabic analyzer:
http://issues.apache.org/jira/browse/LUCENE-1758
I also tried to improve the stopwords list, but your Arabic is surely
much better than mine. If you
the fact is, plural (as an example) is not supported, and that is one of
the most common things that a person doing some search will expect to
Walid, I'm not sure this is true. Many plurals are supported
(certainly not exceptional cases or broken plurals).
This is no different than the other
http://issues.apache.org/jira/browse/LUCENE-1406
http://issues.apache.org/jira/browse/LUCENE-153
based on this, there are two options:
1- using the aramorph library
2- moving the code from trunk to the current release and using the
provided arabic analyzer
1- the library works very well in indexing, tokenizing, stemming and
everything, but causes memory leaks
2- the provided library has a very poor result compared to the aramorph
library
Hi All,
Does anybody have a Persian (Farsi) analyzer for the Lucene search engine? If
an Arabic one is available, I can customize it for Farsi/Persian.
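
One small piece of such a customization would probably be character folding, since Persian text commonly uses Farsi yeh (U+06CC) and keheh (U+06A9) where Arabic text uses yeh (U+064A) and kaf (U+0643). The sketch below is only an assumption about what that step might look like; the class and method names are hypothetical and it is not taken from any existing analyzer.

```java
// Hypothetical helper; not part of Lucene or Aramorph.
final class PersianFoldSketch {
    // Map Persian letter variants onto their Arabic counterparts so that
    // both spellings of a word end up as the same indexed term.
    static String fold(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            switch (c) {
                case '\u06CC': // Farsi yeh
                    out.append('\u064A'); // Arabic yeh
                    break;
                case '\u06A9': // keheh
                    out.append('\u0643'); // Arabic kaf
                    break;
                default:
                    out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String persianSpelling = "\u06A9\u062A\u0627\u0628\u06CC";
        String arabicSpelling = "\u0643\u062A\u0627\u0628\u064A";
        System.out.println(fold(persianSpelling).equals(arabicSpelling)); // true
    }
}
```

Folding of this kind only handles spelling variants; it does nothing about the grammatical differences between the two languages.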
Regards,
---
Amir Hosein Jadidi Nejad
Student Of Computer Science
Bu-Ali Sina
Hi,
Amir Hosein Jadidi Nejad wrote:
if arabic is available, i can customize it to farsi/persian.
There is indeed an Arabic analyzer:
http://savannah.nongnu.org/projects/aramorph
... but, given the structural differences between the two languages, I
doubt it can be very helpful