Hi all,

I have a lot of documents and PDF's in romanian language and tried a way to index them. I have tried a lot of desktop search tools : beagle , strigi , recoll , pinot , tracker. The final choise is : RECOLL , because it has some features the are suitable for indexing and searching romanian documents, THE SEARCH IS IMMUNE TO ACCENTED CHARACTERS and IT HAS ROMANIAN LANGUAGE STEMMING.

The second in preferences is TRACKER , and it would be probably the first choice IF it can do the same things.

So my 2 question are:

1. is there a way of configuring tracker to index/search romanian words regardless of their accented variants ? ( Example, not sure that specials chars it will pass the mailer , ţeavă or teava , şuncă or sunca , învăţat or invatat )


2. what is needed in order to add the romanian language for stemming ? http://snowball.tartarus.org/algorithms/romanian/stemmer.html <http://snowball.tartarus.org/algorithms/romanian/stemmer.html> helps somehow ?

Best regards,
Constantin Teodorescu
Romania



begin:vcard
fn:Constantin Teodorescu
n:Teodorescu;Constantin
email;internet:[EMAIL PROTECTED]
tel;home:0788-256.456
tel;cell:0722-673.571
x-mozilla-html:TRUE
version:2.1
end:vcard

_______________________________________________
tracker-list mailing list
[email protected]
http://mail.gnome.org/mailman/listinfo/tracker-list

Reply via email to