[HACKERS] Dictionary chaining and stop words

2007-08-29 Thread Heikki Linnakangas
It's nice to be able to chain tsearch dictionaries, but I find that it's not as flexible as it should be. Currently we have these dictionaries built-in: dict_simple - lowercases and checks against stop word list, accepts everything not in stop word list dict_synonym - replaces with synonym, if

Re: [HACKERS] Dictionary chaining and stop words

2007-08-29 Thread Tom Lane
Heikki Linnakangas [EMAIL PROTECTED] writes: There's clearly need for transforming a word and passing on the transformed version to the next dictionary. dict_thesaurus does exactly that by supporting a subdictionary which is called before invoking the thesaurus, but it should be generic

Re: [HACKERS] Dictionary chaining and stop words

2007-08-29 Thread Oleg Bartunov
Heikki, we know about this ( I call it filtering), but we leave it for the future after we'll have everything in core. The more demonstrative example is well-known accent-removal problem. I used to recommend to preprocess string before tsearch2, but there is a problem with headline() when this