Re: [HACKERS] Snowball and ispell in tsearch2

2006-06-09 Thread Teodor Sigaev
I'll place a contrib module which will make all Snowball stemmers. Right now I'm working on supporting OpenOffice's dictionaries in tsearch2, so it will be simple to add it to the packaging system. Done, http://archives.postgresql.org/pgsql-committers/2006-06/msg00112.php -- Teodor Sigaev

Re: [HACKERS] Snowball and ispell in tsearch2

2006-06-08 Thread Teodor Sigaev
Maybe putting it on pgFoundry? Hmm, that's an option. We can create a project 'tsearch2_dict' and there I'll place a contrib module which will make all Snowball stemmers. Right now I'm working on supporting OpenOffice's dictionaries in tsearch2, so it will be simple to add it to the packaging system.

[HACKERS] Snowball and ispell in tsearch2

2006-06-07 Thread Teodor Sigaev
We got a lot of requests about including stemmers and ispell dictionaries for all available languages in tsearch2. I understand that this would bring tsearch2 closer to the end user. But the sources of the snowball stemmers are about 800kb, and each ispell dictionary will take about 0.5-2M. All sizes are sized with

Re: [HACKERS] Snowball and ispell in tsearch2

2006-06-07 Thread Markus Schiltknecht
Hello Teodor, I've just recently implemented an advanced full-text search function on top of tsearch2. Searching through the manuals and websites to get the snowball stemmer and compile my own module took me way too long. I'd rather go fetch a cup of coffee during a 30 minute download...

Re: [HACKERS] Snowball and ispell in tsearch2

2006-06-07 Thread Teodor Sigaev
800kb, each ispell dictionary will take about 0.5-2M. All sizes are Sorry, withOUT compression... -- Teodor Sigaev WWW: http://www.sigaev.ru/

Re: [HACKERS] Snowball and ispell in tsearch2

2006-06-07 Thread John Jawed
OpenFTS ebuild: http://bugs.gentoo.org/show_bug.cgi?id=135859 It has a USE flag for the snowball stemmer. I can take care of packaging for Gentoo if it will free up time for you to work on other distros. John PS, upstream package size isn't, and shouldn't be, an issue; it should be left to the

Re: [HACKERS] Snowball and ispell in tsearch2

2006-06-07 Thread Christopher Kings-Lynne
We got a lot of requests about including stemmers and ispell dictionaries for all available languages in tsearch2. I understand that this would bring tsearch2 closer to the end user. But the sources of the snowball stemmers are about 800kb, and each ispell dictionary will take about 0.5-2M. All sizes are sized with

Re: [HACKERS] Snowball and ispell in tsearch2

2006-06-07 Thread Christopher Kings-Lynne
Perhaps we can put together the source code for all available language modules and provide scripts to fetch ispell data or to generate the snowball stemmers. A Debian package maintainer would have to fetch all the data to generate all language packages. Someone else might just want to

Re: [HACKERS] Snowball and ispell in tsearch2

2006-06-07 Thread Christopher Kings-Lynne
I'd be willing to help with such a project. I have experience with tsearch2 as well as with Gentoo and Debian packaging. I can't help with RPM, though. I could help with a FreeBSD package I suppose. Although I should probably finish up those damn GIN docs first :)