[HACKERS] Generalized edit function?

2011-02-26 Thread fork
Hi hackers, I am interested in extending Postgres with a generalized edit function like SAS's compged[1], which is basically levenshtein distance with transposes (ab - ba) and LOTS of different weights for certain ops (like insert a blank versus delete from the end versus insert a regular

Re: [HACKERS] Generalized edit function?

2011-02-26 Thread Josh Berkus
Fork, 1. Does anybody else care? I would love to see this in contrib, but if the chances are slim, then I would like to know that too. That really depends on how well it works, and how much code it is. It's way too early for anyone to have a viewpoint on this. For example, a few years ago

Re: [HACKERS] Generalized edit function?

2011-02-26 Thread Robert Haas
On Sat, Feb 26, 2011 at 4:19 PM, Josh Berkus j...@agliodbs.com wrote: Anyway, if it's ASCII-only, that's a guaranteed way to make sure it isn't taken seriously. Pre-9.1 levenshtein is ASCII-only, and I think some of the other stuff in contrib/fuzzystrmatch still is. We had to work pretty hard

Re: [HACKERS] Generalized edit function?

2011-02-26 Thread fork
Robert Haas robertmhaas at gmail.com writes: On Sat, Feb 26, 2011 at 4:19 PM, Josh Berkus josh at agliodbs.com wrote: Anyway, if it's ASCII-only, that's a guaranteed way to make sure it isn't taken seriously. Pre-9.1 levenshtein is ASCII-only, and I think some of the other stuff in

Re: [HACKERS] Generalized edit function?

2011-02-26 Thread Robert Haas
On Sat, Feb 26, 2011 at 7:40 PM, fork forkandw...@gmail.com wrote: Pre-9.1 levenshtein is ASCII-only, and I think some of the other stuff in contrib/fuzzystrmatch still is. I am only looking at 9.0.3 for levenshtein, so I don't have any thoughts yet on multi-byteness so far.   I will have