On Fri, Feb 17, 2012 at 3:48 AM, Heikki Linnakangas
<heikki.linnakan...@enterprisedb.com> wrote:
> Here's a wild idea: keep the class of each codepoint in a hash table.
> Initialize it with all codepoints up to 0xFFFF. After that, whenever a
> string contains a character that's not in the hash table yet, query the
> class of that character, and add it to the hash table. Then recompile the
> whole regex and restart the matching engine.
>
> Recompiling is expensive, but if you cache the results for the session, it
> would probably be acceptable.

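For illustration only, here is a minimal sketch of the scheme in the quoted text. It is not backend code. The class name, method names, and the recompile counter are all hypothetical, Python's unicodedata.category() stands in for "query the class of that character", and rebuilding a set of letter codepoints stands in for recompiling the regex.

```python
import unicodedata

PRELOAD_LIMIT = 0xFFFF  # upper bound taken from the quoted proposal


class CodepointClassCache:
    """Hypothetical session-level cache of codepoint classes."""

    def __init__(self, preload_limit=PRELOAD_LIMIT):
        # Initialize the table with every codepoint up to the limit.
        self.classes = {
            cp: unicodedata.category(chr(cp))
            for cp in range(preload_limit + 1)
        }
        self.recompiles = 0
        self._recompile()

    def _recompile(self):
        # Stand-in for "recompile the whole regex": rebuild the derived
        # structure (here, the set of letter codepoints) from the table.
        self.letters = frozenset(
            cp for cp, cat in self.classes.items() if cat.startswith("L")
        )
        self.recompiles += 1

    def ensure(self, text):
        # Look up and add any codepoints not seen yet. Recompile only if
        # something was actually added. Returns True when that happened.
        missing = {ord(ch) for ch in text if ord(ch) not in self.classes}
        if not missing:
            return False
        for cp in missing:
            self.classes[cp] = unicodedata.category(chr(cp))
        self._recompile()
        return True

    def all_letters(self, text):
        # Stand-in for "restart the matching engine" after a cache miss.
        self.ensure(text)
        return all(ord(ch) in self.letters for ch in text)


cache = CodepointClassCache()
cache.all_letters("abc")             # every codepoint preloaded, no recompile
cache.all_letters("a\U00010400")     # first codepoint above 0xFFFF: recompile
cache.all_letters("a\U00010400")     # now cached for the session: no recompile
```

The expensive step runs once per previously unseen codepoint above the preload limit, so the cost is amortized over the session.
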
What if you did this ONCE and wrote the results to a file someplace?

-- 
Robert Haas
EnterpriseDB: http://www.enterprisedb.com
The Enterprise PostgreSQL Company

-- 
Sent via pgsql-hackers mailing list (email@example.com)
To make changes to your subscription:
http://www.postgresql.org/mailpref/pgsql-hackers