Re: [18] Policy on IMMUTABLE functions and Unicode updates

Peter Eisentraut Wed, 24 Jul 2024 11:11:02 -0700

On 24.07.24 14:20, Robert Haas wrote:

On Wed, Jul 24, 2024 at 12:42 AM Peter Eisentraut <pe...@eisentraut.org> wrote:

Fair enough.  My argument was, that topic is distinct from the topic of
this thread.


OK, that's fair. But I think the solutions are the same: we complain
all the time about glibc and ICU shipping collations and not
versioning them. We shouldn't make the same kinds of mistakes. Even if
ctype is less likely to break things than collations, it still can,
and we should move in the direction of letting people keep the v17
behavior for the foreseeable future while at the same time having a
way that they can also get the new behavior if they want it (and the
new behavior should be the default).

Versioning is possibly part of the answer, but I think it would bedifferent versioning from the collation version.

The collation versions are in principle designed to change rarely. Somelanguages' rules might change once in twenty years, some never. Maybeyou have a database mostly in English and a few tables in, I don't know,Swedish (unverified examples). Most of the time nothing happens duringupgrades, but one time in many years you need to reindex the Swedishtables, and the system starts warning you about that as soon as youaccess the Swedish tables. (Conversely, if you never actually accessthe Swedish tables, then you don't get warned about.)

If we wanted a similar versioning system for the Unicode updates, itwould be separate. We'd write the Unicode version that was current whenthe system catalogs were initialized into, say, a pg_database column.And then at run-time, when someone runs say the normalize() function orsome regular expression character classification, then we check what theversion of the current compiled-in Unicode tables are, and then we'dissue a warning when they are different.

A possible problem is that the Unicode version changes in practice withevery major PostgreSQL release, so this approach would end up warningusers after every upgrade. To avoid that, we'd probably need to keepsupport for multiple Unicode versions around, as has been suggested inthis thread already.

Re: [18] Policy on IMMUTABLE functions and Unicode updates

Reply via email to