Replace pg_mblen() with bounds-checked versions.

A corrupted string could cause code that iterates with pg_mblen() to overrun its buffer. Fix by converting all callers to one of the following:

1. Callers with a null-terminated string now use pg_mblen_cstr(), which raises an "illegal byte sequence" error if it finds a terminator in the middle of the sequence.

2. Callers with a length or end pointer now use either pg_mblen_with_len() or pg_mblen_range(), for the same effect, depending on which of the two seems more convenient at each site.

3. A small number of cases pre-validate a string, and can use pg_mblen_unbounded().

The traditional pg_mblen() function and COPYCHAR macro still exist for backward compatibility, but are no longer used by core code and are hereby deprecated. The same applies to the t_isXXX() functions.

Security: CVE-2026-2006
Backpatch-through: 14
Co-authored-by: Thomas Munro <[email protected]>
Co-authored-by: Noah Misch <[email protected]>
Reviewed-by: Heikki Linnakangas <[email protected]>
Reported-by: Paul Gerste (as part of zeroday.cloud)
Reported-by: Moritz Sanft (as part of zeroday.cloud)

Branch
------
REL_14_STABLE

Details
-------
https://git.postgresql.org/pg/commitdiff/cecedb912aeea391332bcb307d8e089266dbb2be

Modified Files
--------------
contrib/btree_gist/btree_utils_var.c     |  21 +++--
contrib/dict_xsyn/dict_xsyn.c            |   8 +-
contrib/hstore/hstore_io.c               |   8 +-
contrib/ltree/lquery_op.c                |   4 +-
contrib/ltree/ltree.h                    |   3 +-
contrib/ltree/ltree_io.c                 |  16 ++--
contrib/ltree/ltxtquery_io.c             |   4 +-
contrib/pageinspect/heapfuncs.c          |   2 +-
contrib/pg_trgm/trgm.h                   |   4 +-
contrib/pg_trgm/trgm_op.c                |  48 ++++++----
contrib/pg_trgm/trgm_regexp.c            |  21 +++--
contrib/unaccent/unaccent.c              |   7 +-
src/backend/catalog/pg_proc.c            |   2 +-
src/backend/tsearch/dict_synonym.c       |   8 +-
src/backend/tsearch/dict_thesaurus.c     |  18 ++--
src/backend/tsearch/regis.c              |  37 ++++----
src/backend/tsearch/spell.c              | 123 ++++++++++++-------------
src/backend/tsearch/ts_locale.c          |  97 ++++++++------------
src/backend/tsearch/ts_utils.c           |   4 +-
src/backend/tsearch/wparser_def.c        |   3 +-
src/backend/utils/adt/encode.c           |  10 +--
src/backend/utils/adt/formatting.c       |  22 ++---
src/backend/utils/adt/jsonfuncs.c        |   2 +-
src/backend/utils/adt/jsonpath_gram.y    |   2 +-
src/backend/utils/adt/levenshtein.c      |  14 +--
src/backend/utils/adt/like.c             |  18 ++--
src/backend/utils/adt/like_match.c       |   3 +-
src/backend/utils/adt/oracle_compat.c    |  33 ++++---
src/backend/utils/adt/regexp.c           |   6 +-
src/backend/utils/adt/tsquery.c          |  25 +++---
src/backend/utils/adt/tsvector.c         |  11 +--
src/backend/utils/adt/tsvector_op.c      |  10 ++-
src/backend/utils/adt/tsvector_parser.c  |  29 +++---
src/backend/utils/adt/varbit.c           |   8 +-
src/backend/utils/adt/varlena.c          |  41 +++++----
src/backend/utils/adt/xml.c              |  11 ++-
src/backend/utils/mb/mbutils.c           | 150 +++++++++++++++++++++++++++++--
src/include/mb/pg_wchar.h                |   7 ++
src/include/tsearch/ts_locale.h          |  34 +++++--
src/include/tsearch/ts_utils.h           |  14 ++-
src/test/modules/test_regex/test_regex.c |   3 +-
41 files changed, 537 insertions(+), 354 deletions(-)
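
Illustration (not part of the commit)
-------------------------------------
To make the overrun concrete, here is a minimal standalone sketch of the two bounds-checking ideas described in the commit message: checking a character's claimed length against an end pointer, and checking that no terminator falls inside it. It is not the PostgreSQL code. Every sketch_* name is hypothetical, only UTF-8 lead bytes are handled, and a return value of -1 stands in for the "illegal byte sequence" error that the real functions raise.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: byte length implied by a UTF-8 lead byte. */
int
sketch_utf8_len_from_lead(unsigned char c)
{
	if ((c & 0x80) == 0x00)
		return 1;
	if ((c & 0xE0) == 0xC0)
		return 2;
	if ((c & 0xF0) == 0xE0)
		return 3;
	if ((c & 0xF8) == 0xF0)
		return 4;
	return 1;					/* invalid lead byte: step over one byte */
}

/*
 * End-pointer variant (assumed shape): return the character length, or -1
 * if the lead byte claims more bytes than remain before 'end'.
 */
int
sketch_mblen_range(const char *p, const char *end)
{
	int			len;

	assert(p < end);
	len = sketch_utf8_len_from_lead((unsigned char) *p);
	if (len > end - p)
		return -1;
	return len;
}

/*
 * Null-terminated variant (assumed shape): return -1 if a terminator
 * appears inside the claimed character. The scan stops at the first NUL,
 * so it never reads past the end of the string.
 */
int
sketch_mblen_cstr(const char *p)
{
	int			len = sketch_utf8_len_from_lead((unsigned char) *p);

	for (int i = 1; i < len; i++)
	{
		if (p[i] == '\0')
			return -1;
	}
	return len;
}

/* Count characters in a buffer of nbytes bytes; -1 on a truncated one. */
int
sketch_count_chars(const char *s, size_t nbytes)
{
	const char *end = s + nbytes;
	int			n = 0;

	while (s < end)
	{
		int			len = sketch_mblen_range(s, end);

		if (len < 0)
			return -1;
		s += len;
		n++;
	}
	return n;
}
```

An unchecked loop that advances by the claimed length would step past 'end' when the final byte is a lead byte such as 0xC3. In this sketch, sketch_count_chars() reports -1 for that input instead.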
