Make escaping functions retain trailing bytes of an invalid character.

Instead of dropping the trailing byte(s) of an invalid or incomplete
multibyte character, replace only the first byte with a known-invalid
sequence, and process the rest normally.  This seems less likely to
confuse incautious callers than the behavior adopted in 5dc1e42b4.
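
The replacement rule described above can be illustrated with a small
standalone sketch.  This is not the code from string_utils.c or fe-exec.c:
escape_sketch, utf8_char_len and utf8_char_ok are hypothetical names, the
sketch assumes UTF-8 only, it uses a deliberately simplified validity check
(it does not reject overlong or out-of-range encodings), and it assumes the
known-invalid sequence is the byte 0xC0 followed by a space.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: character length implied by a UTF-8 lead byte. */
static int
utf8_char_len(unsigned char c)
{
    if (c < 0x80)
        return 1;
    if ((c & 0xE0) == 0xC0)
        return 2;
    if ((c & 0xF0) == 0xE0)
        return 3;
    if ((c & 0xF8) == 0xF0)
        return 4;
    return 1;                   /* stray continuation byte or bad lead byte */
}

/* Hypothetical, simplified check: complete, with well-formed continuations. */
static int
utf8_char_ok(const unsigned char *s, size_t remaining)
{
    int         len = utf8_char_len(s[0]);

    if (s[0] < 0x80)
        return 1;
    if (len == 1 || (size_t) len > remaining)
        return 0;               /* bad lead byte, or incomplete character */
    for (int i = 1; i < len; i++)
    {
        if ((s[i] & 0xC0) != 0x80)
            return 0;
    }
    return 1;
}

/*
 * Double single quotes in src, writing a NUL-terminated result to dst,
 * which must have room for 2 * len + 1 bytes.  On an invalid or incomplete
 * character, only its first byte is consumed and replaced; the following
 * bytes are then examined as ordinary input rather than being dropped.
 * Returns the output length; *error is set to 1 if anything was replaced.
 */
size_t
escape_sketch(char *dst, const char *src, size_t len, int *error)
{
    const unsigned char *s = (const unsigned char *) src;
    char       *d = dst;
    size_t      i = 0;

    assert(dst != NULL && src != NULL && error != NULL);
    *error = 0;

    while (i < len && s[i] != '\0')
    {
        if (s[i] < 0x80)
        {
            if (s[i] == '\'')
                *d++ = '\'';    /* double the quote */
            *d++ = (char) s[i++];
        }
        else if (utf8_char_ok(s + i, len - i))
        {
            int         clen = utf8_char_len(s[i]);

            memcpy(d, s + i, (size_t) clen);
            d += clen;
            i += (size_t) clen;
        }
        else
        {
            *d++ = (char) 0xC0; /* assumed known-invalid sequence */
            *d++ = ' ';
            i++;                /* skip only the first byte */
            *error = 1;
        }
    }
    *d = '\0';
    return (size_t) (d - dst);
}
```

With the four input bytes 'a', 0xE3, a single quote, and 'b', the sketch
replaces the 0xE3 lead byte and still doubles the quote that follows it,
which is the point of retaining the trailing bytes: a quote that happens to
sit where a continuation byte was expected is escaped, not swallowed.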

While we're at it, adjust PQescapeStringInternal to produce at most one
bleat about invalid multibyte characters per string.  This matches the
behavior of PQescapeInternal, and avoids the risk of producing tons of
repetitive junk if a long string is simply given in the wrong encoding.

This is a followup to the fixes for CVE-2025-1094, and should be included
if cherry-picking those fixes.

Author: Andres Freund <and...@anarazel.de>
Co-authored-by: Tom Lane <t...@sss.pgh.pa.us>
Reported-by: Jeff Davis <pg...@j-davis.com>
Discussion: https://postgr.es/m/20250215012712...@rfd.leadboat.com
Backpatch-through: 13

Branch
------
REL_17_STABLE

Details
-------
https://git.postgresql.org/pg/commitdiff/3abe6e04cc69d1076a695d90e179dd64010a2667

Modified Files
--------------
src/fe_utils/string_utils.c    | 91 ++++++++++++++++--------------------------
src/interfaces/libpq/fe-exec.c | 69 ++++++++++++++------------------
2 files changed, 65 insertions(+), 95 deletions(-)