Re: Support LIKE with nondeterministic collations

Heikki Linnakangas Mon, 11 Nov 2024 05:25:47 -0800

On 04/11/2024 10:26, Peter Eisentraut wrote:

On 29.10.24 18:15, Jacob Champion wrote:

libfuzzer is unhappy about the following code in MatchText:

+            while (p1len > 0)
+            {
+                if (*p1 == '\\')
+                {
+                    found_escape = true;
+                    NextByte(p1, p1len);
+                }
+                else if (*p1 == '_' || *p1 == '%')
+                    break;
+                NextByte(p1, p1len);
+            }


If the pattern ends with a backslash, we'll call NextByte() twice,
p1len will wrap around to INT_MAX, and we'll walk off the end of the
buffer. (I fixed it locally by duplicating the ERROR case that's
directly above this.)


Thanks.  Here is an updated patch with that fixed.

Sadly the algorithm is O(n^2) with non-deterministic collations.Is thereany way this could be optimized? We make no claims on how expensive anyfunctions or operators are, so I suppose a slow implementation isnevertheless better than throwing an error.

Let's at least add some CHECK_FOR_INTERRUPTS(). For example, this takesa very long time and is uninterruptible:


 SELECT repeat('x', 100000) LIKE '%xxxy%' COLLATE ignore_accents;

--
Heikki Linnakangas
Neon (https://neon.tech)

Re: Support LIKE with nondeterministic collations

Reply via email to