Re: [PATCH v2] middle-end/104854: Limit strncmp overread warnings

Siddhesh Poyarekar Tue, 15 Mar 2022 18:25:12 -0700

On 16/03/2022 02:06, Martin Sebor wrote:

The intended use of the strncmp bound is to limit the comparison to
at most the size of the arrays or (in a subset of cases) the length
of an initial substring. Providing an arbitrary bound that's not
related to the sizes as you describe sounds very much like a misuse.

Nothing in the standard says that the bound is related to the sizes ofinput buffers. I don't think deducing that intent makes sense either,nor concluding that any other use case is misuse.

As a historical note, strncmp was first introduced in UNIX v7 where
its purpose, alongside strncpy, was to manipulate (potentially)
unterminated character arrays like file names stored in fixed size
arrays (typically 14 bytes).  Strncpy would fill the buffers with
ASCII data up to their size and pad the rest with nuls only if there
was room.

Strncmp was then used to compare these potentially unterminated
character arrays (e.g., archive headers in ld and ranlib).  The bound
was the size of the fixed size array.  Its other use case was to compare
leading portions of strings (e.g, when looking for an environment
variable or when stripping "./" from path names).


Thanks for sharing the historical perspective.

Since the early UNIX days, both strncpy and to a lesser extent strncmp
have been widely misused and, along with many other functions in
<string.h>, a frequent source of bugs due to common misunderstanding
of their intended purpose.  The aim of these warnings is to detect
the common (and sometimes less common) misuses and bugs.

They're all valid uses however since they do not violate the standard.If we find at compile time that the strings don't terminate at thebounds, emitting the warning is OK but the more pessimistic check seemslike overkill.

I haven't seen these so I can't very well comment on them.  But I can
assure you that warning for the code above is intentional.  Whether
or not the arrays are nul-terminated, the expected way to call
the function is with a bound no greater than their size (some coding
guidelines are explicit about this; see for example the CERT C Secure
Coding standard rule ARR38-C).

(Granted, the manual makes it sound like -Wstringop-overread only
detects provable past-the-end reads.  That's a mistake in
the documentation that should be fixed.  The warning was never quite
so limited, nor was it intended to be.)

The contention is not that it's not provable, it's more that it'sdoesn't even pass the "based on available information this is definitelybuggy" assertion, making it more a strong suggestion than a warning thatsomething is definitely amiss. Which is why IMO it is more suitable asan analyzer check than a warning.


Thanks,
Siddhesh

Re: [PATCH v2] middle-end/104854: Limit strncmp overread warnings

Reply via email to