Re: UTF-8(4) versus UTF-8(6) security issues

srintuar Sat, 17 Jan 2004 17:19:39 -0800

Your example does not work, because


Well, I semi-agree. It is possible to write a parser that
has no problems. On the other hand, in the real world one
meets many parsers that were not well-written, so the
security risk exists in the real world.


Any such parser would be unsafe and non-compliant regardless
of 4 or 6 byte utf-8 considerations, and thus that point is
thoroughly moot.

If you encounter any such parser you must consider it to be
severely flawed. (After an initial validating pass it is safe
to use degenerate parsers which make assumptions about structure,
but not before.)


--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Re: UTF-8(4) versus UTF-8(6) security issues

Reply via email to