Ian Hickson wrote:
The way that IE and Firefox handle bytes with values greater than 0x7F
when a file is labelled as being encoded as ASCII differs -- IE ignores
the 8th bit, and only looks at the first seven bits, whereas Firefox
treats bytes in the range 0x80 to 0xFF as being encoded as Windows-1252.
This leads to security bugs, wherein the two browsers can interpret the same
byte stream differently (in particular, what looks like <script></script> to
IE might look like something quite different to Firefox).
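
A minimal sketch of that divergence, in Python (my choice of language here;
the byte values and the alert() payload are made up for illustration, not
taken from any real exploit):

    payload = b"<scr\xe9pt>alert(1)</scr\xe9pt>"

    # Strategy attributed to IE above: drop the 8th bit of every byte.
    ie_view = "".join(chr(b & 0x7F) for b in payload)

    # Strategy attributed to Firefox above: decode 0x80-0xFF as Windows-1252.
    firefox_view = payload.decode("cp1252")

    print(ie_view)       # <script>alert(1)</script>  -- script markup
    print(firefox_view)  # <scrépt>alert(1)</scrépt>  -- inert text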
I believe the ASCII specification should have defined how to convert any
random byte stream into characters, including bytes that aren't in the
range 0-127. That it didn't means that every language that allows ASCII
has to define how to handle it, which is an abstraction violation, and
results in different specs having different rules. In many cases, the
layers above ASCII didn't define this, and we've ended up with very real
security problems, such as the example above.
Now, in the case of ASCII, doing this would be trivial -- e.g. just say that
all bytes that aren't in the range 0x00 - 0x7F must be treated as 0x3F, and
that producers must not use bytes that aren't in the table. But yes, it
should be in the ASCII spec.
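
Continuing the sketch above, that rule is easy to state in code (again
Python, purely illustrative; the function name is mine):

    def decode_ascii(data: bytes) -> str:
        # Proposed rule: any byte outside 0x00-0x7F is treated as 0x3F ('?').
        return "".join(chr(b) if b <= 0x7F else "?" for b in data)

    print(decode_ascii(b"<scr\xe9pt>alert(1)</scr\xe9pt>"))
    # <scr?pt>alert(1)</scr?pt> -- the same result in every consumer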
Your assumption seems to be that there's a single "good" way to define
this error handling. I disagree with that.
For instance, for XML, sending non-ASCII characters when the declared
encoding is US-ASCII is a fatal error, and I definitely want it to stay
that way.
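
As an illustration (Python's standard xml.etree used here as a stand-in for
any conforming XML processor; the document bytes are made up):

    import xml.etree.ElementTree as ET

    doc = b'<?xml version="1.0" encoding="US-ASCII"?><p>caf\xe9</p>'

    try:
        ET.fromstring(doc)
    except ET.ParseError as err:
        # The 0xE9 byte is not US-ASCII, so a conforming processor
        # reports a fatal error and stops processing.
        print("fatal error:", err)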
BR, Julian