Re: PaceIRI status

Martin Duerst Tue, 25 Jan 2005 07:26:23 -0800

At 23:59 05/01/25, Julian Reschke wrote: >Martin Duerst wrote: >> At 22:29 05/01/25, Julian Reschke wrote: >> >The big difference here is that XMLNS uses IRIs/URIs as identifiers and only for that. However, what is an XSLT that transforms Atom content to HTML supposed to do when it encounters a IRI which isn't a legal URI? For instance, it can't put it into an HTML href attribute without producing invalid HTML. >> First, the Atom spec just says that HTML or XHTML goes into certain >> elements, it leaves it to other specs to say what HTML or XHTML is. >> So Atom doesn't deal with the question of whether IRIs are allowed >> there or not, and I hope PaceIRI is worded correctly in that respect. > > > > Second, HTML href attributes are defined as CDATA, so in terms of > > validity, any Unicode character goes anyway. > ><http://www.w3.org/TR/html401/struct/links.html#adef-href> > >As far as I can tell, HTML 4.01 normatively refers to RFC2396, which doesn't allow non-ASCII characters. The *DTD* may allow any kind of string here, but the spec text doesn't.

I know. You were using 'invalid', not 'illegal'.
The former is a technical term that is very strongly related to DTDs.

>> Third, the HTML 4 recommendation, dating back to 1997
>> (http://www.w3.org/TR/REC-html40-971218/appendix/notes.html#h-B.2)
>> contains language that in today's terms amounts to saying
>> "browsers should treat URIs with non-ASCII characters as IRIs,
>> even though strictly speaking, they're illegal". Many browsers
>> to some extent already do, and when the IRI RFC gets out, I guess
>> some more will.
>
>So are there plans to add an erratum to HTML 4.01?

If there was a working errata process for HTML 4.01, I guess
such an erratum would already have happened.

>And what's the situation for XHTML?

For XHTML 1.0/1.1, that's based on HTML 4..., so see above.
For XHTML 2.0, that will just refer to the IRI RFC, I guess.

Regards,    Martin.

Re: PaceIRI status

Reply via email to