Re: XHR LC comments

Julian Reschke Wed, 14 May 2008 06:04:04 -0700


Anne van Kesteren wrote:

On Sun, 04 May 2008 11:47:13 +0200, Julian Reschke <[EMAIL PROTECTED]> wrote:
Review of <http://www.w3.org/TR/2008/WD-XMLHttpRequest-20080415/>.
General points:
a) I'm confused about the approach to this document. On the one hand, we're being told that it can't define anything not already in use (and that new stuff belongs in XHR2), on the other hand it relies on HTML5, which is a moving target. It's good that this stuff is being written down, but if it relies on HTML5, I'd propose to consider other publication options.
The problem is that concepts such as "origin" and determining the encoding of a text/html stream are not defined anywhere else. It's not really clear to me what to do about that.

In some cases, it may be possible to copy the current definition. In other cases, it may be possible just not to depend on it (for instance, by not specifying encoding sniffing).

b) Algorithms: the spec uses a method to describe algorithms that IMHO is extremely hard to read (see for instance the send() method). This may be good for implementors, but seems to be bad for everybody else. Minimally, the lists should be structured for better readability.
Could you elaborate on what kind of change you envision? I'm not sure how they are not structured right now.


An example would be steps 8..11 in the description of open():

- these steps deal with credentials, and the whole list would be more readable if each group of steps that belong together were structured that way;

- optimally, things like this shouldn't be expressed as a set of instructions, but in a declarative way.

c) Structure: It would be nice if Section 4 had more structure. Right now it's ugly to navigate and refer to.
This is better in XMLHttpRequest Level 2. I'd rather not revise that entire section editorially as it might introduce new errors.


But then, it makes a comparison with XHR2 harder. Please reconsider.

2.1 Dependencies

"DOM
A conforming user agent must support some subset of the functionality defined in DOM Events and DOM Core that this specification relies upon. [DOM2Events] [DOM3Core]"
That reads a bit strange. Must the subset be non-empty?
Yes, as stated it must be a subset that matches what XMLHttpRequest requires from the eventing and core specifications.


Then it would be clearer if it said "the subset" instead of "some subset".

2.2 Terminology
"Two URIs are same-origin if after performing scheme-based normalization on both URIs as described in section 5.3.3 of RFC 3987 the scheme, ihost and port components are identical. If either URI does not have an ihost component the URIs must not be considered same-origin. [RFC3987]"
Why are we referring to the IRI spec (RFC3987) when talking about URIs, as defined in RFC3986?
For scheme-based normalization and ihost. Maybe I should use IRI instead of URI?

Well, if we're talking about URIs (and I think we are), then we need to refer to the RFC3986 grammar and comparison rules.
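
To make the comparison concrete, here is a minimal sketch of a same-origin check on URIs that compares scheme, host and port after normalization. It is an illustration only: originTuple(), sameOrigin() and DEFAULT_PORTS are hypothetical names, and it leans on the WHATWG URL parser found in current script engines, which is an assumption and not the RFC3986 comparison rules verbatim.

```javascript
// Sketch only: these names are hypothetical and not part of the draft.
// Default ports for the two schemes this sketch knows about.
const DEFAULT_PORTS = { "http:": "80", "https:": "443" };

// Reduce a URI to [scheme, host, port].
function originTuple(uri) {
  const u = new URL(uri); // throws TypeError if the input cannot be parsed
  // The parser lowercases scheme and host and reports "" for a default port,
  // so fill the default back in to make the port comparison explicit.
  const port = u.port || DEFAULT_PORTS[u.protocol] || "";
  return [u.protocol, u.hostname, port];
}

function sameOrigin(a, b) {
  const [schemeA, hostA, portA] = originTuple(a);
  const [schemeB, hostB, portB] = originTuple(b);
  // A URI without a host component is never same-origin with anything.
  if (hostA === "" || hostB === "") return false;
  return schemeA === schemeB && hostA === hostB && portA === portB;
}
```

With this sketch, "HTTP://Example.COM:80/a" and "http://example.com/b" compare as same-origin, while a differing scheme or port, or a URI without a host such as a mailto: URI, does not.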

Besides that: this may be a non-optimal example unless we can point to a definition of "HttpOnly cookies". Can we?
I don't believe we can, but since this was put in mostly for HttpOnly cookies I'd rather not remove that. I think it will be clear enough for people reading the document.

So why don't we refer to the specification for httpOnly? Do you consider it a problem that it's a Microsoft document?

- TRACK??? There's probably a rationale for that. If there is, please include it in the spec.
It's a security issue, as should be clear from the next bullet point.

As TRACK doesn't seem to be documented anywhere, and is no longer implemented in current IIS versions, I'd really like to see this made a footnote. The way it's written now is just totally confusing to every reader who doesn't know the full story around it.

"If the user argument was not omitted and is not null let stored user be user encoded using the encoding specified in the relevant authentication scheme or UTF-8 if the scheme fails to specify an encoding."
Why is XHR talking about the encoding here? Is "stored user" a string or a byte array?
(same for password)
They're a string (in the API).

If they are strings, then talking about character encoding doesn't make any sense here.

"If the value argument is null terminate these steps. (Do not raise an exception.)."
This makes it impossible to set empty headers, which are allowed in HTTP. Even worse, it silently fails.
Empty headers can be set using the empty string, no? Not raising an exception is consistent with implementations and I don't think it matters much as it doesn't have any effect.
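
The distinction between a null value and an empty string can be sketched as follows. This is a toy model only: the plain array of [name, value] pairs and the free-standing setRequestHeader() function are assumptions made for illustration, and they ignore how the draft combines repeated headers.

```javascript
// Toy model: the header list is a plain array of [name, value] pairs.
// This is a hypothetical stand-in, not the draft's actual algorithm.
function setRequestHeader(headers, name, value) {
  if (value === null) {
    return headers; // null: terminate silently, no exception, no header
  }
  headers.push([name, String(value)]); // "" is kept as an empty header value
  return headers;
}

const headers = [];
setRequestHeader(headers, "X-Ignored", null); // leaves the list untouched
setRequestHeader(headers, "X-Empty", "");     // records an empty header
```

After these two calls the list holds a single entry, X-Empty with an empty value; the null call left no trace.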


Sorry, was reading one thing, but thinking about something else.

Thinking of it, could you please add a clarification that setting to an empty string is legal, and MUST NOT be ignored? I recall that Microsoft's original XHR (ActiveX) implementation got that wrong, not setting the header at all.

"For security reasons, these steps should be terminated if the header argument case-insensitively matches one of the following headers:
     * Accept-Charset
     * Accept-Encoding
     * Connection
     * Content-Length
     * Content-Transfer-Encoding
     * Date
     * Expect
     * Host
     * Keep-Alive
     * Referer
     * TE
     * Trailer
     * Transfer-Encoding
     * Upgrade
     * Via "
It's unclear why there's a security reason not to allow things like "Accept-Charset" or "Accept-Encoding". Please explain.
This was done based on implementation feedback. I haven't investigated what the reasons were for the various headers. If implementors read this maybe they could chime in and point it out.

Please. And if they don't, please remove all headers for which nobody can explain why they are in this list.
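
For reference, the case-insensitive match described in the quoted text amounts to something like the sketch below. BLOCKED_HEADERS and isBlockedHeader() are hypothetical names; the list is copied from the quoted draft text and says nothing about why each entry is there.

```javascript
// Header names taken from the list quoted above, stored in lowercase.
const BLOCKED_HEADERS = new Set([
  "accept-charset", "accept-encoding", "connection", "content-length",
  "content-transfer-encoding", "date", "expect", "host", "keep-alive",
  "referer", "te", "trailer", "transfer-encoding", "upgrade", "via",
]);

// Returns true when setRequestHeader() would terminate for this header name.
function isBlockedHeader(name) {
  return BLOCKED_HEADERS.has(String(name).toLowerCase());
}
```

Removing an unexplained header from the list would then be a one-line change to the set.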

General comment on "setRequestHeader(header, value), method": the way it is specified makes it impossible for a client to reliably set headers. We need a way to either retrieve the current value for inspection, or a way to reset the header. Or both.
http://lists.w3.org/Archives/Public/public-webapi/2008May/0139.html


Yes, we continue to disagree on this.

"If stored method is GET act as if the data argument is null."

Another case of HTTP/1.1 being profiled. Don't do it.


This was done on request of implementations.

That's IMHO not sufficient reason to do it. Please add a convincing rationale, or leave this to the HTTP WG.

"Serialize data into a namespace well-formed XML document and encoded using the encoding given by data.inputEncoding, when not null, or UTF-8 otherwise. Or, if this fails because the Document cannot be serialized act as if data is null."
Silent failure????
Yes.


Very bad.

Does anybody rely on that? I would be very surprised.

"If no Content-Type header has been set using setRequestHeader() append a Content-Type header to the list of request headers with a value of application/xml;charset=charset where charset is the encoding used to encode the document."
This will result in an invalid Content-Type header if the UA has initialized the headers with a default (which I think the spec currently allows; and at least one UA was reported to do). See comment above about header handling.
Rephrased.


Pointer?

"If the user agent supports HTTP State Management it should persist, discard and send cookies (as received in the Set-Cookie and Set-Cookie2 response headers, and sent in the Cookie header) as applicable. [RFC2965]"
This should probably include a reference to the Set-Cookie (not Set-Cookie2) spec as well (RFC2109).
I believe it used to do that and it was pointed out that that specification is not useful in practice and would actually do more harm than good. I'm not really sure what to do here.

Well, the one that is not used in practice is RFC2965, not RFC2109. That being said, you probably need to reference both.

"// The following script:
var client = new XMLHttpRequest();
client.open("GET", "test.txt", true);
client.send();
client.onreadystatechange = function() {
  if(this.readyState == 3) {
   print(this.getAllResponseHeaders());
  }
}

// ...should output something similar to the following text:
Date: Sun, 24 Oct 2004 04:58:38 GMT
Server: Apache/1.3.31 (Unix)
Keep-Alive: timeout=15, max=99
Connection: Keep-Alive
Transfer-Encoding: chunked
Content-Type: text/plain; charset=utf-8"

I think examples like this would be more readable (and take less space) when using the synchronous mode.


I would like to avoid encouraging authors to use the synchronous API.


Disagreed. I think readability and compactness are more important here.
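
For comparison, a synchronous variant of the quoted example might look like the sketch below. The wrapper function responseHeadersSync() and its injected XHR parameter are assumptions added so the snippet does not depend on a particular environment; only open(), send() and getAllResponseHeaders() come from the draft.

```javascript
// Sketch only: responseHeadersSync() is a hypothetical wrapper.
// XHR defaults to the host's XMLHttpRequest constructor when one exists.
function responseHeadersSync(url, XHR = globalThis.XMLHttpRequest) {
  const client = new XHR();
  client.open("GET", url, false); // third argument false selects synchronous mode
  client.send(null);              // blocks until the response has arrived
  return client.getAllResponseHeaders();
}
```

No onreadystatechange handler and no readyState check are needed, which is where the gain in compactness comes from; the cost is that the calling script is blocked for the duration of the request.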

"status of type unsigned short, readonly
On getting, if available, it must return the HTTP status code sent by the server (typically 200 for a successful request). Otherwise, if not available, the user agent must raise an INVALID_STATE_ERR exception."
This may be incorrect when the UA caches (304 vs 200).
That's why it says typically.


Hm, no.

When the UA caches, and the server sent 304, the client will potentially see a 200. This would contradict what *this* paragraph says.

"statusText of type DOMString, readonly
On getting, if available, it must return the HTTP status text sent by the server (appears after the status code). Otherwise, if not available, the user agent must raise an INVALID_STATE_ERR exception."
Really? It seems to me that if somebody really implements this, clients are likely to break. Why not allow an empty string here?
This is what clients have implemented as far as I can tell. Though the HTTP status text could be the empty string, if that's what you mean...

Does the "if not available" apply to any of the existing implementations? Why would it be "not available"? Please clarify.

Finally, my main other issue with this spec is that it is silent about the recommended behaviour for unsafe methods, about which RFC2616 says in Section 9.1.1 (<http://greenbytes.de/tech/webdav/rfc2616.html#rfc.section.9.1.1>):
"Implementors should be aware that the software represents the user in their interactions over the Internet, and should be careful to allow the user to be aware of any actions they might take which may have an unexpected significance to themselves or others.
In particular, the convention has been established that the GET and HEAD methods SHOULD NOT have the significance of taking an action other than retrieval. These methods ought to be considered "safe". This allows user agents to represent other methods, such as POST, PUT and DELETE, in a special way, so that the user is made aware of the fact that a possibly unsafe action is being requested."
Thus, allowing a web page to submit a PUT, POST or DELETE request without user interaction seems to be a very dangerous thing to me, and the spec should point that out (see also <http://ietf.osafoundation.org:8080/bugzilla/show_bug.cgi?id=237>).
All requirements from HTTP are taken over unless explicitly stated so I don't think this is needed.


Well, the spec repeats lots of things specified somewhere else already.

The warning from the HTTP spec is relevant and should appear here, as XHR is related to UAs, and existing UAs are known to ignore this security consideration.


BR, Julian
