Re: [whatwg] URL: spec review - basic_parser

Sam Ruby Mon, 13 Oct 2014 16:06:25 -0700

On 10/13/2014 10:05 AM, Anne van Kesteren wrote:


Not yet.  I'm still seeing a large set of differences between what I am
producing and what is in urltestdata.txt and need to track down whether the
problems are in my implementation, the spec, or in the test results.

Once those three are in sync; I'll try to look at the bigger picture.


Cool. Sounds great.


New test results:

http://intertwingly.net/stories/2014/10/13/urltest-results/

The fourth column ("Notes") indicates which properties differ betweenwhat my software produces and what the testdata indicates should be theexpected results. These fall into three basic categories:

1) rows where the notes merely say "href" are cases where parse errorsare thrown and failure is returned. The expected results are an objectthat returns the original href, but empty values for all otherproperties. I don't see this behavior in the spec:


https://url.spec.whatwg.org/#url-parsing

2) rows that contain "href hostname" appear to be ones where theexpected results do not appear to be updated to include the host to IDNAmapping.

3) rows that contain "href protocol hostname pathname" need furtherinvestigation. I suspect that these are based on my using a library tonormalize the IDNA mapping, and it "helpfully" cleans up other problemslike removing U+0000 characters from the input.


My implementation can be found here:

http://intertwingly.net/stories/2014/10/13/url_rb.html

Note the comments linking back to spec sections, and comments thatidentify step numbers.


- Sam Ruby

P.S. I didn't update to the latest test data yet; but from what I cansee the changes wouldn't materially affect the results, so I ampublishing now.

P.P.S. Preview of what is yet to come, ruby2js run against myimplementation produces:


http://intertwingly.net/stories/2014/10/13/url_js.html

This will need some additional work to get running, for example lines54, 65, 82, 85, and 267 call out to libraries that aren't available toJavaScript. Lines 275 to 277 are debugging lines that will be removedshortly.

Re: [whatwg] URL: spec review - basic_parser

Reply via email to