A. Pagaltzis wrote:
> I agree that this would be good. The test suite seems quite
> awkward so far. I just can’t think of any ways to automate tests
> to the extent you propose. Have you got any suggestions for a
> format to use?
I know not everyone is going to agree with me on this, but my first
suggestion would be to cut down on the number of tests. Mark Pilgrim has
something like 3000+ tests in his UFP test suite. That's great; that's what
aggregator developers should be doing, since those kinds of tests can easily
be automated. But that's not the level of testing that we can put up on a
wiki and hope people will manually contribute results.
What we need is fewer tests with more coverage, and that involves tradeoffs.
Take relative URIs as an example. I probably have 50-odd unit tests, but you
don't need all of those in a conformance test. You certainly don't need each
of those tested on each element that uses a URI. Pick one element that's
easy to test (say html content using img tags) and include a small subset of
the trickier URIs.
You create "good" images at the destinations that would be reached if the
parsing has been performed correctly and, where possible, "bad" images at
the destination that an aggregator might reach if they're doing something
wrong (often this won't be possible, but in many cases aggregators get
things predictably wrong). From this you get fairly comprehensive coverage
of the different forms of relative URI in a simple set of tests that can be
evaluated at a glance.
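To pick which relative forms count as tricky, it helps to check what a correct resolver produces. A quick sketch using Python's urllib.parse, which follows RFC 3986 (the base URL and image name here are invented for illustration):

```python
from urllib.parse import urljoin

# Hypothetical feed location; "good.png" stands in for the "good" image.
base = "http://example.com/tests/base/feed.xml"

# A handful of the trickier relative forms and their correct
# resolutions (RFC 3986 semantics, as implemented by urljoin):
for rel in ("good.png", "./good.png", "../good.png",
            "/good.png", "//other.example/good.png"):
    print(rel, "->", urljoin(base, rel))
```

An aggregator that resolves all of these correctly in html content is a reasonable candidate for the assumption described below.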
Now to test what level of support there is for relative URIs in other
elements, you just pick one reasonably complicated (but not the worst) test
case and test each element just once. In many cases this will involve
following a link to see whether it's going to the right place, but you can
make this more pleasant by including a "good" page at the expected
destination and, as with the images, a couple of "bad" pages in all the
places you think an aggregator might go if it's getting it wrong.
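To make that concrete, here's a sketch of what a single per-element test entry might look like (the URLs, id and date are all invented for illustration):

```xml
<entry xml:base="http://example.com/tests/linkbase/">
  <id>tag:example.com,2006:tests/linkbase</id>
  <title>Relative URI in link/@href</title>
  <updated>2006-01-01T00:00:00Z</updated>
  <link rel="alternate" href="../good/expected.html"/>
  <content type="html">Following this entry's link should land on the "good" page.</content>
</entry>
```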
The tradeoff here is that you're assuming an aggregator that can handle all
the variations you've thrown at it in html content would also be able to
handle those variations in other elements as long as it can pass the single test
that you've tested everywhere else. That may not always be the case, but
your other choice is an exponentially larger test suite that just isn't
manageable.
For most tests it's actually not that complicated though. If you're testing
something that can be seen somewhere in the UI, like say a title, date or
author, simply include exactly what the value should be right there in the
message content for that entry. When you're doing a set of tests along the
same lines (like, say, titles encoded in different ways) try to make sure
every test produces the same results so you can see at a glance if they're
all correct.
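A sketch of that pattern (everything here is invented): the expected value is repeated in the content, so a failure is visible without leaving the entry. Note the content is doubly escaped because it's type="html":

```xml
<entry>
  <id>tag:example.com,2006:tests/title-entity</id>
  <updated>2006-01-01T00:00:00Z</updated>
  <title>Title &amp; entity test</title>
  <content type="html">The title above should read exactly: Title &amp;amp; entity test</content>
</entry>
```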
When testing things like the different forms of escaping which can have a
huge number of permutations, it's often easier to test several different
combinations at the same time in one test (similar to the html acid tests).
When you fail a test you can't easily tell the exact cause of the failure,
but it does cut down the number of tests that have to be checked.
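As a rough sketch of the layering involved (the strings are invented): a type="html" value gets unescaped twice, once as XML and once as HTML, which is why one acid-style string can exercise several entities at once:

```python
import html

# What an XML parser hands the aggregator after the first (XML) level
# of unescaping; the raw feed would have had each & written as &amp;.
after_xml = "3 &lt; 4 &amp; 5 &gt; 2"

# For type="html" text the aggregator must unescape a second time
# before rendering:
rendered = html.unescape(after_xml)
print(rendered)  # 3 < 4 & 5 > 2
```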
And on the subject of huge sets of tests, it's obviously a lot easier to
test a single feed with 20 tests than it is to have to subscribe to 20
different feeds to run each test separately. However, it's sometimes a good
idea to also have each test separately in case you have an aggregator that
flat out refuses to subscribe to a feed because of one test, but would
otherwise be perfectly capable of passing all the others. I find it easier
to do this with a PHP script and some mod_rewrite rules to make the URLs
look prettier.
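For what it's worth, a minimal sketch of that setup (the script name and URL scheme are made up): one PHP script serves every test, and a rewrite rule keeps the URLs clean:

```apache
# .htaccess (illustrative): serve /tests/foo.xml from feed.php?test=foo
RewriteEngine On
RewriteRule ^tests/([a-z0-9-]+)\.xml$ feed.php?test=$1 [L]
```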
Also, when you need a lot of separate feed tests (either for the reasons
mentioned above, or for tests that can't be grouped) it can be helpful if
you include an OPML file that can import all the tests in one go rather than
having to add them individually.
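A minimal sketch of such a file (the feed URLs are invented); type="rss" is the conventional outline type for feed subscriptions regardless of the feed's actual format:

```xml
<?xml version="1.0" encoding="utf-8"?>
<opml version="1.1">
  <head>
    <title>Conformance tests</title>
  </head>
  <body>
    <outline type="rss" text="Relative URIs"
             xmlUrl="http://example.com/tests/relative-uris.xml"/>
    <outline type="rss" text="Title escaping"
             xmlUrl="http://example.com/tests/title-escaping.xml"/>
  </body>
</opml>
```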
Some other suggestions...
Make sure your ids are always unique (I know this goes without saying, but
it's easy to overlook when you're throwing together a test). Some
aggregators will refuse to display entries with ids they've already seen
before even if the feed from which those ids came has long since been
deleted. It's also advisable to include an alternate link on all entries
unless that would interfere with your test, since I think there are some
aggregators that fail to subscribe to feeds whose entries lack alternate links.
Avoid using xhtml content or weird escaping techniques unless you're
actually testing those features. I should probably double check this, but I
think the safest content type is html with named entity escaping (lt, amp) -
no numeric entities or cdata escaping. It's also advisable to keep your
dates very recent (this is easy if tests are generated via PHP and you can
make them up on the fly) since some aggregators won't display entries that
are older than a certain date.
That's about all I can think of for the moment.
Regards
James