Hi,
On Sep 25, 2007, at 18:35, Steven Faulkner wrote:
>>Those who add a bogus alt for validation are a subset of people who
>>include a bogus alt.
>and what size is this subset (who knows)
Presumably the population whose behavior is swayed by what is deemed
valid (i.e. syntactically correct) is significant enough for the
html4all group to be concerned about what validity says about the alt
attribute.
>why not develop a validator that looks for and fails the page if it
>has bogus alt?
Because it only leads to an arms-race-like escalation that causes more
junk to be served to users.
If I make the validator check that each image has an alt text that is
longer than the empty string, those generators that do not have an
alternative text available will be programmed to emit a bogus string
that is at least one character long.
If I make the validator check that each image has an alt text that is
longer than one character, those generators that do not have an
alternative text available will be programmed to emit a bogus string
that is at least two characters long.
And so on. (The point stays the same if you substitute another
heuristic for the length test.)
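To make the escalation concrete, here is a minimal sketch in Python of
the length heuristic described above (hypothetical code, not anything
from an actual validator):

```python
# Hypothetical sketch of the naive length heuristic described above;
# not code from any real validator.

def alt_looks_bogus(alt, min_length=1):
    """Flag an alt value that is missing or shorter than min_length."""
    return alt is None or len(alt) < min_length

# Each time the validator raises the bar, a generator that has no real
# alternative text just pads its bogus string past the new threshold:
assert alt_looks_bogus(None)                    # missing alt: caught
assert alt_looks_bogus("")                      # empty string: caught
assert not alt_looks_bogus("x")                 # one character defeats it
assert alt_looks_bogus("x", min_length=2)       # raise the bar...
assert not alt_looks_bogus("xx", min_length=2)  # ...and it is defeated again
```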
This is not only an issue with alt. There's non-alt precedent to this
kind of behavior. When HTML 4 said that paragraphs must not be empty,
people who saw value in emitting qualitatively empty paragraphs
started putting a no-break space in paragraphs that were
qualitatively empty. To address this, Hixie went ahead and defined a
concept of Significant Inline content to make a single no-break space
invalid in HTML5. Will that make people no longer see value in empty
paragraphs? My bet is on "No". They'll just generate something that
fools the new test. I've suggested to Hixie that instead of trying to
outsmart people who want to emit "bad" empty paragraphs, we stop the
escalation instead.
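As a rough sketch of the problem (my paraphrase of the idea, not the
spec's actual definition of Significant Inline content), a check along
these lines catches the no-break-space trick but is immediately fooled
by the next invisible character:

```python
import re

# Rough paraphrase: a paragraph whose text is only whitespace counts as
# qualitatively empty. In Python 3, \s already matches U+00A0 NO-BREAK
# SPACE for str patterns. This is not the spec's actual definition.
QUALITATIVELY_EMPTY = re.compile(r'\A\s*\Z')

def has_significant_content(text):
    return QUALITATIVELY_EMPTY.match(text) is None

assert not has_significant_content("")        # empty: caught by the old rule
assert not has_significant_content("\u00a0")  # no-break space: caught now
assert has_significant_content("\u200b")      # a zero-width space still fools it
```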
When HTML 4.01 Strict banned target='', it didn't make people stop
wanting to open links in new windows. Instead, it begat this:
http://www.alistapart.com/articles/popuplinks
Moral of the story: you can't use the concept of validity to stop
people from achieving the results they want. They just figure out a
less detectable way to do it, which means browsers have a harder time
offering countermeasures to users.
>using heuristics, that shouldn't be so hard should it?
Heuristics work when the uncooperative data sources are indifferent
to the heuristics (that is, they don't actively try to fool them).
This, presumably, would be the case if reasonable alt-related
heuristics were deployed in AT. Reading out the URI is so bad a
heuristic that you have effectively been arguing that authors should
defeat that particular heuristic.
Heuristics don't work when the uncooperative data sources are
actively hostile to the heuristic (i.e. try to fool it) *and* they
know what the heuristic is so that they *can* fool it. This is why
search engines keep their anti-SEO spam heuristics secret and complex
enough to be resilient to black-box reverse engineering.
Precedent suggests that in the case of validators, people will seek
to fool them if the concept of validity stands in the way of the
results they want and they have a requirement (for whatever reason)
to be valid.
Now I'm going to shortly follow up to John Foliot's email and then
excuse myself and write some validator software instead of talking
about it.
--
Henri Sivonen
[EMAIL PROTECTED]
http://hsivonen.iki.fi/