--On Monday, January 12, 2004 12:37:57 AM -0500 Sean 'Captain Napalm' Conner <[EMAIL PROTECTED]> wrote:

It was thus said that the Great Walter Underwood once stated:

        The record starts with one or more User-agent lines, followed by
      one or more Disallow lines, as detailed below. Unrecognised headers
      are ignored.

Right there---last line---"Unrecognised headers are ignored."

Good, then the format can be upgraded, within limits.

[skip to robustness principle discussion]

  By the same token, it is the robots.txt parser that "accepts" the
robots.txt file, so by the robustness principle, you need to ignore
directives you don't understand.

Right, but the robustness principle also means that the sender cannot rely on the receiver doing that. If each end makes allowance for bugs in the other end, then the protocol is much more likely to work in the real world.
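
For the receiving side, here is a rough sketch in Python of a parser that skips anything it does not recognise. It is only an illustration, not how any particular robot does it: the names parse_robots and KNOWN_FIELDS are made up for this example, and the handling of a User-agent line that follows Disallow lines with no blank line in between is my own lenient choice.

```python
# Fields this (hypothetical) parser understands; everything else is skipped.
KNOWN_FIELDS = {"user-agent", "disallow"}


def parse_robots(text):
    """Parse robots.txt text into a list of records.

    Each record is a dict with "user_agents" and "disallow" lists.
    Unrecognised or malformed lines are ignored rather than rejected.
    """
    records = []
    current = None
    for raw in text.splitlines():
        if not raw.strip():
            current = None              # a truly blank line ends the record
            continue
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue                    # comment-only line: not a boundary
        if ":" not in line:
            continue                    # malformed line: skip it
        field, value = line.split(":", 1)
        field = field.strip().lower()
        value = value.strip()
        if field not in KNOWN_FIELDS:
            continue                    # "Unrecognised headers are ignored."
        if field == "user-agent":
            # Lenient assumption: a User-agent line after Disallow lines
            # starts a new record even without a blank line between them.
            if current is None or current["disallow"]:
                current = {"user_agents": [], "disallow": []}
                records.append(current)
            current["user_agents"].append(value)
        elif current is not None:
            current["disallow"].append(value)
        # A Disallow line before any User-agent line is dropped.
    return records


sample = (
    "User-agent: *\n"
    "Crawl-delay: 10\n"            # an extension field this parser does not know
    "Disallow: /tmp/ # scratch\n"
    "\n"
    "User-agent: webcrawler\n"
    "Disallow:\n"
)
records = parse_robots(sample)
```

The other half of the principle is on the author of the file: the sender should still write only lines that an old, strict parser could handle, since nothing guarantees that every robot is this forgiving.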

  -spc (And be thankful Martijn didn't decide to use RFC-822 style
        header lines ... )

It wasn't just Martijn, it was a group of people, all actively involved in robots and web sites. I'm sure that someone was aware that they did not need to fit robots.txt lines on an 80-column Hollerith card.

Walter Underwood
Principal Architect
Verity Ultraseek

Robots mailing list
