On 02/18/2010 04:07 AM, Rob / EViLSLuT wrote:

> Is the syntax of the robots.txt correct? Could be wrong.

Well, technically, it should say "Googlebot" instead of just
"Google".  But this is such a common mistake that Googlebot
answers to the name Google, and no harm is done.
 
> To my knowledge this is what google likes,
> 
> User-agent: *
> Disallow: /
> 
> User-agent: Googlebot
> Allow: /

That's not the recommended form.  According to
  http://www.robotstxt.org/robotstxt.html

there is no "Allow:" directive.  Certainly there is no advantage 
to saying "Allow: /" ... and no disadvantage to using the canonical
form "Disallow: " which disallows nothing.

There are situations where an "Allow:" directive would be helpful,
but this is not one of them.

Also, due to differences in opinion as to the interpretation of 
the robots.txt non-standard, it is a bit unpredictable whether 
bots will respond to the "first match" or "best match" ... so 
it is good practice to put more-specific directives ahead of 
less-specific ones.  In particular, the "*" wildcard should be
last, as it is currently on the site.

In any case, the larger point remains:  There are plenty of
perfectly reasonable, desirable bots that are being excluded by 
the current file.  Conversely there are plenty of truly horrible 
bots that will never be excluded by any robots.txt file.


------------------------------------------------------------------------------
Download Intel® Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs 
proactively, and fine-tune applications for parallel performance. 
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
Flightgear-devel mailing list
Flightgear-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/flightgear-devel

Reply via email to