For those that follow the rules just add a robots.txt to the root of
your web directory.

User-agent: *
Disallow: /

The above will tell all bots that you do not want to be crawled.
Google, Yahoo and etc. will respect your wish and stop. Unfriendly
bots will still continue to crawl your site.

http://chrislouden.com/robots.txt

You can also get more creative, http://thcnet.net/robots.txt


On 5/24/07, Roger Rustad <[email protected]> wrote:
What is the best way to not let google (or any other search engine)
crawl your Apache website?  I'm assuming that it's something
.htaccess-related?
_______________________________________________
909linux mailing list
[email protected]
http://909linux.org/cgi-bin/mailman/listinfo/909linux

Reply via email to