Hello Monkey team,

After my installation yesterday, I saw today that a web crawler had already
taken a look at my server:

66.249.72.234 -[05/Oct/2011 02:54:59 +0200] GET / HTTP/1.1 200 1364

Oh, it seems I forgot robots.txt :-)


My idea is to keep web crawlers from browsing the server in general. This
could be done with a single file called "robots.txt" in the standard htdocs
folder (/var/www/htdocs):

User-agent: *
Disallow: /
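
To double-check that these two lines really shut out every crawler, here is a
small sketch using Python's standard urllib.robotparser module. It only
illustrates how a well-behaved crawler would read the file; the helper name
is_allowed, the "Googlebot" user agent and the example.com URLs are my own
assumptions for the example and have nothing to do with Monkey itself.

```python
from urllib.robotparser import RobotFileParser

# The proposed robots.txt content, exactly as in the mail above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if user_agent may fetch url under the given robots.txt.

    Hypothetical helper for illustration; it parses the text in memory,
    so no network access is needed.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Example paths are assumptions; any path under "/" should be denied.
    for path in ("/", "/index.html", "/docs/manual.html"):
        allowed = is_allowed(ROBOTS_TXT, "Googlebot", "http://example.com" + path)
        print(path, "allowed" if allowed else "disallowed")
```

Note that robots.txt is only a request: polite crawlers honor it, but the
server still answers anyone who ignores the file.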


Greetings from Germany,
Christian Stankowic <[email protected]>
http://www.stankowic-development.net

_______________________________________________
Monkey mailing list
[email protected]
http://lists.monkey-project.com/listinfo/monkey
