Nicolas,

My main point: Kunsheng wants people to do work for his project, but at 
least as of this morning, there is nothing to tell us what he's going to 
do with the collected information.

If I'm going to offer up bandwidth, I need to see some benefit.  Based 
on his web pages, we can only guess.  He could be harvesting E-Mail 
addresses for all I know.

(I don't think he's doing anything evil, but there just isn't much 
information at all).

To respond to your statement:

I can't block him using robots.txt if he identifies as some other, 
useful spider, and I can't block him until after he visits if I don't 
know in advance how his spider identifies itself.

For that matter, spiders have to retrieve robots.txt, parse it, and 
honor it -- that's not blocking, that's courtesy.

I'm also not saying that I *should* block him by IP.  I'm stating that 
there aren't a whole lot of ways to block.

... and I'm saying that absent some other indication, he doesn't look 
all that different from multiple copies of something like EmailSiphon.

Kunsheng took my criticisms as intended: constructively.  I pointed out 
what his project looks like, and how that could be interpreted.

As far as I can tell, your comments are about how I should run my 
business, and not about BOINC at all.

Nicolás Alvarez wrote:
> El Sáb 12 Sep 2009 18:31:14 Lynn W. Taylor escribió:
>> You're doing it in a distributed fashion, so I can't block your project
>> by IP -- requests will come from anywhere.
> 
> So what? You shouldn't block by IP, that's what robots.txt is for. You can't 
> block Google or Yahoo by IP either; they have hundreds of IPs.
> 
_______________________________________________
boinc_dev mailing list
[email protected]
http://lists.ssl.berkeley.edu/mailman/listinfo/boinc_dev
To unsubscribe, visit the above URL and
(near bottom of page) enter your email address.

Reply via email to