Didn't I block this with http://jidanni.org/robots.txt ?:
18.104.22.168 - - [01/Jan/2008:02:13:46 -0800] "GET
/geo/antipodes/images/tai_par_arg.png HTTP/1.1" 200 4773
"http://image.soso.com" "Mozilla/4.0 (compatible; MSIE 6.0)"
Is there some connection here with Nutch that I'm not seeing?
PS - From our experience, there are a number of China-based bots that
don't obey robots.txt. We wind up having to block them via their IP
"If you can't find it, you can't fix it"