Michael Cafarella wrote:
PS - Does anyone know (Doug?) whether we are crawling the entire
OSU site?  Does Google have a coverage advantage?

We crawled oregonstate.edu 10 levels deep and got around 300k unique pages. OSU's Google Appliance is limited to 300k pages, but crawls both oregonstate.edu and (the deprecated domain) orst.edu. Covering orst.edu gave it an advantage on several queries.
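
To illustrate how the domain scope and the depth limit together determine coverage, here is a minimal sketch. It is not Nutch code: the crawl function, the in-memory link graph, and the example URLs are all hypothetical, and a real crawl would fetch and parse pages instead of reading links from a dict.

```python
from collections import deque
from urllib.parse import urlparse

# Hypothetical link graph: page URL -> URLs it links to.
SEED = "http://oregonstate.edu/"
EXAMPLE_LINKS = {
    "http://oregonstate.edu/": ["http://oregonstate.edu/a", "http://www.orst.edu/old"],
    "http://oregonstate.edu/a": ["http://oregonstate.edu/b"],
    "http://www.orst.edu/old": ["http://www.orst.edu/only-here"],
}


def crawl(seed, links, allowed_domains, max_depth):
    """Breadth-first crawl; returns the set of unique in-scope pages reached."""

    def in_scope(url):
        # A URL is in scope if its host is an allowed domain or a subdomain of one.
        host = urlparse(url).hostname or ""
        return any(host == d or host.endswith("." + d) for d in allowed_domains)

    seen = {seed} if in_scope(seed) else set()
    queue = deque((url, 0) for url in seen)
    while queue:
        url, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not follow links beyond the depth limit
        for target in links.get(url, []):
            if target not in seen and in_scope(target):
                seen.add(target)
                queue.append((target, depth + 1))
    return seen


one_domain = crawl(SEED, EXAMPLE_LINKS, {"oregonstate.edu"}, 10)
both_domains = crawl(SEED, EXAMPLE_LINKS, {"oregonstate.edu", "orst.edu"}, 10)
print(len(one_domain), len(both_domains))
```

In this toy graph, the pages that exist only under orst.edu are never reached when the scope is oregonstate.edu alone, which is the kind of gap described above.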


Doug



_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
