-1

        I found the instructions for doing an "Intranet crawl" extremely
helpful for getting up and running quickly.  I went back later and
figured out more about what it was actually doing.  Perhaps the name
could just be changed to "Single Site Crawling with the Nutch Shell
Script" and some explanatory text could be added.

        I'll try to take the time today to put a version of the tutorial
on the wiki that does that.  Then if folks agree, I'll put together a
patch that changes the site links for the tutorial to point at the wiki.

Thanks,
Jake.

-----Original Message-----
From: Franz Werfel [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, March 07, 2006 3:01 AM
To: [email protected]
Subject: Re: project vitality? / less documentation is more!

Hello,

Just my 2 cents: the "Intranet crawl" functionnality is VERY confusing.

If it was just taken out of the tutorial, and out of the set of
commands, that would actually help A LOT: I understood many many
things about Nutch once I tried so-called whole-web crawling, where
one has to use every command one at a time. And that would also
eliminate all the questions about "how to recrawl", etc.

Or maybe a change of name would be enough: "Intranet crawl" could be
called "fast-setup crawl", and "whole-web crawling", "serious crawling
for Intranet or whole-web projects".

What do you think?

Thanks, Frank.


-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid0944&bid$1720&dat1642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to