According to Rich Thomas:
> I've downloaded and installed the latest beta htdig-3.2.0b3 on a Solaris 8
> box.  It installed easily.

I wouldn't recommend htdig-3.2.0b3.  It's over a year old and has some
pretty serious bugs, including a nasty one in the code that handles
file:// URLs.  Pick up the latest 3.2.0b4 snapshot from
http://www.htdig.org/files/snapshots/

> Looking at the documentation I can't find where the file:// variable gets
> set.  Will file:// be significantly faster than http:// spidering if it's
> run on a local machine?

First of all, file:// isn't a variable (nor a configuration attribute)
that you set, so you'd have a hard time finding it in the documentation.
It's a type of URL, just like http:// is a type of URL.  3.1.6 only handles
http://, whereas the 3.2 betas also handle file:// internally, and can
handle others via external transports.  You set the URLs you want to
index using the start_url attribute.

See http://www.htdig.org/attrs.html#start_url
and http://www.htdig.org/FAQ.html#q4.18

Spidering will be faster on the local file system than with HTTP
transfers, because there's less overhead in fetching the files.  Whether
it's significantly faster depends on how significant the overhead is as
compared to the time to index the documents once fetched.  You can also
speed up the fetching of some http:// URLs using the local_urls attribute.
See http://www.htdig.org/attrs.html#local_urls.;  This works in 3.1.6 too.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to