According to Rich Thomas: > I've downloaded and installed the latest beta htdig-3.2.0b3 on a Solaris 8 > box. It installed easily.
I wouldn't recommend htdig-3.2.0b3. It's over a year old and has some pretty serious bugs, including a nasty one in the code that handles file:// URLs. Pick up the latest 3.2.0b4 snapshot from http://www.htdig.org/files/snapshots/ > Looking at the documentation I can't find where the file:// variable gets > set. Will file:// be significantly faster than http:// spidering if it's > run on a local machine? First of all, file:// isn't a variable (nor a configuration attribute) that you set, so you'd have a hard time finding it in the documentation. It's a type of URL, just like http:// is a type of URL. 3.1.6 only handles http://, whereas the 3.2 betas also handle file:// internally, and can handle others via external transports. You set the URLs you want to index using the start_url attribute. See http://www.htdig.org/attrs.html#start_url and http://www.htdig.org/FAQ.html#q4.18 Spidering will be faster on the local file system than with HTTP transfers, because there's less overhead in fetching the files. Whether it's significantly faster depends on how significant the overhead is as compared to the time to index the documents once fetched. You can also speed up the fetching of some http:// URLs using the local_urls attribute. See http://www.htdig.org/attrs.html#local_urls.; This works in 3.1.6 too. -- Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]> Spinal Cord Research Centre WWW: http://www.scrc.umanitoba.ca/~grdetil Dept. Physiology, U. of Manitoba Phone: (204)789-3766 Winnipeg, MB R3E 3J7 (Canada) Fax: (204)789-3930 _______________________________________________ htdig-general mailing list <[EMAIL PROTECTED]> To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe FAQ: http://htdig.sourceforge.net/FAQ.html

