At 12:22 AM +1000 8/3/01, Marc Cosgrove wrote:
>website using http://127.0.0.1 or similar, but if I use
>file:///var/www/html, then the DOC PDF XLS files do not index. I presume
>that the problem is that htdig does not determine a MIME type when used this
You don't mention which version you're trying to use. The 3.1.x series
most certainly won't deal with file:// URLs, and the local_urls
feature in both the production branch and the 3.2 betas doesn't resolve
MIME types. (The 3.2 code does deal with file:// URLs directly and
has a mime_types attribute to point to a mime.types file.)
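
As a rough illustration only, a 3.2 configuration along these lines
might look like the fragment below. The document root is taken from
your message; the location of mime.types is an assumption and will
differ between systems.

```
# htdig.conf fragment for a 3.2 snapshot (paths are assumptions)
start_url:   file:///var/www/html/
mime_types:  /etc/mime.types
```
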
>Is there any way around this, as I would prefer using file:/// rather than
>http://127.0.0.1 to take a load off the web server.
If you insist on using file:// URLs, you will want to grab the latest
snapshot of 3.2: <http://www.htdig.org/files/snapshots/>. If you're
willing to use local_urls, but want to index certain file types
beyond .html, .htm, .txt, and .pdf, for now you'll need to hack the
code in htdig/Document.cc::Retrieve_Local to allow the extension you
want.
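
To give an idea of the kind of change involved, here is a minimal,
standalone sketch of an extension-to-MIME-type lookup. It is not the
actual code from Document.cc; the function name guess_mime_type and
the .doc/.xls entries are hypothetical and only illustrate the sort
of check you would extend.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical helper, NOT taken from htdig: guess a MIME type from a
// local file's extension. Returns an empty string when the extension
// is unknown, so the caller can fall back to fetching over HTTP.
std::string guess_mime_type(const std::string &path)
{
    std::string::size_type dot = path.rfind('.');
    std::string::size_type slash = path.rfind('/');

    // No extension, or the last dot belongs to a directory name.
    if (dot == std::string::npos ||
        (slash != std::string::npos && dot < slash))
        return "";

    // Compare extensions case-insensitively (.PDF vs .pdf).
    std::string ext = path.substr(dot + 1);
    std::transform(ext.begin(), ext.end(), ext.begin(),
                   [](unsigned char c) {
                       return static_cast<char>(std::tolower(c));
                   });

    if (ext == "html" || ext == "htm") return "text/html";
    if (ext == "txt")                  return "text/plain";
    if (ext == "pdf")                  return "application/pdf";
    // Entries added for illustration (assumed, not in the stock list):
    if (ext == "doc")                  return "application/msword";
    if (ext == "xls")                  return "application/vnd.ms-excel";

    return "";
}
```

With this sketch, guess_mime_type("/var/www/html/report.XLS") would
give "application/vnd.ms-excel", and an unrecognized extension gives
an empty string.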
--
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/