At 12:22 AM +1000 8/3/01, Marc Cosgrove wrote:
>website using http://127.0.0.1 or similar, but if I use
>file:///var/www/html, then the DOC PDF XLS files do not index. I presume
>that the problem is that htdig does not determine a MIME type when used this

You don't mention which version you're trying to use. The 3.1.x series 
most certainly won't deal with file:// URLs, and the local_urls 
feature in both the production branch and the 3.2 betas doesn't resolve 
MIME types. (The 3.2 code does deal with file:// URLs directly and 
has a mime_types attribute to point to a mime.types file.)
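
To illustrate why a mime.types file matters for file:// URLs (there is 
no web server to send a Content-Type header), here is a minimal Python 
sketch of an extension lookup against such a file. It is not ht://Dig's 
code: the sample table, parse_mime_types, and guess_type are 
hypothetical names made up for this example, and the only assumption 
about the file is the common "type ext1 ext2 ..." layout.

```python
import os

# Hypothetical sample in the common mime.types layout: "type ext1 ext2 ..."
SAMPLE_MIME_TYPES = """\
# MIME type              extensions
text/html                html htm
text/plain               txt
application/pdf          pdf
application/msword       doc
application/vnd.ms-excel xls
"""


def parse_mime_types(text):
    """Build an extension -> MIME type table from mime.types-style text."""
    table = {}
    for line in text.splitlines():
        # Drop comments and surrounding whitespace; skip empty lines.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        mime_type, *extensions = line.split()
        for ext in extensions:
            table[ext.lower()] = mime_type
    return table


def guess_type(path, table, default=None):
    """Return the MIME type for a path based only on its file extension."""
    ext = os.path.splitext(path)[1].lstrip(".").lower()
    return table.get(ext, default)


MIME_TABLE = parse_mime_types(SAMPLE_MIME_TYPES)
```

With this table, guess_type("/var/www/html/budget.xls", MIME_TABLE) 
returns "application/vnd.ms-excel", while a file with no extension 
falls back to the default (None here), which is roughly the situation 
a crawler is in when nothing tells it the type.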

>Is there any way around this, as I would prefer using file:/// rather than
>http://127.0.0.1 to take a load off the web server.

If you insist on using file:// URLs, you will want to grab the latest 
snapshot of 3.2: <http://www.htdig.org/files/snapshots/>. If you're 
willing to use local_urls, but want to index certain file types 
beyond .html, .htm, .txt, and .pdf, for now you'll need to hack the 
code in htdig/Document.cc::Retrieve_Local to allow the extension you 
want.

-- 
-Geoff Hutchison
Williams Students Online
http://wso.williams.edu/
