Malcolm is correct - the URL does return an incorrect Content-Type line in 
the header. See the results from wget (a UNIX command line HTTP file 
grabber).

=================================
wget -S 
'http://fox97.com/features/Favorite_moments_from_Randy_and_Spiff.html'
--10:14:56-- 
http://fox97.com/features/Favorite_moments_from_Randy_and_Spiff.html
           => `Favorite_moments_from_Randy_and_Spiff.html'
Connecting to fox97.com:80... connected!
HTTP request sent, awaiting response... 200 OK
2 Date: Tue, 08 Jan 2002 21:14:59 GMT
3 Server: Apache/1.3.12 (Unix)
4 Vary: Host
5 Set-Cookie: Apache=203.97.83.34.215961010524499168; path=/; expires=Wed, 
02-Jul-03 21:14:59 GMT
6 Connection: close
7 Content-Type: text/html
8

    0K .......... .......... .                                @   5.12 
KB/s

10:15:01 (5.11 KB/s) - `Favorite_moments_from_Randy_and_Spiff.html' saved 
[22114]
===================================

The easiest fix is to get your webserver to return the corrent Content 
type. Alternatively, and less easy, is to find the Content-Type-Alias fix 
I posted
to htdig-dev a few months ago, which allows you to tell htdig to alter a 
document's Content-Type before parsing, and patch your htdig installation 
- 
I wrote that for this very problem.

Jamie Anstice
Search Scientist,  S.L.I. Systems, Inc
[EMAIL PROTECTED]
ph:  64 961 3262
mobile: 64 21 264 9347





Daniel Escobar <[EMAIL PROTECTED]>
Sent by: [EMAIL PROTECTED]
09/01/02 09:37
Please respond to daniel

 
        To:     Malcolm Austen <[EMAIL PROTECTED]>
        cc:     ht://Dig list <[EMAIL PROTECTED]>
        Subject:        Re: [htdig] search results and tittle


> If you could actually quote us a URL then we could check this theory but
> until then it remains a theory.

Malcolm,  below is a link to a search box that will search one of my 
sites.
You can do a query on whatever you would like, but I suggest using "Randy"
as one of your searches, since I know that will show you my problem. (some
searches show as [filename.html], and others show the actual tittle).

http://fox97.com/common/htdig/ssi/wfox-fm.html

Please let me know if you have any other ideas/comments :)

Thanks again,

-daniel

 to believe the MIME content
> type or to ignore it and believe the file extention instead.
>
> If you could actually quote us a URL then we could check this theory but
> until then it remains a theory.
>
> regards,
>         Malcolm.
>
>  [EMAIL PROTECTED]     http://users.ox.ac.uk/~malcolm/

--
Daniel Escobar
Software Engineer
Cox Radio Interactive
678.860.2749 (cellular)
404.979.7860 (office)




_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to 
<[EMAIL PROTECTED]> with a subject of 
unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html




_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to