According to Malcolm Austen:
> OK, Opera tell me that the MIME content type is indeed correct but I
> wonder whether the answer lies in the HTML coding. Maybe someone with
> better (some!) knowledge of the actual htdig code could comment better ...
> the page starts:
> 
> <!--adstart--><!--adend-->
> <!--htdig_noindex-->
> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
> 
> <html>
> <head>
> <!--/htdig_noindex-->
> <title>
> 
> 
>  wfox-fm     Randy & Spiff Bios </title>
> <!--htdig_noindex-->
...
> It might be that the (pointless) turning off of indexing through the
> doctype and html/head elements has screwed the analysis. However I can see
> another page that looks much the same and does display the title in the
> search results. My next best guess is that the bare ampersand (illegal
> surely, shouldn't it be &amp;) may be throwing something. I think I could
> shoot down that theory too but there are certainly bare ampersands lurking
> in a number of places.

No, htdig should properly handle bare ampersands, and shouldn't care
if the html/head tags get swallowed up.  I can't reproduce the problem
with 3.1.x right now.  I haven't ruled out a 3.2 bug yet, though that
too seems unlikely to me.  I'll wait to hear back from Daniel about his
version before trying further tests.

> I'm also not clear what the purpose of the metadata
> is if you then tell the search robot to ignore it!

Good point, albeit a side-issue.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to