According to Malcolm Austen:
> OK, Opera tell me that the MIME content type is indeed correct but I
> wonder whether the answer lies in the HTML coding. Maybe someone with
> better (some!) knowledge of the actual htdig code could comment better ...
> the page starts:
>
> <!--adstart--><!--adend-->
> <!--htdig_noindex-->
> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
>
> <html>
> <head>
> <!--/htdig_noindex-->
> <title>
>
>
> wfox-fm Randy & Spiff Bios </title>
> <!--htdig_noindex-->
...
> It might be that the (pointless) turning off of indexing through the
> doctype and html/head elements has screwed the analysis. However I can see
> another page that looks much the same and does display the title in the
> search results. My next best guess is that the bare ampersand (illegal
> surely, shouldn't it be &amp;) may be throwing something. I think I could
> shoot down that theory too but there are certainly bare ampersands lurking
> in a number of places.

No, htdig should properly handle bare ampersands, and shouldn't care if
the html/head tags get swallowed up. I can't reproduce the problem with
3.1.x right now. I haven't ruled out a 3.2 bug yet, though that too seems
unlikely to me. I'll wait to hear back from Daniel about his version
before trying further tests.

> I'm also not clear what the purpose of the metadata
> is if you then tell the search robot to ignore it!

Good point, albeit a side-issue.

--
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
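
For anyone who wants to check the quoted page fragment in isolation, here is a rough Python sketch. It is not htdig's code and makes no claim about how htdig parses internally; it only assumes that everything between the <!--htdig_noindex--> and <!--/htdig_noindex--> markers is dropped, and then looks for a title in what is left. The names extract_title, TitleGrabber and PAGE are made up for this illustration.

```python
import re
from html.parser import HTMLParser

# Assumption: the noindex markers are exactly the comments used on the quoted page.
NOINDEX = re.compile(r"<!--htdig_noindex-->.*?<!--/htdig_noindex-->", re.DOTALL)


class TitleGrabber(HTMLParser):
    """Collects the text found between <title> and </title>."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.in_title = False
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.parts.append(data)


def extract_title(page):
    # Drop the closed noindex regions, then parse whatever remains.
    visible = NOINDEX.sub("", page)
    grabber = TitleGrabber()
    grabber.feed(visible)
    grabber.close()
    # Collapse the stray newlines and spaces around the title text.
    return " ".join("".join(grabber.parts).split())


# The start of the page as quoted in the message above.
PAGE = """<!--adstart--><!--adend-->
<!--htdig_noindex-->
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">

<html>
<head>
<!--/htdig_noindex-->
<title>


wfox-fm Randy & Spiff Bios </title>
<!--htdig_noindex-->
"""

print(extract_title(PAGE))  # wfox-fm Randy & Spiff Bios
```

In this sketch the title survives even though the doctype, html and head tags are swallowed by the noindex region, and the bare ampersand passes through unchanged. That only shows the markup itself need not defeat a tolerant parser; it says nothing about what a particular htdig version does with it.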

