Hi, this is probably an old issue, but I cannot search the archives efficiently on these keywords.
Anyway, when searching a hypermail archive with htdig, the results are pretty useless. Three things need fixing:

1. The subject needs to be shown where the page title currently is (or where only a partial subject line appears).
2. The date needs to be the email's date, not the .html file's date.
3. The first xxx lines of each message page need to be filtered out of both the displayed excerpt and the index.

For messages where the mail file is still around, I can have hypermail regenerate the HTML as needed, but many thousands exist only as HTML; the mbox is long since deleted. And when you're searching a few hundred thousand messages, it's ugly when all of the relevant hits are hundreds of results down and don't "make the cut" because of the repeated subject/title line (and the weighting depends on how many times some emails were re-re-re-quoted).

I searched contrib but couldn't find anything useful. I know others must have done something like this before, and I beg forgiveness for asking something I know is answered... but have you ever tried searching on "htdig hypermail archive"? Come to think of it, the htdig.org archive results page looks pretty messed up, so maybe it's not "fixed".

If I'm using the wrong search package for this, feel free to suggest another. Going crazy here.

TIA

--
htdig-general mailing list
FAQ: http://htdig.sourceforge.net/FAQ.html
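To make the request concrete, here is a rough sketch of the kind of preprocessing pass that could be run over the HTML-only messages before htdig indexes them. It rests on assumptions that should be checked first: that the hypermail pages carry comment markers such as <!-- subject="..." -->, <!-- sent="..." --> and <!-- body="start" -->, and that htdig skips text between <!--htdig_noindex--> and <!--/htdig_noindex-->. The function names extract_meta and mark_header_noindex are made up for illustration and are not part of either package.

```python
import re

# Assumed hypermail comment markers; check them against your own archive files.
META_RE = re.compile(r'<!--\s*(\w+)="(.*?)"\s*-->')
BODY_START = '<!-- body="start" -->'

# Comment tags that htdig is assumed to treat as "do not index this region".
NOINDEX_OPEN = "<!--htdig_noindex-->"
NOINDEX_CLOSE = "<!--/htdig_noindex-->"


def extract_meta(html):
    """Return the key/value pairs found in hypermail-style comments."""
    meta = {}
    for key, value in META_RE.findall(html):
        meta.setdefault(key, value)  # keep the first occurrence of each key
    return meta


def mark_header_noindex(html):
    """Wrap the banner between <body> and the message body in noindex tags."""
    body_tag = re.search(r"<body[^>]*>", html, re.IGNORECASE)
    start = html.find(BODY_START)
    if body_tag is None or start == -1 or start < body_tag.end():
        return html  # layout not recognised; leave the page untouched
    head = html[:body_tag.end()]
    banner = html[body_tag.end():start]
    rest = html[start:]
    return head + NOINDEX_OPEN + banner + NOINDEX_CLOSE + rest
```

The subject and sent values returned by extract_meta could then be used to rewrite each page's title or to supply a per-message date, but how htdig should pick those up depends on its configuration and is left open here.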

