Hi

This is probably an old issue but I cannot search the
archives efficiently on these keywords.

Anyway, when searching a hypermail archive with htdig
the results are pretty useless. The subject needs to
be where the page title is (or only a partial subj
line), the date needs to be the email date not the
.html file's date, and the first xxx lines need to be
filtered from displaying and being indexed.

With the messages where the mail file is still around,
I can have hypermail regenerate the htmls as needed;
but many thousands are just in html - the mbox is long
since deleted.

And when you're searching a few hundred thousand
messages, it's ugly when all of the relevant hits are
hundreds down and don't "make the cut" because of the
repeated subject/title line (and weighting depends on
how much some emails were re-re-re-quoted)

I searched contrib but couldn't find anything useful.
I know others must have done something like this
before, and I beg forgiveness for asking something I
know is answered... but have you ever tried searching
on "htdig hypermail archive" ?

Come to think of it, the htdig.org archive results
page looks pretty messed up, so maybe it's not "fixed"

If I'm using the wrong search package for this, feel
free to suggest another. Going crazy here.

TIA


__________________________________________________
Do You Yahoo!?
Yahoo! Finance - Get real-time stock quotes
http://finance.yahoo.com


-------------------------------------------------------
This sf.net email is sponsored by: OSDN - Tired of that same old
cell phone?  Get a new here for FREE!
https://www.inphonic.com/r.asp?r=sourceforge1&refcode1=vs3390
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to