Forrest, Just a wild, "off the top of my head" idea: can your email archives be accessed by webmail? If so, could you try pointing htdig at your webmail server? I don't know what problems you may find, but it's worth a try.
David Adams Corporate Information Services Information Systems Services University of Southampton ----- Original Message ----- From: "Forrest Aldrich" <[EMAIL PROTECTED]> To: "Jim" <[EMAIL PROTECTED]> Cc: <[EMAIL PROTECTED]> Sent: Tuesday, September 07, 2004 6:20 PM Subject: Re: [htdig] Large email archives... > Hi Jim, > > You're probably right. > > I did however expect to use Maildir, knowing that indexing an mbox file > would be pretty useless. ;-) > > I also expected to run a web interface to perform the searches. > > Now: the interesting part here would be to interface with a modern > browser like Firefox, to utilize one of the messages in the search - > somehow injecting it into a message, etc., via plugins. > > Either way, someone must have come up with a good means of performing > this - I can't possibly be the only email hoarder out on the 'net! > > > > _F > > > > Jim wrote: > > > On Mon, 6 Sep 2004, Forrest Aldrich wrote: > > > >> However, with folders that exceed 20,000 messages in some cases, > >> using a standard IMAP client can be tiresome. > >> > >> I'm wondering if anyone out there has utilized something like htdig > >> to create better indices and searches on these folders. Either in > >> mbox or Maildir format? > >> > >> There simply has to be a better way...... > > > > > > You could perhaps kludge something together for this purpose, but I am > > not aware of any elegant way to use ht://Dig for searching mail > > folders. The functionality provided by the package is very much > > focused on web content. Also, the indexing is done at the file level, > > so indexing mbox files would probably be pointless; at best a search > > would tell you that a search term was somewhere in that mailbox. There > > are some tricks that you can use to index text files straight from > > disk, so Maildir might provide some possibilities. But even if you get > > the info into the index, you would still be limited on the search > > side. You would either need to perform your searches in a web browser > > or somehow wrap htsearch and do your own parsing and presentation. > > > > In short, while you might very well be able to cobble something > > together, I > > don't think ht://Dig is a particularly good match. > > > > Jim ------------------------------------------------------- This SF.Net email is sponsored by BEA Weblogic Workshop FREE Java Enterprise J2EE developer tools! Get your free copy of BEA WebLogic Workshop 8.1 today. http://ads.osdn.com/?ad_id=5047&alloc_id=10808&op=click _______________________________________________ ht://Dig general mailing list: <[EMAIL PROTECTED]> ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html List information (subscribe/unsubscribe, etc.) https://lists.sourceforge.net/lists/listinfo/htdig-general