When that time comes, there would be no issues with recompiling IMAP with larger mix data sizes right? It doesn't effect the old data files that were created using smaller MIXDATAROLL size.....?

I have mine defined at 10MB right now. And after a few weeks of usage, I do have lots of little files for my Trash folder. But on a large system, you've got a variety of users who maintain their mailboxes in different ways. Hopefully finding that magic number for MIXDATAROLL would help find that balance is not having too many little files.....

nancy

Per Foreby wrote:
On Tue, 16 Jan 2007, Mark Crispin wrote:

This topic has come up before. I ended up deferring action because analysis suggests that the problem is more cosmetic than anything else. My analysis suggested that the cost of a "cleanup" tool would be greater than its benefit.

Wouldn't a cleanup shell- or perl-script be easy to implement? Just loop through all .mix########, and unlink all files not referenced in the index file. Or am I missing something important?

UNIX filesystems have been subjected to mh, maildir, netnews, Cyrus, etc. mailstores for many many years; and mix will always do better.

But none of the directory based formats will do well if the number of files in becomes to large. On reiser, xfs, ext3 with dir_index or any other "smart" filesystem this is not a problem, but most people use filesystems which require sequential reads to find a file.

This is rarely a problem now since the mix format is new, but wait a few years, and we will se directories with huge amounts of files.

The (still-unwritten) convert-to-mix tool will write multiple data files instead of the single huge data file written by mailutil. Although the convert-to-mix tool won't use c-client to write the mix format, it will use c-client to *read* the source mailbox; thus it would be capable of doing a mix->mix conversion. So, given an extreme situation (like a mix directory with 50,000 files...) you will be able to do this.

Is it really unwritten? The scripts on http://andrew.triumf.ca/mbx/ (which were announced on this list a few months ago) seems to do the job, and are easy to modify if the original format isn't mix.

/Per
_______________________________________________
Imap-uw mailing list
[email protected]
https://mailman1.u.washington.edu/mailman/listinfo/imap-uw


--
-------------------------------------
Nancy Lin                        DECF
1109A Etcheverry Hall    510-642-7291
Office Hours:         2PM-4PM Mon-Thu
-------------------------------------
_______________________________________________
Imap-uw mailing list
[email protected]
https://mailman1.u.washington.edu/mailman/listinfo/imap-uw

Reply via email to