at the risk of repeating myself, this is what i was 
trying to get at earlier. i don't think that inventing 
"filesystem mime" for performance reasons makes sense.

what percentage of email even has attachments?

never mind a having a strictly reversable function such that
f*(f(email)) → email. in order to preserve the basic difference between
multipart/alternative and multipart/mixed, one would need a 
metadata file containing that information. in some sort of new format.

- erik

 Dave Eckhardt <[EMAIL PROTECTED]> writes

| 
| > Given that the current mbox file format doesn't scale well to
| > tens of thousands of emails, I'm not worried about the fact that
| > the file system doesn't either.
| 
| The standard hack (did MH ever pick this up?) is to split the
| directory, e.g., message 32456 is stored in 32/456 or 3/2/456.
| 
| The huge storage/locality win for separating out large attachments
| if you have venti-like storage (or want to cache virus-scan results)
| may well justify tearing the file apart on nextpart boundaries--my
| only concern is that the process be 100% reversible by upas when you
| ask.  Heck, it could be as simple as "each message is 1 directory,
| containing chunks stored one per file, named by integers, and the
| original message is formed by cat'ing the chunks in numerical order".
| That would let the storing agent decide whether the nextpart boundary
| foo belongs in the same chunk as the bits or in its own micro-chunk.
| 
| Dave Eckhardt
| 
| P.S. At present upas does the >From thing, which isn't reversible.

Reply via email to