In <[EMAIL PROTECTED]>,
Jeff Breidenbach <[EMAIL PROTECTED]> wrote:
>
>>This is the problem, HTML does not support mixed character sets.
>>Also, the charset affects the entire HTML document.  Therefore, your
>>resource settings would have to conform with the charset, and this
>>can be a big problem if messages existing in the archive have different
>>specified charsets.  It would be hard to guarantee that all messages
>>will use the same charset.
>
>I think I understand ... is this right?
>
>If an single email contains two different character sets,
>you're screwed, I understand that.

A potential solution would be to put the different message parts into
different files in the archive, and use the remainder of the message as
a container for URLs to those files, mimicking the MIME message
structure in HTML.  I haven't looked to see how/if one can do that in
MHonarc, but this seems like a problem similar to archiving a multi-part
with multiple graphic inclusions.

>If two emails are received, each with a different character set
>1) you are screwed on index pages, which will has a bunch
>   of subject lines from different character sets

Perhaps translate those to Unicode, as I think you're suggesting.
OTOH, char-sets in headers may be encoded in the RFC 2047 style,
so any translation may break out as a different problem anyway.

>2) you are screwed on message pages, because navigational aids
>   like the word "follow-ups" will be in a different character set
>   from the messages.

Rely on a graphical interface instead of text?  

BTW, while I applaud the desire to display localized headers I hope that
any reply/follow-up interface is sending the canonical RFC 822 & later
headers and keywords "on the wire", and not helping create messages like:

Subject: Re: Sv: Re: Ab: Re:


-- 
Denis McKeon

Reply via email to