> It is in UTF-8, but it is also violation of rfc822/rfc2047. Headers
> must
> be encoded. Computer program can't detect used character set, if sender
> does not specify which character set is used.

I was aware of that. However these emails exist and I can't get Outlook to 
change anyway.

> It is highly unlikely
> that
> all your malformed emails are in utf-8. You can have a mix of utf-8,
> iso-8859-1, iso-8859-13, iso-8859-15, windows-1252, windows-1257 and
> other
> character sets. Older Estonian emails are probably not in utf-8. If you
> try to fix all 8bit subjects, you will break malformed iso-8859-x
> Estonian
> texts that look ok in Outlook now.

The problem is only with the Subject header on a subset of emails. The bodies 
look OK before and after conversion. UTF8 subjects break after conversion from 
pst to dbmail via mbox.

My plan if all else fails is to generate the mbox files and then go through 
them searching for the UTF8 subjects and recode these only. Or modify libpst to 
do that which would be more efficient.

> If those utf-8 emails looked OK in Outlook, maybe problem is in libpst.

Maybe, however I remember having come across this problem with malformed 
Outlook subjects before. I'll investigate further next week.

Thanks for your comments Tomas!

Regards,

Aleksander Kamenik
System Administrator
Krediidiinfo AS
an Experian Company
Phone: +372 665 9649
Email: [email protected]
_______________________________________________
DBmail mailing list
[email protected]
http://mailman.fastxs.nl/cgi-bin/mailman/listinfo/dbmail

Reply via email to