On 2022-03-01 at 21:20:51 UTC-0500 (Tue, 01 Mar 2022 20:20:51 -0600)
Quinn Comendant <[email protected]>
is rumored to have said:

Thanks for both of your suggestions:

Not direct save but you could change view to “Show HTML Source”, […]

Bill's correct, that won't give me access to the raw HTML.

To get the HTML part of a multipart (or pure HTML) message, you need to use "Show Raw Message" […]

That's what I've been doing, but it's not convenient because the raw email part will be either base64 or quoted-printable encoded, which is not easy to decode. Base64 is easy to decode, but it seems to be less common. I'm not sure how to decode quoted-printable correctly.

A tiny Perl script I call 'decode-qp':

        #!/usr/bin/perl
        
        eval 'exec /usr/bin/perl  -S $0 ${1+"$@"}'
            if 0; # not running under some shell

        use MIME::QuotedPrint qw(decode_qp);

        while (<>) {
            print decode_qp($_);
        }

The reason I want to extract HTML messages is to analyze them to improve the spam filtering on the mail servers I manage, and it's really useful to be able to access the html.

If you're friendly with Perl, there is a tool called mimeexplode in the examples collection distributed with the MIME::Tools package which explodes a MIME message into a directory tree containing its constituent parts.

--
Bill Cole
[email protected] or [email protected]
(AKA @grumpybozo and many *@billmail.scconsult.com addresses)
Not Currently Available For Hire
_______________________________________________
mailmate mailing list
[email protected]
https://lists.freron.com/listinfo/mailmate

Reply via email to