On 4/10/23 11:32, Florian Lohoff via MIMEDefang wrote:

Hi,
i'd like to drop/replace HTML attachments/mails which contain active
components like javascript/javascript external refs.


        <script language="javascript></script>

or

        <html><head>
                <script type="text/javascript" src="http://a.b.c.d";></script>
        </head></html>

Basically going through all text/html etc parts. I am unshure whether
i'd need to really decode HTML with HTML::Parse or the like to find it
or if simple "regex" matching would be sufficient. Currently i am
dropping this by spamassassin with custom filters using regex.

Has anyone an example for this or experience which HTML perl module
is the most stable?

it can be done using HTML::Parser, and then running 
Mail::MIMEDefang::Actions:action_rebuild().
In some cases it can be tricky because html attachments could be base64 encoded.

 Giovanni

And while at it. I tried my luck to do this also with PDF with active
content, trying to parse PDF with CAM::PDF (or PDF::API2) to drop
PDFs with active content. So if anyone has suggestions here would
also be nice.

Flo


_______________________________________________
NOTE: If there is a disclaimer or other legal boilerplate in the above
message, it is NULL AND VOID.  You may ignore it.

MIMEDefang mailing list [email protected]
https://lists.mimedefang.org/mailman/listinfo/mimedefang_lists.mimedefang.org

Attachment: OpenPGP_signature
Description: OpenPGP digital signature

_______________________________________________
NOTE: If there is a disclaimer or other legal boilerplate in the above
message, it is NULL AND VOID.  You may ignore it.

MIMEDefang mailing list [email protected]
https://lists.mimedefang.org/mailman/listinfo/mimedefang_lists.mimedefang.org

Reply via email to