On 4/10/23 11:32, Florian Lohoff via MIMEDefang wrote:
Hi,
i'd like to drop/replace HTML attachments/mails which contain active
components like javascript/javascript external refs.
<script language="javascript></script>
or
<html><head>
<script type="text/javascript" src="http://a.b.c.d"></script>
</head></html>
Basically going through all text/html etc parts. I am unshure whether
i'd need to really decode HTML with HTML::Parse or the like to find it
or if simple "regex" matching would be sufficient. Currently i am
dropping this by spamassassin with custom filters using regex.
Has anyone an example for this or experience which HTML perl module
is the most stable?
it can be done using HTML::Parser, and then running Mail::MIMEDefang::Actions:action_rebuild(). In some cases it can be tricky because html attachments could be base64 encoded. Giovanni
And while at it. I tried my luck to do this also with PDF with active content, trying to parse PDF with CAM::PDF (or PDF::API2) to drop PDFs with active content. So if anyone has suggestions here would also be nice. Flo _______________________________________________ NOTE: If there is a disclaimer or other legal boilerplate in the above message, it is NULL AND VOID. You may ignore it. MIMEDefang mailing list [email protected] https://lists.mimedefang.org/mailman/listinfo/mimedefang_lists.mimedefang.org
OpenPGP_signature
Description: OpenPGP digital signature
_______________________________________________ NOTE: If there is a disclaimer or other legal boilerplate in the above message, it is NULL AND VOID. You may ignore it. MIMEDefang mailing list [email protected] https://lists.mimedefang.org/mailman/listinfo/mimedefang_lists.mimedefang.org
