On Wed, Oct 17, 2018 at 02:12:37PM -0500, Bruno Wolff III wrote:
> On Wed, Oct 17, 2018 at 14:48:52 -0400,
>   "tonynel...@georgeanelson.com" <tonynel...@georgeanelson.com> wrote:
> >> ... For html only messages you would either need to reject them or rewrite 
> >> them, both of which have issues.
> >I've used elinks to do that in an email forum I wrote. It worked better than 
> >doing it with, say, Beautiful Soup.
> 
> That is a big risk on your list serve processor. I would want to use 
> something 
> a lot safer than elinks (or lynx) to parse unsolicited email messages. What I 
> do at work is use a simple perl script, but it doesn't do a great job.

I use a perl script with these modules and some regexps to clean up
the result:

use HTML::Strip; use HTML::LinkExtor; use HTML::Entities
qw/decode_entities/; use URI::Escape qw/uri_unescape/;
_______________________________________________
devel mailing list -- devel@lists.fedoraproject.org
To unsubscribe send an email to devel-le...@lists.fedoraproject.org
Fedora Code of Conduct: https://getfedora.org/code-of-conduct.html
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: 
https://lists.fedoraproject.org/archives/list/devel@lists.fedoraproject.org

Reply via email to