-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On Wed, Aug 17, 2005 at 10:04:07AM +1000, Voytek wrote: > what is the best way to turn html mail into text, preferebaly before it > ends in my inbox ?
lynx -dump foo.html > output.text > if procmail, is there a ready made recipe for that ? You might need to use munpack followed by a bit of perl that looks at the unpacked files and makes decisions about what to keep, what to convert and how to pack it all up again. Someone else may have a suggestion for a good Open Source MIME translation engine, certainly procmail doesn't do it natively (yet). Remember that MIME email can contain a whole tree of mixed bits possibly including multiple copies of the same message in various formats. Also, spammers routinely send malformed MIME in an attempt to sneak past statistical filters (e.g. a large text block that looks "honest" to the filter, then an HTML block containing a completely different message that will be viewed in preference to the text). Might be a complex job to get it to handle all the variations correctly. - Tel -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.2.6 (GNU/Linux) iQIVAwUBQwMlXsfOVl0KFTApAQIB9w/+LT1JAemw9VZLQ5MsGaYTO5go2/zgYLuA qZIWDYwAZ7fOgFyq9tJxloDGacNkRejfmY3pMiBcydS6DejLAErMbgT8r5QxiPyH pKgJ97zfdGKPV+Di2UMfROEH3NLsdUtAe4zSnruAlJhdPRpEeOpS4VpH2OjQMRkf HHymdheUjw7FqcRSXOaGC7rvPkrZ7xLLUYESj35ZIN7wk3KgG3aYe0VUaZDmv2ob 1qvSIb9Ct7///hG/eqwp9W6iDqDgL30yzmrj0zHtOQtJh+o5Y+90Odgd7FuN31U6 GNSUl3W5+Qa6qLCFodqCNW8BfYyAW9M3Q9444X5uUzgrhb3up3CjpB/J7p+G8VUK p3Uvat6KGz8IOgA2Od8rtwRN5Gm78UW1nCrlvBPS8Ecmos/S27z2KfBxDirWoa3e r84xy/+nNoL1Hyuln15abVjivMVbnVkaQZNAZvyoEOKsDpgoYoIwp4MHdDDGko89 W4gzpv7wnZxchFFavlk5/Ji7a17Ym1fZ6Db1/umkXB1WRfqLfZEXFfluIJSsOKrf cZH7umvjOuXhAGgKQZdTWQ2In+HFvU3Piwz9Rv8mxaw3nUrD2QFTaVqXUEQEgzLN Typ0FznZV/viEqXHglRKx9DkS+aSGSje3lpZIHLpF93VMMFdfxMLmctJ+Nx+diYn bs8QDlcPFuw= =LM8t -----END PGP SIGNATURE----- -- SLUG - Sydney Linux User's Group Mailing List - http://slug.org.au/ Subscription info and FAQs: http://slug.org.au/faq/mailinglists.html