[PHP] biff_decode function

2004-08-02 Thread Richard Lynch
[de-lurk]
For some time now, the amount of unwanted email has been increasing 
in my Inbox.

The server has 'Assassin, and I've been using some Eudora 
hand-crafted filters to trash 'Assassin-marked emails, as well as 
re-directing various mailing lists to sub-folders.

Alas, that recently reached the point where I couldn't use my desktop 
machine to work, as it was spending far too many cpu cycles filtering 
email...

So now I have a PHP script on the server, whacking away at the junk, 
and putting things in the correct sub-folders, and my desktop box is 
useful again.

In addition to 'Assassin-marked junk, which catches about 85% of the 
junk, I've crafted some custom filters to get most of the rest of the 
junk, and can almost get back to work...

The single remaining feature I'd like to add to my custom PHP 
filtering is one that would catch emails whose subject lines have 
been encoded in a B1FF-like manner to disguise their junkiness.

E.g.:  Val'um in the subject line should be detectable as junk.

I've searched the web a bit, but am finding far too many references 
to the X-based email client named biff.  (Argh!)

Thus, I need your help finding/writing a biff_decode function.
Perhaps a function which, given a short (subject length) input of 
text, and a dictionary of bad words, would return some kind of 
value indicating a percent match of how close the text was to any 
of the bad words.

Or perhaps somebody on this list has a better idea how to tackle the problem.

Please cc me, as I am still struggling with catching up on mail/news, 
and would most likely not see the solution otherwise.

THANKS!!!
--
Like Music?
http://l-i-e.com/artists.htm
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


Re: [PHP] biff_decode function

2004-08-02 Thread Jason Wong
On Tuesday 03 August 2004 04:20, Richard Lynch wrote:

> Perhaps a function which, given a short (subject length) input of
> text, and a dictionary of bad words, would return some kind of
> value indicating a percent match of how close the text was to any
> of the bad words.

PHP has a number of built-in string functions that do string similarity 
comparisons (similar_text(), levenshtein(), soundex() and metaphone(), for 
example). I'm not sure how useful they would be for your purpose, so you 
would have to play around with them to find out.
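
For what it's worth, here is a minimal sketch of how the built-in levenshtein() 
might be combined with a small clean-up step to get the "percent match" 
described above. It assumes PHP 7 or later. The names biff_normalize() and 
biff_decode() are hypothetical, and the substitution table is only an 
illustrative guess at common disguises, not a complete list.

```php
<?php
// Hypothetical helper: undo a few common B1FF-style substitutions, then
// drop everything that is not a lowercase letter or whitespace.
// The substitution table is an assumption for illustration only.
function biff_normalize(string $text): string
{
    $from = ['0', '1', '3', '4', '5', '7', '@', '$', '!'];
    $to   = ['o', 'i', 'e', 'a', 's', 't', 'a', 's', 'i'];
    $text = str_replace($from, $to, strtolower($text));
    return preg_replace('/[^a-z\s]/', '', $text);
}

// Hypothetical biff_decode(): returns the best percent match (0 to 100)
// between any word of the subject and any word in the bad-word list,
// based on the edit distance reported by the built-in levenshtein().
function biff_decode(string $subject, array $badWords): float
{
    $best  = 0.0;
    $words = preg_split('/\s+/', biff_normalize($subject), -1, PREG_SPLIT_NO_EMPTY);
    foreach ($words as $word) {
        foreach ($badWords as $bad) {
            $bad     = strtolower($bad);
            $longest = max(strlen($word), strlen($bad));
            $score   = 100.0 * (1 - levenshtein($word, $bad) / $longest);
            $best    = max($best, $score);
        }
    }
    return $best;
}

// Example call; the one-word dictionary is a placeholder.
printf("%.1f\n", biff_decode("Val'um", ['valium']));
```

The cut-off above which a subject counts as junk is left to experiment. 
similar_text() also reports a percentage through its third argument and 
could be swapped in for the levenshtein() score to compare results.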

-- 
Jason Wong - Gremlins Associates - www.gremlins.biz
Open Source Software Systems Integrators
* Web Design & Hosting * Internet & Intranet Applications Development *
--
Search the list archives before you post
http://marc.theaimsgroup.com/?l=php-general
--
/*
Numeric stability is probably not all that important when you're guessing.
*/
