I think the best and most simple way is to use a limit for the length of a
single word (I mean a string without a space) in the subject
[^\s]{20,29}=>0.5
[^\s]{30,39}=>1
[^\s]{40,}=>3
Thomas
Von: K Post <[email protected]>
An: ASSP development mailing list <[email protected]>
Datum: 17.04.2011 21:40
Betreff: Re: [Assp-test] Blocking SpamWordsThatAreAllTogether
>
> (summarized)
> Fritz Borgstedt schreibt:
> > put this into bombSubjectRe
> (?ms-i:[a-z][A-Z].*?[a-z][A-Z].*?[a-z][A-Z])=>0.5
> should work
>
Thanks Fritz (and thomas for your posts in the other thread)
That should help. I'm a bit fearful of scoring legit mails. I can't
think
of real phrases that would trigger, but what if these were real?
eBayPayments, iTiVo, iPhonePro
I was hoping for some (magic) way of SPLITTING the long subjects based on
caps and then to a bayesian analysis of the extracted words. How tough
would it be where bayesian checking is done to have code split out strings
longer than say 20 characters by using caps as delimiters?
------------------------------------------------------------------------------
Benefiting from Server Virtualization: Beyond Initial Workload
Consolidation -- Increasing the use of server virtualization is a top
priority.Virtualization can reduce costs, simplify management, and improve
application availability and disaster protection. Learn more about
boosting
the value of server virtualization. http://p.sf.net/sfu/vmware-sfdev2dev
_______________________________________________
Assp-test mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-test
DISCLAIMER:
*******************************************************
This email and any files transmitted with it may be confidential, legally
privileged and protected in law and are intended solely for the use of the
individual to whom it is addressed.
This email was multiple times scanned for viruses. There should be no
known virus in this email!
*******************************************************
------------------------------------------------------------------------------
Benefiting from Server Virtualization: Beyond Initial Workload
Consolidation -- Increasing the use of server virtualization is a top
priority.Virtualization can reduce costs, simplify management, and improve
application availability and disaster protection. Learn more about boosting
the value of server virtualization. http://p.sf.net/sfu/vmware-sfdev2dev
_______________________________________________
Assp-test mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-test