I think the best and most simple way is to use a limit for the length of a 
single word (I mean a string without a space) in the subject

[^\s]{20,29}=>0.5
[^\s]{30,39}=>1
[^\s]{40,}=>3

Thomas






Von:    K Post <[email protected]>
An:     ASSP development mailing list <[email protected]>
Datum:  17.04.2011 21:40
Betreff:        Re: [Assp-test] Blocking SpamWordsThatAreAllTogether




>
> (summarized)



> Fritz Borgstedt schreibt:
> > put this into bombSubjectRe
> (?ms-i:[a-z][A-Z].*?[a-z][A-Z].*?[a-z][A-Z])=>0.5
> should work
>

Thanks Fritz (and thomas for your posts in the other thread)

That should help.  I'm a bit fearful of scoring legit mails.  I can't 
think
of real phrases that would trigger, but what if these were real?
eBayPayments, iTiVo, iPhonePro

I was hoping for some (magic) way of SPLITTING the long subjects based on
caps and then to a bayesian analysis of the extracted words.  How tough
would it be where bayesian checking is done to have code split out strings
longer than say 20 characters by using caps as delimiters?
------------------------------------------------------------------------------
Benefiting from Server Virtualization: Beyond Initial Workload 
Consolidation -- Increasing the use of server virtualization is a top
priority.Virtualization can reduce costs, simplify management, and improve 

application availability and disaster protection. Learn more about 
boosting 
the value of server virtualization. http://p.sf.net/sfu/vmware-sfdev2dev
_______________________________________________
Assp-test mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-test




DISCLAIMER:
*******************************************************
This email and any files transmitted with it may be confidential, legally 
privileged and protected in law and are intended solely for the use of the 

individual to whom it is addressed.
This email was multiple times scanned for viruses. There should be no 
known virus in this email!
*******************************************************


------------------------------------------------------------------------------
Benefiting from Server Virtualization: Beyond Initial Workload 
Consolidation -- Increasing the use of server virtualization is a top
priority.Virtualization can reduce costs, simplify management, and improve 
application availability and disaster protection. Learn more about boosting 
the value of server virtualization. http://p.sf.net/sfu/vmware-sfdev2dev
_______________________________________________
Assp-test mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-test

Reply via email to