Joe Emenaker <[EMAIL PROTECTED]> wrote:

>         (\w)\1{2}
[...]
> You probably want to give some points to any word with three or more 
> (since a scan through my spell dict didn't find ANY English words with 
> more than two of the same char in a row), [...]

Note that \w matches letters and numbers, so that regex will 
match the string "000", which is pretty common in various 
contexts, as well as less common number strings.  You might 
want to use '[A-Za-z]' in place of '\w' (or use '[a-z]' with 
the /i modifier at the end of the regex to make it case-
insensitive).

Even with that change, it will match "www.google.com" and 
"American Automobile Association (AAA)" and "Henry VIII" and 
"Baadasssss! (2004)" and "page xxxii" and various other non-
spam-indicating words, so be very careful with constructing and 
scoring such a rule.

-- 
Keith C. Ivey <[EMAIL PROTECTED]>
Washington, DC

Reply via email to