Joe Emenaker <[EMAIL PROTECTED]> wrote:
> (\w)\1{2}
[...]
> You probably want to give some points to any word with three or more
> (since a scan through my spell dict didn't find ANY English words with
> more than two of the same char in a row), [...]
Note that \w matches letters and numbers, so that regex will
match the string "000", which is pretty common in various
contexts, as well as less common number strings. You might
want to use '[A-Za-z]' in place of '\w' (or use '[a-z]' with
the /i modifier at the end of the regex to make it case-
insensitive).
Even with that change, it will match "www.google.com" and
"American Automobile Association (AAA)" and "Henry VIII" and
"Baadasssss! (2004)" and "page xxxii" and various other non-
spam-indicating words, so be very careful with constructing and
scoring such a rule.
--
Keith C. Ivey <[EMAIL PROTECTED]>
Washington, DC