I believe I found why the regex crashes ASSP, but I'm not good enough in
writing regex's to solve it.

As I said earlier the regex causing the crash here is the one for
"side-effect" from the ASSP wiki:

\b[S$5]+\s?\S?\s?\W?[I1\!\|lt]+\s?\S?\s?\W?D+\s?\S?\s?\W?(\s?\S?\s?\W?)+
\s?\S?\s?\W?\s?\S?\s?\W?F+\s?\S?\s?\W?F+\s?\S?\s?\W?\s?\S?\s?\W?C+\s?\S?
\s?

I tested this on rexv.org and found out that the regex works fine to a
certain extent. In the following sentence "side-effect" is caught:

There will be no side-effect

But things change when the string contains something similar like
"side", for example the word "said":

Said betty, there will be no side-effect

The regex does not just match "side-effect" but matches the whole
sentence starting from "said". So "said betty, there will be no
side-effect" is matched completely, not only "side-effect".

One of the spam mails that crashes my ASSP contains some text and a gif,
in total about 80k. At the beginning the mail contains "Said betty..."
following a lot of data. So the regex matches starting from "Said" to
somewhere at the end of the mail. So the matching string is about 50k
long. Rexv.org cannot do it and I believe ASSP chokes on this too.

I hope this makes sense and that someone finds this information useful.

Cheers
Dajo



-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Assp-user mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/assp-user

Reply via email to