Hello Larry,

Wednesday, January 21, 2004, 11:37:09 PM, you wrote:

LG> Along the same lines, I had the following:

LG> describe MY_RBDY_INVTXTSZ1  MY: Invisible text size
LG> rawbody  MY_RBDY_INVTXTSZ1  /font\s+.*\bsize=.-\d\D/i
LG> score    MY_RBDY_INVTXTSZ1  0.5

LG> describe MY_RBDY_INVTXTSZ2  MY: Invisible text size with style
LG> rawbody  MY_RBDY_INVTXTSZ2  /size=.-\d\D style=.font-size: \dpx;\D/i
LG> score    MY_RBDY_INVTXTSZ2  0.5

LG> describe MY_RBDY_INVIMGSZE  MY: Invisible image size
LG> rawbody  MY_RBDY_INVIMGSZE  /width=.1\D height=.1\D/i
LG> score    MY_RBDY_INVIMGSZE  0.5

LG> They seem to hit legit opt-in advertisements though.  I would be
LG> curious to see the results against your corpus if you are inclined
LG> and willing to spend the time.

MY_RBDY_INVTXTSZ1 -- 5002s/876h of 91714 corpus (74113s/17601h) 01/22/04

Note that a <font size=-1> or -2 is not an invisible font size, but
simply a reduced font size.

MY_RBDY_INVTXTSZ2 -- 463s/0h of 91714 corpus (74113s/17601h) 01/22/04

I wouldn't think that any font-size in pixels would be "invisible", since
one could do anything from tiny to gigantic using that specification, but
in my corpus it hits only spam.

MY_RBDY_INVIMGSZE -- 2190s/158h of 91714 corpus (74113s/17601h) 01/22/04

ham hit is mostly HTML newsletters from accepted sources and/or
YahooGroups.

Bob Menschel





-------------------------------------------------------
The SF.Net email is sponsored by EclipseCon 2004
Premiere Conference on Open Tools Development and Integration
See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
http://www.eclipsecon.org/osdn
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to