There are a number of tactics of URL obfuscation that could easily
permanently kill a filter that was totally reliant on scanning urls.

First off, note that in an HTML link, the text displayed for a link is not
tied to the link itself.  The URL below could easily say
http://www.yahoo.com/ and still point to http://24.203.43.129/freegoop.html.
Thus, yes... your grandmother would still be fooled.

Second off, there is the issue with throw-away domains.  Spamcop probably
has a reasonably fast path to actually discovering a new throw-away domain
and thus, the domain will make its way into sburl.  However, also note that
Spamassassin has been around for some years now and is widely used by many
more people than those use spamassassin or even razor.  With improvements to
code and a broader user-base Razor could foreseeably become as effective as
SpamCop in registering new findings.

Third off, there are a significant number of ways to obfuscate/encode URLs.
Commonly, most spam still uses a straight hostname based URL which keeps
sburl pretty effective.  However, it is also feasible that more and more
spammers will use a legitimate looking text display for a link and use an
encoded URL, not visible to the average user, within the A HREF tag.  Such
methods could include using the IP; which could be broken up a number of
ways, not just into 4 octets; or using Unicode or hex characters to provide
a high number of unique combinations that, if used effectively, would take a
very long time to get into sburl.

I do not know enough about the source code  or razor to know if the same
methods would spell havoc as well.  I would imagine so.  In any case, the
main point of this email is that this will be an ever evolving battle
against spam since there a variety of tactics that are not being used yet or
even thought of.  I don't see any one solution that will be wholey effective
on its own.  Sure, sburl works great now but that could easily not be the
case a year from now since its approach is somewehat limited.

Joe Gilbert


-----Original Message-----
From: John Andersen [mailto:[EMAIL PROTECTED]
Sent: Wednesday, April 21, 2004 11:58 PM
To: [EMAIL PROTECTED]
Subject: Re: [Razor-users] Poor detection ratio


On Wednesday 21 April 2004 04:35, Tom Allison wrote:

> > spamassassin wasn't catching had a very high percentage of links to
> > domains that were under a month old, most under a week old.
>
> This would work up until they moment that they change their addresses to:
> http://24.203.43.129/freegoop.html

True, but even my grandmother wouldn't trust numeric IPs in
a url....

-- 
_____________________________________
John Andersen


-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
Razor-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/razor-users


-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
Razor-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/razor-users

Reply via email to