On Wed, 27 Dec 2017 12:47:00 -0500
Alex <mysqlstud...@gmail.com> wrote:

> It [fetching URLs] would also probably lead to inadvertently
> unsubscribing people from mailing lists.

Yes, if the lists use badly-written mailing list software.

At most, I would do a HEAD on a URL and not a GET.  HEAD is probably safer
and will usually tell you if the link is a redirect.  You also want
to fake the user-agent to be a common Windows browser because some
malware servers look at the User-Agenet and return a 404 if they think
the client is not a real Web browser.

Even a HEAD can be dangerous; there's an Internet "security" [sic] company
out there that shall remain nameless; these geniuses view HEAD requests
as attacks and report you to your ISP.  It took me 2+ weeks to sort out
their BS "abuse" complaints.

> I'd like to think some intelligence could be built into such a system,
> and know many of the spam companies like Symantec and Mimecast are
> doing this to differing degrees.

I know of one company that collects URLs and has a central server farm that
analyzes them (ie, the URL fetching is done on a completely different
set of machines than the spam filtering.)  They have all kinds of heuristics
and special-case code to make it relatively safe.

Regards,

Dianne.

Reply via email to