http://bugzilla.spamassassin.org/show_bug.cgi?id=3886





------- Additional Comments From [EMAIL PROTECTED]  2004-10-20 13:57 -------
Subject: Re:  update URIBL whitelist with persistent high volume only

> I may need to re-engineer things then.  The data is already getting
> somewhat large.  Is it more important to have more days or more hits?
> Could I reduce the sampling rate, but increase the time period?
> I.e. the same number of queries over 20 days, etc.

The data should take very little space.  Just produce summaries: collect
as much data is reasonably possible each day, then produce a single
summary count for each day.  Once that summary is produced, delete the
raw data.

each day:

  N yahoo.com
  M aol.com
  etc.

then add the totals for each domain across all days (and those summaries
should be relatively very small for each day allowing a very large
number of days)





------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.

Reply via email to