[Mimedefang] two md_check_against_smtp_server questions

John Rudd Sun, 03 Dec 2006 18:55:54 -0800

1) What does MD fill in if you leave the $helo argument blank? Does it fill in the host's own hostname? Try to send a blank? Something else? I have one mimedefang-filter that I deploy on 5 machines, so it would be nice not to have to customize this in any way. If MD doesn't fill in a blank with "the right thing", can I make this into a feature request?

2) Has anyone set up a means of caching results? I don't want to hit my back-line servers constantly with these requests. I would prefer to have results cached for, say, 2 hours. I'm trying to think of a good way to do this.

One thought I had was to have each machine keep an external database where the email address is the key and the record holds 2 values: time last checked, and account state (ok, unknown, over-quota). Then I'd process it like this:


If (address is cached) && ((now - last_checked) <= cache_life)
   use the cached result

If (address is not cached) || ((now - last_checked) > cache_life)
   if the address is valid (via md_check_against_smtp_server() )
      if the address is an account
         if the account is over quota
            state = over-quota
         else
            state = ok
      else
         state = ok
   else
      state = unknown

   update the cache with the new result and last_checked time.
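
The flow above can be sketched roughly as follows. This is Python purely to illustrate the logic (a real mimedefang-filter would be Perl), and every name in it is hypothetical: check_address stands in for a call to md_check_against_smtp_server(), is_account and is_over_quota stand in for whatever the account system offers, and cache is any dict-like store.

```python
import time

CACHE_LIFE = 2 * 60 * 60  # cache lifetime in seconds (2 hours)

def lookup_state(address, cache, check_address, is_account, is_over_quota, now=None):
    """Return 'ok', 'unknown', or 'over-quota', using the cache when it is fresh.

    check_address, is_account and is_over_quota are hypothetical callables
    supplied by the caller; check_address is where the SMTP callout would go.
    """
    now = time.time() if now is None else now

    entry = cache.get(address)
    if entry is not None:
        state, last_checked = entry
        if now - last_checked <= CACHE_LIFE:
            return state  # fresh cached result, no backend call

    # Not cached, or the cached entry has expired: ask the backend.
    if check_address(address):
        if is_account(address) and is_over_quota(address):
            state = "over-quota"
        else:
            state = "ok"  # valid address: a non-account, or an account under quota
    else:
        state = "unknown"

    # Record the new result together with the time it was obtained.
    cache[address] = (state, now)
    return state
```

With a plain dict as the cache this is the "store it in a hash" variant: it only saves calls within a single process.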


Anyone have thoughts about good and bad ways to do that?

I could just store it in a hash, but that means each child process will check on its own. That's potentially 30 children * 5 machines * 30,000 addresses = 4.5 million md_check_against_smtp_server() calls ... which doesn't even include the actual SMTP deliveries.

If I cache it in a local database, that's easy and cheap. I then cut that down to 5 machines * 30,000 addresses, or 0.15 million (150,000) calls per 2 hours. Plus, I can potentially cut it further by having an external process that goes through and cleans things up every hour or so (seed the database with known good addresses from our account management system; do the quota checks so they don't have to be done in real time; etc.). That might significantly cut down the number of calls. And, if I'm really confident about the seeding process, I might even be able to omit the md_check_against_smtp_server() calls entirely, because the seeding process already told me everything I needed to know.
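
To make the local-database idea concrete, here is a rough sketch of a per-machine on-disk cache that all child processes could share. It is Python with SQLite purely for illustration (a Perl filter would more likely use a tied DBM file or DBI), and the table name, column names, and function names are all made up for this example. The same put_cached call could be used by the external seeding process, and purge_expired is the kind of thing the hourly cleanup job might run.

```python
import sqlite3
import time

CACHE_LIFE = 2 * 60 * 60  # cache lifetime in seconds (2 hours)

def open_cache(path):
    """Open (and create if needed) the cache database at the given path."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS address_cache ("
        " address TEXT PRIMARY KEY,"
        " state TEXT NOT NULL,"
        " last_checked REAL NOT NULL)"
    )
    conn.commit()
    return conn

def get_cached(conn, address, now=None):
    """Return the cached state, or None if the entry is missing or expired."""
    now = time.time() if now is None else now
    row = conn.execute(
        "SELECT state, last_checked FROM address_cache WHERE address = ?",
        (address,),
    ).fetchone()
    if row is None or now - row[1] > CACHE_LIFE:
        return None
    return row[0]

def put_cached(conn, address, state, now=None):
    """Insert or overwrite the entry for an address."""
    now = time.time() if now is None else now
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT OR REPLACE INTO address_cache (address, state, last_checked)"
            " VALUES (?, ?, ?)",
            (address, state, now),
        )

def purge_expired(conn, now=None):
    """Delete expired entries; return how many rows were removed."""
    now = time.time() if now is None else now
    with conn:
        cur = conn.execute(
            "DELETE FROM address_cache WHERE last_checked < ?",
            (now - CACHE_LIFE,),
        )
    return cur.rowcount
```

Because the file lives on each machine, a lookup never leaves the box, and the backend only sees a callout when get_cached returns None.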

I could also use an external database server, but then I'm introducing points of failure into the process, and shifting "lots of calls to the backend server" to "lots of calls to the database server".

I'm sort of leaning toward the "local database" approach, but I've never really played with ties and such before.




