Policyd'ers,

I have postfix-policyd running on three Postfix (PF) servers, connected to
a dual-master pair of MySQL servers.  Two of the PF servers are on the
local subnet of the MySQL servers.  The third (a low-priority secondary MX)
resides in another data center, 180-220ms away.

For the past three months the setup has been performing brilliantly, and I
have only recently noticed problems.  After restarting the postfix-policyd
daemon, it functions normally for 2-5 minutes before it starts to log:
-----
Jun 15 11:59:00 secondary postfix-policyd: shutdown(11): Transport endpoint is not connected
-----

A little while longer and Postfix begins to defer *most* client connections:
-----
Jun 15 12:01:28 secondary postfix/smtpd[32590]: NOQUEUE: reject: RCPT from mta2.mes2.com[210.9.188.146]: 451 4.3.5 Server configuration problem; from=<[EMAIL PROTECTED]> to=<xxxx> proto=ESMTP helo=<mta2.mes2.com>

Jun 15 12:02:47 secondary postfix/smtpd[2603]: warning: problem talking to server 127.0.0.1:10031: Connection timed out
Jun 15 12:02:48 secondary postfix/smtpd[2601]: warning: timeout on 127.0.0.1:10031 while reading input attribute name
-----

Trawling through the debug=3 logs, these are the interesting snippets:
-----
Jun 15 12:02:55 secondary postfix-policyd: connection from: 127.0.0.1 port: 49516 slots: 8 of 4096 used
Jun 15 12:02:55 secondary postfix-policyd: DEBUG: fd: 8 select(): fd 8 is ready for read
Jun 15 12:02:55 secondary postfix-policyd: DEBUG: fd: 8 connection got an EOF, data_read = 0
Jun 15 12:02:55 secondary postfix-policyd: DEBUG: fd: 8 shutting down fd 8
Jun 15 12:02:55 secondary postfix-policyd: DEBUG: fd: 9 select(): fd 9 is ready for write
Jun 15 12:02:55 secondary postfix-policyd: DEBUG: fd: 9 returning after 14 bytes of data written
Jun 15 12:02:55 secondary postfix-policyd: DEBUG: fd: 10 select(): fd 10 is ready for write
Jun 15 12:02:55 secondary postfix-policyd: DEBUG: fd: 10 returning after 14 bytes of data written
Jun 15 12:02:55 secondary postfix-policyd: DEBUG: fd: 11 select(): fd 11 is ready for read
Jun 15 12:02:55 secondary postfix-policyd: DEBUG: fd: 11 connection got an EOF, data_read = 0
Jun 15 12:02:55 secondary postfix-policyd: DEBUG: fd: 11 shutting down fd 11
Jun 15 12:02:55 secondary postfix-policyd: shutdown(11): Transport endpoint is not connected
-----
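
In case anyone wants to tally events from a longer capture of the same
debug output, here is a rough sketch of how the entries could be split and
counted.  It assumes every entry starts with a syslog-style timestamp such
as "Jun 15 12:02:55", and it only recognises the three message types seen
in the excerpt above; the function names are my own, not part of policyd.

```python
import re
from collections import Counter

# Syslog-style timestamp that starts every entry, e.g. "Jun 15 12:02:55 ".
TIMESTAMP = re.compile(r"[A-Z][a-z]{2} +\d{1,2} \d{2}:\d{2}:\d{2} ")


def split_entries(text):
    """Split a blob of (possibly line-wrapped) syslog text into entries."""
    flat = " ".join(text.split())  # undo any mail-client line wrapping
    starts = [m.start() for m in TIMESTAMP.finditer(flat)]
    ends = starts[1:] + [len(flat)]
    return [flat[s:e].strip() for s, e in zip(starts, ends)]


def summarise(entries):
    """Count the message types that appear in the excerpt above."""
    counts = Counter()
    for entry in entries:
        if "connection got an EOF" in entry:
            counts["eof"] += 1
        elif "Transport endpoint is not connected" in entry:
            counts["shutdown_error"] += 1
        elif "bytes of data written" in entry:
            counts["write"] += 1
    return counts
```

Comparing the "eof" and "shutdown_error" counts over a few minutes of
logging might show whether every EOF is followed by a failed shutdown() or
only some of them.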


Relevant Postfix config used:
-----
# Set a sensible timeout for our policy daemons
smtpd_policy_service_timeout = 60s
...

# Decisions made after Rcpt To
smtpd_recipient_restrictions =
...
    # Query policyd (costs SELECTs,INSERTs,UPDATEs)
    check_policy_service inet:127.0.0.1:10031
-----
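
To take Postfix out of the picture, the policy port can also be queried
directly.  The sketch below speaks the Postfix policy delegation protocol
(name=value lines ended by an empty line, answered by an action= line) and
times one round trip.  The host, port and 60s timeout mirror the config
above; the attribute values are made-up placeholders, and a probe like this
may cause policyd to write rows for the fake client, so treat it as a
diagnostic only.

```python
import socket
import time


def build_request(attrs):
    """Encode attributes as name=value lines ending with an empty line."""
    lines = [f"{name}={value}" for name, value in attrs.items()]
    return ("\n".join(lines) + "\n\n").encode("ascii")


def parse_response(raw):
    """Return the value of the action= line from a policy reply, or None."""
    for line in raw.decode("ascii", "replace").splitlines():
        if line.startswith("action="):
            return line[len("action="):]
    return None


def probe(host="127.0.0.1", port=10031, timeout=60.0, attrs=None):
    """Send one policy request and return (action, seconds_elapsed)."""
    attrs = attrs or {
        "request": "smtpd_access_policy",
        "protocol_state": "RCPT",
        "protocol_name": "ESMTP",
        "client_address": "192.0.2.1",        # placeholder address
        "client_name": "probe.example.com",   # placeholder name
        "helo_name": "probe.example.com",
        "sender": "probe@example.com",
        "recipient": "postmaster@example.com",
    }
    start = time.monotonic()
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(build_request(attrs))
        raw = b""
        # Read until the empty line that ends the reply, or until EOF.
        while not raw.endswith(b"\n\n"):
            chunk = sock.recv(4096)
            if not chunk:
                break
            raw += chunk
    return parse_response(raw), time.monotonic() - start
```

Calling probe() in a loop right after a restart, and again once the
timeouts begin, should show whether the daemon stops answering entirely or
just slows down.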


Relevant policyd config used:
-----
MYSQLHOST="xxxx"
MYSQLDBASE="policyd"
MYSQLUSER="policyd"
MYSQLPASS="xxxx"
MYSQLOPT=""
FAILSAFE=1
DATABASE_KEEPALIVE=1

DEBUG=0
DAEMON=1
BINDHOST=127.0.0.1
BINDPORT=10031
PIDFILE=/var/run/policyd.pid
-----


Monitoring the network between the MySQL servers and this machine shows no
packet loss and consistent latency (~200ms, as above).  Using the mysql
client from the same server, logged policyd queries all return in about
0.25 sec:

-----
mysql> SELECT COUNT(*) FROM whitelist_dnsname WHERE
    -> 'cmx03.servicemail24.de' LIKE _whitelist;
+----------+
| COUNT(*) |
+----------+
|        0 |
+----------+
1 row in set (0.23 sec)
-----
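
As a back-of-envelope check on what that round trip could add up to, here
is a small sketch.  The 0.23 sec figure is from the query above; the number
of queries per policy request and the idea that they run strictly one after
another are assumptions on my part, not something I have confirmed in the
policyd source.

```python
def request_latency(queries_per_request, seconds_per_query=0.23):
    """Seconds one policy request waits on MySQL if queries run back to back."""
    return queries_per_request * seconds_per_query


def max_requests_per_minute(queries_per_request, seconds_per_query=0.23):
    """Upper bound on requests one strictly serial worker finishes in 60s."""
    return int(60 // request_latency(queries_per_request, seconds_per_query))
```

With an assumed 5 queries per request that is roughly 1.15 sec per request,
or at most 52 requests a minute if nothing runs in parallel, so a modest
burst of connections could plausibly queue up behind the database.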

Any ideas on what to try?


Regards
__________________________________________
Dean Manners

