I’ve just configured another mailcow system as a backup MX. My primary was down for about 8 hours for maintenance, and I now have hundreds of queued emails waiting to be transferred to the primary. For some reason, the process is taking forever.
The primary’s log shows 3 concurrent connections, each issuing 7 commands before disconnecting (which I think is probably a single email per connection), and this happens every 15 minutes.
The secondary is showing the probable problem (hostnames, IPs and emails changed):
postfix/error[2165]: 5AA0419C0: to=<x@x.com>, relay=none, delay=21276, delays=20973/303/0/0, dsn=4.4.2, status=deferred (delivery temporarily suspended: conversation with mail.x.com[4.4.4.4] timed out while receiving the initial server greeting)
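What could be causing the greeting timeout here? (As far as I can tell, delay=21276 is in seconds, so it is the roughly 5.9 hours the message has been queued rather than a 21 second delay.) I’ve whitelisted the backup MX, as per the primary logs: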
postfix/postscreen[2501]: WHITELISTED [5.5.5.5]:33560
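For reference, here is a minimal Python sketch that splits out the delay fields of the deferred line from the secondary. It assumes the usual Postfix meaning of these fields: `delay=` is the total in seconds, and `delays=a/b/c/d` is time before the queue manager, time in the queue manager, connection setup time, and message transmission time. The function name `parse_delays` and the dictionary keys are my own, not anything from Postfix or mailcow.

```python
import re

# Deferred line copied from the secondary's log (already anonymised).
LOG_LINE = (
    "postfix/error[2165]: 5AA0419C0: to=<x@x.com>, relay=none, "
    "delay=21276, delays=20973/303/0/0, dsn=4.4.2, status=deferred "
    "(delivery temporarily suspended: conversation with mail.x.com[4.4.4.4] "
    "timed out while receiving the initial server greeting)"
)


def parse_delays(line):
    """Return the delay fields of a Postfix delivery log line, in seconds.

    Hypothetical helper: the key names below are illustrative only.
    """
    total = re.search(r"\bdelay=([\d.]+)", line)
    parts = re.search(r"\bdelays=([\d.]+)/([\d.]+)/([\d.]+)/([\d.]+)", line)
    if total is None or parts is None:
        raise ValueError("no delay=/delays= fields found in line")
    before_qmgr, in_qmgr, conn_setup, transmission = (
        float(value) for value in parts.groups()
    )
    return {
        "total_s": float(total.group(1)),
        "before_qmgr_s": before_qmgr,    # waiting before the queue manager
        "in_qmgr_s": in_qmgr,            # waiting inside the queue manager
        "conn_setup_s": conn_setup,      # DNS, connect, HELO, TLS
        "transmission_s": transmission,  # sending the message itself
    }


fields = parse_delays(LOG_LINE)
for name, seconds in fields.items():
    print(f"{name:>15}: {seconds:8.0f} s  ({seconds / 3600:.2f} h)")
```

Read this way, almost all of the time is queue waiting, and the line itself comes from the postfix/error service with relay=none, so it may be reporting an earlier timeout to the same destination rather than a fresh connection attempt.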
When I telnet in, I get the 220 response within a second or so.
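To put a number on the telnet check, a small Python sketch like the one below could time how long the primary takes to send its first greeting bytes. It is only a rough measurement under my own assumptions: `time_greeting` is a hypothetical helper, port 25 and the 30 second timeout are guesses, and `mail.x.com` is the anonymised hostname from the log. It probably needs to be run from the backup MX itself, since the primary may treat other source IPs differently.

```python
import socket
import time


def time_greeting(host, port=25, timeout=30.0):
    """Connect to an SMTP server and time the arrival of its first bytes.

    Returns (elapsed_seconds, first_banner_line). Raises an OSError
    subclass (for example a timeout) if nothing arrives in time.
    """
    start = time.monotonic()
    # The timeout applies to both the connect and the recv below.
    with socket.create_connection((host, port), timeout=timeout) as sock:
        data = sock.recv(1024)  # blocks until the server says something
    elapsed = time.monotonic() - start
    lines = data.decode("ascii", errors="replace").splitlines()
    banner = lines[0] if lines else ""
    return elapsed, banner


# Example usage (hostname is the anonymised placeholder from the log):
# elapsed, banner = time_greeting("mail.x.com")
# print(f"{elapsed:.2f} s until first response: {banner!r}")
```

Running this a few times in a row from the backup host, ideally while the queue is being flushed, would show whether the greeting is slow only under concurrent connections.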
Update: I’ve tried adding the backup IP to the postscreen whitelist following https://docs.mailcow.email/manual-guides/Postfix/u_e-postfix-postscreen_whitelist/?h=postscreen, but this has made no difference.