so I am regularly getting emails with a subject of "Watchdog ALERT: postfix-mailcow
" and the following content. I am trying to figure out why this regularly happens. Since there is no timestamp in the line where it says “socket timeout” I can only look at the timestamp from when the email was sent.

I’ve installed netdata to collect statistics and see if I can spot an anomaly, will update the thread after 24h and add more info if I can spot something.

Any pointers what this warning means? How exactly is the watchdog container trying to connect to the postfix container?

SMTP OK - 0.030 sec. response time|time=0.029644s;;;0.000000
SMTP OK - 0.014 sec. response time|time=0.013510s;;;0.000000
SMTP OK - 0.024 sec. response time|time=0.023623s;;;0.000000
SMTP OK - 0.019 sec. response time|time=0.018802s;;;0.000000
SMTP OK - 0.012 sec. response time|time=0.012303s;;;0.000000
SMTP OK - 0.016 sec. response time|time=0.015647s;;;0.000000
SMTP OK - 0.045 sec. response time|time=0.044639s;;;0.000000
SMTP OK - 0.021 sec. response time|time=0.020579s;;;0.000000
SMTP OK - 0.018 sec. response time|time=0.017864s;;;0.000000
SMTP OK - 0.016 sec. response time|time=0.016137s;;;0.000000
SMTP OK - 0.021 sec. response time|time=0.020663s;;;0.000000
SMTP OK - 0.015 sec. response time|time=0.015283s;;;0.000000
SMTP OK - 0.023 sec. response time|time=0.023478s;;;0.000000
SMTP OK - 0.017 sec. response time|time=0.017011s;;;0.000000
SMTP OK - 0.018 sec. response time|time=0.018398s;;;0.000000
SMTP OK - 0.016 sec. response time|time=0.015549s;;;0.000000
SMTP OK - 0.027 sec. response time|time=0.026541s;;;0.000000
SMTP OK - 0.015 sec. response time|time=0.014855s;;;0.000000
SMTP OK - 0.016 sec. response time|time=0.016325s;;;0.000000
SMTP OK - 0.018 sec. response time|time=0.017828s;;;0.000000
SMTP OK - 0.046 sec. response time|time=0.045904s;;;0.000000
SMTP OK - 0.022 sec. response time|time=0.022293s;;;0.000000
SMTP OK - 0.019 sec. response time|time=0.019179s;;;0.000000
SMTP OK - 0.016 sec. response time|time=0.016266s;;;0.000000
SMTP OK - 0.014 sec. response time|time=0.013560s;;;0.000000
SMTP OK - 0.013 sec. response time|time=0.013314s;;;0.000000
SMTP OK - 0.019 sec. response time|time=0.018674s;;;0.000000
SMTP OK - 0.016 sec. response time|time=0.015811s;;;0.000000
SMTP OK - 0.018 sec. response time|time=0.017677s;;;0.000000
SMTP OK - 0.013 sec. response time|time=0.012654s;;;0.000000
SMTP OK - 0.046 sec. response time|time=0.046086s;;;0.000000
SMTP OK - 0.016 sec. response time|time=0.015737s;;;0.000000
SMTP OK - 0.014 sec. response time|time=0.013949s;;;0.000000
SMTP OK - 0.013 sec. response time|time=0.013136s;;;0.000000
SMTP OK - 0.020 sec. response time|time=0.019707s;;;0.000000
SMTP OK - 0.015 sec. response time|time=0.014601s;;;0.000000
SMTP OK - 0.102 sec. response time|time=0.101611s;;;0.000000
SMTP OK - 0.036 sec. response time|time=0.036221s;;;0.000000
SMTP OK - 0.070 sec. response time|time=0.070323s;;;0.000000
SMTP OK - 0.034 sec. response time|time=0.034183s;;;0.000000
SMTP OK - 0.019 sec. response time|time=0.019157s;;;0.000000
SMTP OK - 0.016 sec. response time|time=0.015868s;;;0.000000
SMTP OK - 0.030 sec. response time|time=0.029833s;;;0.000000
SMTP OK - 0.025 sec. response time|time=0.025087s;;;0.000000
CRITICAL - Socket timeout
SMTP OK - 0.029 sec. response time|time=0.029462s;;;0.000000
CRITICAL - Socket timeout
SMTP OK - 0.033 sec. response time|time=0.033177s;;;0.000000
CRITICAL - Socket timeout
SMTP OK - 0.045 sec. response time|time=0.045429s;;;0.000000
CRITICAL - Socket timeout
SMTP OK - 0.029 sec. response time|time=0.028832s;;;0.000000

Since the alert contains no time stamps this seems pretty useless or am I missing something?
Would it not be better to include timestamps in these alerts?

Have something to say?

Join the community by quickly registering to participate in this discussion. We'd like to see you joining our great moo-community!

a year later

Hi all, I’m seeing the same errors - are there any further explanations what this means or how to fix it?

No one is typing