Hi,
Situation:
Distributed setup, 1 master 2 slave nodes.
Master node has 1.4.5-pre running with postgresql backend.
Slaves nodes have 1.4.4 running with mysql backend.
Problem:
When one of the slave nodes crash / stop (havent had time to check for the erros yet, logfiles are already cycle'd away) the master node will keep running but not poll / receive data anymore. Trigger evaluation and action handling still works (looks like the symptoms related to the infinite loop in the trapper processes, but there is no load increase).
Restart of the zabbix_server daemon will restore functionality
Question:
Is this specific issue known / fixed in 1.4.5? I dont really want to do the trial and error thing while the fix is unknown.
I expect the slave and master node were exchanging data at the point of the crash (and thus leaving the master node in some wait state because the announced amount of data hasn't been received yet.
Kind regards,
Situation:
Distributed setup, 1 master 2 slave nodes.
Master node has 1.4.5-pre running with postgresql backend.
Slaves nodes have 1.4.4 running with mysql backend.
Problem:
When one of the slave nodes crash / stop (havent had time to check for the erros yet, logfiles are already cycle'd away) the master node will keep running but not poll / receive data anymore. Trigger evaluation and action handling still works (looks like the symptoms related to the infinite loop in the trapper processes, but there is no load increase).
Restart of the zabbix_server daemon will restore functionality
Question:
Is this specific issue known / fixed in 1.4.5? I dont really want to do the trial and error thing while the fix is unknown.
I expect the slave and master node were exchanging data at the point of the crash (and thus leaving the master node in some wait state because the announced amount of data hasn't been received yet.
Kind regards,
Comment