Hello
We periodically receive a burst of errors regarding JMX and web checks. It happens 1-2 times a day, every day, at different times. The Java and HTTP pollers are not very busy, about 10-20%, and we do not see any peaks in the graphs at these times.
Timeout is set to 15 minutes. Zabbix alerts "JMX agent unreachable" on multiple hosts at the same time.
Web checks go through a proxy server, but when the errors happen we do not see these requests in the proxy server log.
What can cause these problems?
32741:20140912:053342.929 JMX agent item "jmx["com.mchange.v2.c3p0:type=PooledDataSource[z8kflu931wazdbw19rw4xg|2d977ced]",threadPoolNumTasksPending]" on host "aaa" failed: first network error, wait for 15 seconds
32739:20140912:053351.211 JMX agent item "jmx["com.mchange.v2.c3p0:type=PooledDataSource[2rxg6v94jh73691j0xnig|388b401d]",numIdleConnections]" on host "bbb" failed: first network error, wait for 15 seconds
32730:20140912:053352.999 JMX agent item "jmx["com.mchange.v2.c3p0:type=PooledDataSource[2rxg6w94jewwlb83ledp|78c1a023]",numConnections]" on host "ccc" failed: first network error, wait for 15 seconds
32740:20140912:053403.937 JMX agent item "jmx["java.lang:type=Runtime",Uptime]" on host "ddd" failed: first network error, wait for 15 seconds
32641:20140912:053404.471 resuming JMX agent checks on host "aaa": connection restored
32632:20140912:053406.430 resuming JMX agent checks on host "bbb": connection restored
32640:20140912:053407.603 resuming JMX agent checks on host "ccc": connection restored
32626:20140912:053418.414 resuming JMX agent checks on host "ddd": connection restored
32682:20140913:051731.924 cannot process step "aaa" of web scenario "aaa" on host "aaa": Transferred a partial file
32682:20140913:051731.929 cannot process step "bbb" of web scenario "bbb" on host "bbb": Couldn't connect to server
32682:20140913:051731.933 cannot process step "ccc" of web scenario "ccc" on host "ccc": Couldn't connect to server
32682:20140913:051731.938 cannot process step "ddd" of web scenario "ddd" on host "ddd": Couldn't connect to server
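To quantify how long each host stayed unreachable, the "first network error" and "connection restored" timestamps in the JMX log lines above can be paired per host. Below is a minimal Python sketch, assuming the log line layout seen in the excerpt (PID:YYYYMMDD:HHMMSS.mmm followed by the message). The function name `outage_durations` and the regular expressions are illustrative and are not part of Zabbix.

```python
import re
from datetime import datetime

# Assumed layout, taken from the log excerpt: PID:YYYYMMDD:HHMMSS.mmm message
LINE_RE = re.compile(r'^\d+:(\d{8}:\d{6}\.\d{3}) (.*)$')
FAIL_RE = re.compile(r'on host "([^"]+)" failed: first network error')
RESUME_RE = re.compile(
    r'resuming JMX agent checks on host "([^"]+)": connection restored'
)


def outage_durations(lines):
    """Return {host: seconds from first network error to connection restored}."""
    failed_at = {}   # host -> timestamp of its first network error
    durations = {}   # host -> outage length in seconds
    for line in lines:
        m = LINE_RE.match(line.strip())
        if not m:
            continue  # skip anything that is not a server log line
        ts = datetime.strptime(m.group(1), "%Y%m%d:%H%M%S.%f")
        msg = m.group(2)

        fail = FAIL_RE.search(msg)
        if fail:
            # Keep only the earliest failure if the line is repeated.
            failed_at.setdefault(fail.group(1), ts)
            continue

        resume = RESUME_RE.search(msg)
        if resume and resume.group(1) in failed_at:
            start = failed_at.pop(resume.group(1))
            durations[resume.group(1)] = (ts - start).total_seconds()
    return durations
```

Fed the JMX lines from 20140912, this gives about 21.5 seconds for host "aaa" and about 14.5 to 15.2 seconds for the other three hosts, so each host was resumed shortly after the 15 second wait mentioned in the log.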