BEGIN:VCALENDAR
PRODID:-//vBulletin 6//EN
VERSION:2.0
CALSCALE:GREGORIAN
BEGIN:VEVENT
UID:f8fa9d52-b951-4960-b6fd-f195c8a6eb9a
DTSTAMP:20260430T115332Z
SUMMARY:Disconnected graphs snmp as well as zabbix-agent \,  no data receiv
 ed alerts
DESCRIPTION:Hello All\,\n\nThere seems to be this weird scenario i am facin
 g with respect to the agents off late  where graphs show disconnected dots
  \, no straight lines\, keep getting no data recieved alerts but telnet to
  agent port looks fine. I really need help here to figure out what's going
  wrong. Zabbix queue is also piled up bigtime.\nI tried changing to active
  agent config\, that didn't help either\, where increasing pollers also di
 dn't help.\nI have mix and match items with intervals ranging from 60s min
  to 1day max\, \n\n \n\nNumber of hosts (enabled/disabled/templates)\n 569
 3\n 3018 / 2289 / 386\n \n\nNumber of items (enabled/disabled/not supporte
 d)\n 24641\n 22095 / 1513 / 1033\n \n\nNumber of triggers (enabled/disable
 d [problem/ok])\n 7277\n 6618 / 659 [373 / 6245]\n \n\nNumber of users (on
 line)\n 73\n 23\n \n\nRequired server performance\, new values per second\
 n 84.56\n \n\n \nZabbix Queue looks like this\n\n \n\ntems 5 seconds 10 se
 conds 30 seconds 1 minute 5 minutes More than 10 minutes \n\nZabbix agent\
 n 16\n 27\n 8\n 41\n 22\n 90\n \n\nZabbix agent (active)\n 0\n 0\n 3\n 17\
 n 11\n 174\n \n\nSimple check\n 82\n 127\n 14\n 24\n 0\n 0\n \n\nSNMPv1 ag
 ent\n 0\n 0\n 0\n 0\n 0\n 0\n \n\nSNMPv2 agent\n 8\n 6\n 1\n 0\n 4\n 10\n 
 \n\n \nThere's only minimal active items added\, but max on zabbix_aget an
 d snmp checks\, below is proxy config \, this is pretty similar to other 6
  proxies. Data gathering process is only 60% rest all is cool (from the da
 ta gathering process graph). values processed is about 125+\, and queue fo
 r this proxy is avg 900.\n\n\nServer=XX.XX.XX.XX\nHostname=Zabbix-Proxy\nL
 ogFile=/var/log/zabbix/zabbix_proxy.log\nLogFileSize=300\nDebugLevel=4\nPi
 dFile=/var/run/zabbix/zabbix_proxy.pid\nDBName=zabbix\nDBUser=zabbix\nDBPa
 ssword=password\nProxyLocalBuffer=3\nProxyOfflineBuffer=4\nConfigFrequency
 =120\nDataSenderFrequency=30\nStartPollers=275\nStartPollersUnreachable=12
 0\nStartTrappers=60\nStartPingers=90\nStartSNMPTrapper=1\nHousekeepingFreq
 uency=3\nCacheSize=1G\nStartDBSyncers=50\nHistoryCacheSize=1G\nHistoryInde
 xCacheSize=1G\nTimeout=30\nUnreachablePeriod=90\nFpingLocation=/usr/local/
 sbin/fping\nLogSlowQueries=300\n\n\nServers are not utilised much. proxy s
 ervers have enough ram \, only 60% used\,  cpu load looks fine 20-30% (all
  the proxies). Below is the server config\n\n\nLogFile=/var/log/zabbix/zab
 bix_server.log\nLogFileSize=500\nDebugLevel=4\nPidFile=/var/run/zabbix/zab
 bix_server.pid\nDBName=zabbix\nDBUser=zabbix\nDBPassword=password\nStartPo
 llers=300\nStartIPMIPollers=1\nStartPollersUnreachable=150\nStartTrappers=
 130\nStartPingers=120\nStartDiscoverers=10\nStartSNMPTrapper=1\nListenIP=0
 .0.0.0\nHousekeepingFrequency=2\nMaxHousekeeperDelete=300\nSenderFrequency
 =360\nCacheSize=1G\nCacheUpdateFrequency=300\nStartDBSyncers=15\nHistoryCa
 cheSize=256M\nHistoryIndexCacheSize=256M\nTrendCacheSize=1G\nValueCacheSiz
 e=128M\nTimeout=30\nTrapperTimeout=180\nUnreachablePeriod=600\nUnavailable
 Delay=180\nAlertScriptsPath=/etc/zabbix/alert.d/\nFpingLocation=/usr/local
 /sbin/fping\nLogSlowQueries=300\nStartProxyPollers=2\nProxyDataFrequency=1
 80\n\nzabbx server details\n\n\n[root@zbx_server ~]# free -g\n            
  total       used       free     shared    buffers     cached\n\nMem:     
       125         86         39          0          0         60\n\n-/+ bu
 ffers/cache:         25        100\n\nSwap:            9          0       
    9\n[root@zbx_server ~]#\nCPU - 40 Core\, MySql ~ 170G db\n\nmysql confi
 g\n\n\n[mysqld]\ndatadir=/var/lib/mysql\nsocket=/var/lib/mysql/mysql.sock\
 nuser=mysql\nsymbolic-links=0\nlong_query_time = 10\nlog-queries-not-using
 -indexes=YES\ninnodb_lock_wait_timeout=500\ninnodb_locks_unsafe_for_binlog
 =1\ninnodb_file_per_table\ninnodb_flush_method=O_DIRECT\ninnodb_log_file_s
 ize=1G\ninnodb_buffer_pool_size=48G\ninnodb_file_per_table\nmax_allowed_pa
 cket = 128M\ninnodb_additional_mem_pool_size = 30M\ninnodb_thread_concurre
 ncy = 8\nkey_buffer_size = 60M\nmax_connections=700\ntable_cache=4096\ntmp
 _table_size = 32M\nthread_cache_size = 64\nquery_cache_limit=64M\nthread_c
 ache_size=512\nread_buffer_size=2M\nlog-bin=mysql-bin\nbinlog-do-db=zabbix
 \nserver-id=9\nexpire_logs_days=3\nmax_binlog_size=100M\n[mysqld_safe]\nlo
 g-error=/var/log/mysqld.log\npid-file=/var/run/mysqld/mysqld.pid\n[client]
 \nuser=root\npassword=password\n\n\nI am like more confused now. Trying to
  figure out what's the optimal configurations to get these working.  I am 
 not in a position to do multiple start/stop to zabbix severs as well as pr
 oxies (as they are production) but two-four can be tried. Earlier when the
 se happened\, i tried increasing the pollers and it went away\, now increa
 sing pollers isn't helping. Logs doesn't show much - except network_error 
 \, trying after 15 seconds and some ZBX_TCP_READ() timed out - where as co
 ntinues \n\n\nnc -z host_ip 10050   OR nc -z host_ip 161 \n \nboth works f
 lawlessley. I mean i don't see disconnections using continues NC commands 
 or ping's so definitely network isn't the issue.\n\nZabbix is approximatel
 y 4-5 years old where i first installed 2.0.6 and now in 3.0.13 (all are c
 entos 6.3/6.5 server plus 6 proxies with mysql as backed)\nNumber of hosts
  monitored via server and proxies.\nServer : ~ 1500 \nProxies together : ~
  1500\n\nCurrently i am stuck with below points\n1). Disconnected graphs o
 n both SNMP monitored items as well as zabbix-agents \, number of such hav
 ing issue  ~ 20+ on agent about 30 on snmp.\n2). SNMP traffic data is disc
 onnected\, and showing less data than router shows\, i.e if router says in
 terface clocking 300Mb\, i am seeing only ~100Mb on zabbix - this is anoth
 er issue( interval for traffic is 5mins)\n3). No-data received alerts on a
 gents. I am able to get 1 when i manually do agent_ping but zabbix shows a
 lerts\, and on the latest data as well i am seeing data is missing ( inter
 val for agent.ping is 1m )\n4). External scripts failing to run. I had som
 e expect/shell/perl based scripts to login to routers to run and get some 
 data show on zabbix - this use to work flawlessly on pervious version ( 2.
 2.12)\, but in this they are failing - 3.0.13 - either they show timeout r
 unning \, or  they run hanging on the console.\n\nAny pointers are greatly
  helpful.\n\nThanks
URL:https://www.zabbix.com/forum/node/360355
DTSTART;VALUE=DATE:20180612
END:VEVENT
END:VCALENDAR
