View Full Version : Graph gaps at night
alextsr
26th January 2009, 10:28
Hello all,
I deployed Centreon 2 about some a couple of weeks ago and all went fine. Centreon was monitoring and graphing everything I threw in him. Last days though I found out that late at night Centreon started to show some large gaps on almost all the graphs as shown bellow (except those that had about 30min interval)
Disk Space
http://i93.photobucket.com/albums/l55/alextsr/disk.png
Internet Traffic
http://i93.photobucket.com/albums/l55/alextsr/inet.png
Ping Response
http://i93.photobucket.com/albums/l55/alextsr/ping.png
Has this happened to anyone?
Any Ideas of what to check? The service logs show nothing weird.
naparuba
26th January 2009, 12:02
Can you launch a nagiostat?
alextsr
26th January 2009, 12:21
Nagios Stats 3.0.6
Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org)
Last Modified: 12-01-2008
License: GPL
CURRENT STATUS DATA
------------------------------------------------------
Status File: /usr/local/nagios/var/status.log
Status File Age: 0d 0h 0m 1s
Status File Version: 3.0.6
Program Running Time: 2d 22h 43m 39s
Nagios PID: 13756
Used/High/Total Command Buffers: 0 / 0 / 4096
Total Services: 378
Services Checked: 378
Services Scheduled: 378
Services Actively Checked: 378
Services Passively Checked: 0
Total Service State Change: 0.000 / 55.070 / 0.256 %
Active Service Latency: 0.002 / 1.225 / 0.542 sec
Active Service Execution Time: 0.052 / 10.648 / 1.513 sec
Active Service State Change: 0.000 / 55.070 / 0.256 %
Active Services Last 1/5/15/60 min: 98 / 322 / 378 / 378
Passive Service Latency: 0.000 / 0.000 / 0.000 sec
Passive Service State Change: 0.000 / 0.000 / 0.000 %
Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0
Services Ok/Warn/Unk/Crit: 340 / 9 / 17 / 12
Services Flapping: 0
Services In Downtime: 0
Total Hosts: 84
Hosts Checked: 82
Hosts Scheduled: 82
Hosts Actively Checked: 84
Host Passively Checked: 0
Total Host State Change: 0.000 / 0.000 / 0.000 %
Active Host Latency: 0.000 / 1.586 / 0.961 sec
Active Host Execution Time: 0.000 / 0.876 / 0.090 sec
Active Host State Change: 0.000 / 0.000 / 0.000 %
Active Hosts Last 1/5/15/60 min: 51 / 73 / 82 / 82
Passive Host Latency: 0.000 / 0.000 / 0.000 sec
Passive Host State Change: 0.000 / 0.000 / 0.000 %
Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0
Hosts Up/Down/Unreach: 84 / 0 / 0
Hosts Flapping: 0
Hosts In Downtime: 0
Active Host Checks Last 1/5/15 min: 61 / 130 / 370
Scheduled: 47 / 93 / 256
On-demand: 14 / 37 / 114
Parallel: 47 / 93 / 256
Serial: 7 / 24 / 77
Cached: 7 / 12 / 36
Passive Host Checks Last 1/5/15 min: 0 / 0 / 0
Active Service Checks Last 1/5/15 min: 84 / 322 / 1010
Scheduled: 84 / 322 / 1010
On-demand: 0 / 0 / 0
Cached: 0 / 0 / 0
Passive Service Checks Last 1/5/15 min: 0 / 0 / 0
External Commands Last 1/5/15 min: 0 / 0 / 0
Note that the time that the gaps show is almost dead time for the network.
Thanks
naparuba
26th January 2009, 12:25
Do you see services errors during this gap?
alextsr
27th January 2009, 10:01
Nope. no service errors.
It just happened right now so I got the chance to log to the server and saw that MySQL is using 100% of the cpu, so I guess that Centreon is not able to fill in the data at that time.
Now I have to find out what is MySQL doing at that time.
Does Centreon has any specific cpu intensive task executed from time to time?