Operating System Metrics
Bleemeo agent automatically monitor your operating system metrics (check that your operating system is a supported OS)
The monitoring cover:
- Gather system resources utilization: CPU, memory, disk, ...
- Alarm for loss of connection to the Bleemeo Cloud platform
- Alarm for overutilization of resources: CPU, memory, disk space and swap. Default thresholds are 80% for warning status and 90% for critical status
- Alarm for network errors
- Alarm for pending security updates
Agent gathers the following metrics:
Metric | Description | OS Supported | Alerting |
---|---|---|---|
agent_config_warning | Bleemeo agent configuration files issues | ||
agent_gather_time | Time spent to gather metrics by Bleemeo agent in seconds | ||
agent_status | Status of Agent connection | Yes | |
cpu_idle | CPU idle in percent | ||
cpu_interrupt | CPU used by low-level driver in percent | ||
cpu_nice | CPU used by niced applications in percent | ||
cpu_other | CPU not used by user or system in percent | ||
cpu_softirq | CPU used by driver in percent | ||
cpu_steal | CPU used by hypervisor in percent | ||
cpu_system | CPU used by system call in percent | ||
cpu_used | CPU used in percent | Default thresholds are: above 80% for warning status and above 90% for critical status. | |
cpu_used_status | Status of CPU usage | ||
cpu_user | CPU used by applications in percent | ||
cpu_wait | CPU idle while waiting for IO operation in percent | ||
cpu_guest_nice | CPU used by niced guest VM in percent | ||
cpu_guest | CPU used by guest VM in percent | ||
disk_free | Filesystem space available in bytes | ||
disk_inodes_free | Number of inodes available | ||
disk_inodes_total | Number of inodes for this filesystem | ||
disk_inodes_used | Number of used inodes | ||
disk_total | Filesystem size in bytes | ||
disk_used | Filesystem space used in bytes | ||
disk_used_perc | Filesystem space used in percent | Default thresholds are: above 80% for warning status and above 90% for critical status. | |
disk_used_perc_status | Status of disk usage | ||
io_read_merged | Number of read operations that were merged before hitting disk | ||
io_write_merged | Number of write operations that were merged before hitting disk | ||
io_read_bytes | Disk read throughput in bytes per second | ||
io_read_time | Time spent reading in milliseconds per second | ||
io_reads | Number of reads completed per second | ||
io_time | Time spent doing I/O in milliseconds per second | ||
io_utilization | Disk IO utilization in percent | ||
io_write_bytes | Disk write throughput in bytes per second | ||
io_write_time | Time spent writing in milliseconds per second | ||
io_writes | Number of writes completed per second | ||
mem_available | Memory available for application in bytes | ||
mem_available_perc | Memory available for application in percent | ||
mem_buffered | Memory used for raw block cache in bytes | ||
mem_cached | Memory used for file cache in bytes | ||
mem_free | Memory unused in bytes | ||
mem_total | Memory size in bytes | ||
mem_used | Memory used by applications in bytes | ||
mem_used_perc | Memory used by applications in percent | Default thresholds are: above 80% for warning status and above 90% for critical status. | |
mem_used_perc_status | Status of memory usage | ||
net_bits_recv | Network traffic received in bits per second | ||
net_bits_sent | Network traffic sent in bits per second | ||
net_drop_in | Number of received packets dropped per second | ||
net_drop_out | Number of sent packets dropped per second | ||
net_err_in | Number of errors per second while receiving packet | Default thresholds are: above 0 for critical status. | |
net_err_in_status | Status of network errors for received packets | ||
net_err_out | Number of errors per second while sending packet | Default thresholds are: above 0 for critical status. | |
net_err_out_status | Status of network errors for sent packets | ||
net_packets_recv | Number of packets received per second | ||
net_packets_sent | Number of packets sent per second | ||
process_status_blocked | Number of processes blocked in system call | ||
process_status_paging | Number of processes blocked by paging operation | ||
process_status_running | Number of processes currently running | ||
process_status_sleeping | Number of idle processes | ||
process_status_stopped | Number of stopped processes | ||
process_status_zombies | Number of zombie processes | ||
process_total | Number of processes | ||
process_total_threads | Number of threads | ||
swap_free | Swap unused in bytes | ||
swap_in | Swap read throughput in bytes per second | ||
swap_out | Swap write throughput in bytes per second | ||
swap_total | Swap size in bytes | ||
swap_used | Swap used in bytes | ||
swap_used_perc | Swap used in percent | Default thresholds are: above 80% for warning status and above 90% for critical status. | |
swap_used_perc_status | Status of swap usage | ||
system_load1 | System load over last minute | ||
system_load5 | System load over last 5 minutes | ||
system_load15 | System load over last 15 minutes | ||
system_pending_updates | Number of pending system updates | ||
system_pending_security_updates | Number of pending system security updates | Yes, after 24h | |
time_drift | Difference between local time and reference time in seconds | Default thresholds are: 3 minutes for warning status and 5 minutes for critical status | |
uptime | Time elapsed since last boot in seconds | ||
users_logged | Number of users currently logged in the system |