Skip to content

Operating System Metrics

Free The list of metrics is reduced
Starter The list of metrics is reduced
Professional You'll see all the metrics listed below

The Bleemeo agent automatically monitors your operating system metrics (check that your operating system is a supported OS).

The monitoring covers:

  • System resources utilization: CPU, memory, disk, …
  • S.M.A.R.T. metrics to monitor the health of your disks
  • Sensor temperatures
  • Notifications for:
    • overutilization of resources: CPU, memory, disk space and swap. Default thresholds are 80% for warning status and 90% for critical status
    • loss of connection to the Bleemeo Cloud platform
    • network errors
    • pending security updates

The agent gathers the following metrics:

Metric Description OS Alerting
agent_config_warning Bleemeo agent configuration files issues
Linux Windows
agent_gather_time Time spent to gather metrics by Bleemeo agent in seconds
Linux Windows
agent_status Status of Agent connection
Linux Windows
cpu_idle CPU idle in percent
Linux Windows
cpu_interrupt CPU used by low-level driver in percent
Linux
cpu_nice CPU used by niced applications in percent
Linux
cpu_other CPU not used by user or system in percent
Linux Windows
cpu_softirq CPU used by driver in percent
Linux Windows
cpu_steal CPU used by hypervisor in percent
Linux
cpu_system CPU used by system call in percent
Linux Windows
cpu_used CPU used in percent
Linux Windows
Above 80% warning, above 90% critical
cpu_used_status Status of CPU usage
Linux Windows
cpu_user CPU used by applications in percent
Linux Windows
cpu_wait CPU idle while waiting for IO operation in percent
Linux
cpu_guest_nice CPU used by niced guest VM in percent
Linux
cpu_guest CPU used by guest VM in percent
Linux
disk_free Filesystem space available in bytes
Linux Windows
disk_inodes_free Number of inodes available
Linux Windows
disk_inodes_total Number of inodes for this filesystem
Linux Windows
disk_inodes_used Number of used inodes
Linux Windows
disk_total Filesystem size in bytes
Linux Windows
disk_used Filesystem space used in bytes
Linux Windows
disk_used_perc Filesystem space used in percent
Linux Windows
Above 80% warning, above 90% critical
disk_used_perc_status Status of disk usage
Linux Windows
io_read_merged Number of read operations that were merged before hitting disk
Linux Windows
io_write_merged Number of write operations that were merged before hitting disk
Linux Windows
io_read_bytes Disk read throughput in bytes per second
Linux Windows
io_read_utilization Disk IO read utilization
Linux Windows
io_reads Number of reads completed per second
Linux Windows
io_utilization Disk IO utilization in percent
Linux Windows
io_write_bytes Disk write throughput in bytes per second
Linux Windows
io_write_utilization Disk IO write utilization
Linux Windows
io_writes Number of writes completed per second
Linux Windows
mem_available Memory available for application in bytes
Linux Windows
mem_available_perc Memory available for application in percent
Linux Windows
mem_buffered Memory used for raw block cache in bytes
Linux Windows
mem_cached Memory used for file cache in bytes
Linux Windows
mem_free Memory unused in bytes
Linux Windows
mem_total Memory size in bytes
Linux Windows
mem_used Memory used by applications in bytes
Linux Windows
mem_used_perc Memory used by applications in percent
Linux Windows
Above 80% warning, above 90% critical
mem_used_perc_status Status of memory usage
Linux Windows
net_bits_recv Network traffic received in bits per second
Linux Windows
net_bits_sent Network traffic sent in bits per second
Linux Windows
net_drop_in Number of received packets dropped per second
Linux Windows
net_drop_out Number of sent packets dropped per second
Linux Windows
net_err_in Number of errors per second while receiving packet
Linux Windows
Above 0 is critical
net_err_in_status Status of network errors for received packets
Linux Windows
net_err_out Number of errors per second while sending packet
Linux Windows
Above 0 is critical
net_err_out_status Status of network errors for sent packets
Linux Windows
net_packets_recv Number of packets received per second
Linux Windows
net_packets_sent Number of packets sent per second
Linux Windows
process_status_blocked Number of processes blocked in system call
Linux Windows
process_status_paging Number of processes blocked by paging operation
Linux Windows
process_status_running Number of processes currently running
Linux Windows
process_status_sleeping Number of idle processes
Linux Windows
process_status_stopped Number of stopped processes
Linux Windows
process_status_zombies Number of zombie processes
Linux Windows
process_total Number of processes
Linux Windows
process_total_threads Number of threads
Linux Windows
swap_free Swap unused in bytes
Linux Windows
swap_in Swap read throughput in bytes per second
Linux
swap_out Swap write throughput in bytes per second
Linux
swap_total Swap size in bytes
Linux
swap_used Swap used in bytes
Linux
swap_used_perc Swap used in percent
Linux Windows
Above 80% warning, above 90% critical
swap_used_perc_status Status of swap usage
Linux Windows
system_load1 System load over last minute
Linux Windows
system_load5 System load over last 5 minutes
Linux Windows
system_load15 System load over last 15 minutes
Linux Windows
system_pending_updates Number of pending system updates
Linux Windows
system_pending_security_updates Number of pending system security updates
Linux
Yes, after 24h
time_drift Difference between local time and reference time in seconds
Linux Windows
3 min warning, 5 min critical
uptime Time elapsed since last boot in seconds
Linux Windows
users_logged Number of users currently logged in the system
Linux Windows

On Linux and TrueNAS, the agent gathers the system power consumption using IPMI. It require your server to support IPMI and report the power consumption through IPMI. It should be the case for all server hardware. The agent use the freeipmi command and fallback on ipmitool command if not available. So one of those commands need to be accessible in the PATH. To configure where to find ipmi-dcmi (freeipmi) or ipmitool, see the configuration page.

On TrueNAS, ipmitool is installed by default, no additional setup is needed.

On Linux, if you don’t have ipmi-dcmi, you can install it with:

Terminal window
# On Ubuntu/Debian
apt install freeipmi
# On Fedora, CentOS, Almalinux, RockyLinux or similar
yum install freeipmi

If you installed Glouton as a package or with wget, you don’t need any additional setup, otherwise you need to allow glouton to run the command as root. In /etc/sudoers.d/glouton, add the following line (you may need to change the location of ipmi-dcmi and ipmi-sensors, use which ipmi-dcmi and which ipmi-sensors):

glouton ALL=(ALL) NOPASSWD: /usr/sbin/ipmi-dcmi, /usr/sbin/ipmi-sensors

To test whether your server support IPMI, run the following command:

sudo ipmi-dcmi --get-system-power-statistics
sudo ipmi-sensors -W discretereading --sdr-cache-recreate

The following metric will be gathered:

Metric Description
system_power_consumption System power consumption in Watt

On Linux, the agent gathers metrics using smartctl if it’s accessible in the PATH. To configure where to find smartctl and what devices to monitor, see the configuration page.

If you don’t have smartctl, you can install it with:

Terminal window
# On Ubuntu/Debian
apt install smartmontools
# On Fedora, CentOS, Almalinux, RockyLinux or similar
yum install smartmontools

If you installed Glouton as a package or with wget, you don’t need any additional setup, otherwise you need to allow glouton to run the command as root. In /etc/sudoers.d/glouton, add the following line (you may need to change the location of smartctl, use which smartctl):

glouton ALL=(ALL) NOPASSWD: /usr/sbin/smartctl

The following metrics will be gathered:

Metric Description
smart_device_health_status Disk health status
smart_device_read_error_rate Read error rate
smart_device_seek_error_rate Seek error rate
smart_device_temp_c Disk temperature in °C
smart_device_udma_crc_errors Count of errors in data transfer via the interface cable
smart_device_media_wearout_indicator Media wearout indicator
smart_device_percent_lifetime_remain Lifetime remaining in percent
smart_device_wear_leveling_count Wear leveling count

On Linux and Windows, the agent will gather the temperature of your hardware components in the metric sensor_temperature. By default only the CPU temperature is allowed (with the label sensor="coretemp_package_id_*"), you can add more metrics using metric filtering.

For example to allow metrics from all sensors you can add the following to your configuration:

metric:
allow_metrics:
- sensor_temperature

On Linux systems, the agent have the ability to collect information about MD arrays by using the /proc/mdstat file and the mdadm command.

If you don’t have mdadm, you can install it with:

Terminal window
# On Ubuntu/Debian
apt install mdadm
# On Fedora, CentOS, Almalinux, RockyLinux or similar
yum install mdadm

If you installed Glouton as a package or with wget, you don’t need any additional setup, otherwise you need to allow glouton to run the command as root. In /etc/sudoers.d/glouton, add the following line (you may need to change the location of mdadm, use which mdadm):

glouton ALL=(ALL) NOPASSWD: /usr/sbin/mdadm --detail *

The following metrics will be gathered:

Metric Description
mdstat_health_status Array health status
mdstat_disks_active_count Count of active disks
mdstat_disks_down_count Count of down disks
mdstat_disks_failed_count Count of failing disks
mdstat_disks_spare_count Count of spare disks
mdstat_disks_total_count Total count of disks

The mdstat_health_status metric description may give additional information about the time remaining for the array recovery / resynchronization.