You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
01:00.0 RAID bus controller: Broadcom / LSI MegaRAID SAS-3 3108 [Invader] (rev 02)
storcli 007.3006.0000.0000-1 on ArchLinux (installed from AUR).
# /usr/lib/monitoring-plugins/check_lsi_raid -V
check_lsi_raid: Nagios/Icinga plugin to check LSI Raid Controller status
Version: 2.5
StorCli SAS Customization Utility Ver 007.3006.0000.0000 Apr 17, 2024
Icinga service definition:
apply Service for (name => config in host.vars.lsiraid) {
import "generic-service"
check_command = "lsi-raid"
vars += config
}
object Host "myhost" {
/* Import the default host template defined in `templates.conf`. */
import "linux-server"
address = "1.1.1.1"
vars.lsiraid["LSI 3108"] = {
lsi_ignored_other_errors = 9999999
lsi_ignored_media_errors = 9999999
}
vars.lsiraid["RAID slot 1"] = {
lsi_enclosure_id=1
lsi_pd_id=0
lsi_ignored_other_errors=8
}
[... and next 15 slots as above]
}
When the default 1-minute check interval was used, host I/O performance suffered dramatically. It looks like the controller stops some I/O operations when a storcli command is executed. I discovered this by looking for processes in the IO_WAIT state. The number of waiting processes increased when storcli was executed and I experienced slowdowns of various VMs and services hosted on my server.
As a workaround I've added check_period=15m and 15m day TimePeriod window to force Icinga2 to check LSI only once a day.
I am aware that there is probably nothing you can do to fix this problem. I spent a lot of time trying to figure out what was causing I/O problems on my host, so it may be worth adding a warning to this plugin documentation for others.
The text was updated successfully, but these errors were encountered:
01:00.0 RAID bus controller: Broadcom / LSI MegaRAID SAS-3 3108 [Invader] (rev 02)
storcli 007.3006.0000.0000-1 on ArchLinux (installed from AUR).
Icinga service definition:
When the default 1-minute check interval was used, host I/O performance suffered dramatically. It looks like the controller stops some I/O operations when a storcli command is executed. I discovered this by looking for processes in the IO_WAIT state. The number of waiting processes increased when storcli was executed and I experienced slowdowns of various VMs and services hosted on my server.
As a workaround I've added check_period=15m and 15m day TimePeriod window to force Icinga2 to check LSI only once a day.
I am aware that there is probably nothing you can do to fix this problem. I spent a lot of time trying to figure out what was causing I/O problems on my host, so it may be worth adding a warning to this plugin documentation for others.
The text was updated successfully, but these errors were encountered: