Alert monitor for Lenovo BladeCenter Power health

IBM.SystemX.OOBPowerModule.Health (UnitMonitor)

Knowledge Base article:

Summary

This monitor watches for events related to the BladeCenter's power module(s) that may affect the operation of this blade.

Configuration

You can disable this monitor through the Operations Manager's Operations Console. See the "Disable monitors" topic in the Operations Manager's Operations User's Guide for more information.

Hardware-platform-specific events, such as this one, are delivered asynchronously to this monitor. There is no monitoring interval to configure for this monitor.

The hardware event with this monitor is available only on an Lenovo System with the appropriate hardware sensors and with a management controller (also called a Service Processor), such as Integrated Management Module (IMM), Baseboard Management Controller (BMC), Remote Supervisor Adapter (RSA), or an equivalent management controller on an older Lenovo System.

This monitor depends on hardware instrumentation software, namely the IBM Director Platform Agent (also called Core Services) and the Intelligent Platform Management Interface (IPMI) driver stack. This software raises the hardware event to the WMI level, so that the monitor can be notified. On certain configurations, the RSA daemon can be used in place of, or in parallel with, the IPMI driver stack. See the "Additional Information" section below for more information about IBM Director Platform Agent, the IPMI driver stack and the RSA daemon.

Causes

When events occur on the BladeCenter's power module(s) that may affect the operation of this blade, the health state of this monitor is then set to the Critical or Warning state. For the exact details of this event, consult the BladeCenter view of this Management Pack and look at this blade's BladeCenter.

The relevant Lenovo hardware knowledge articles are available on a system with the Lenovo hardware Management Pack package installed.

Resolutions

Visit Lenovo BladeCenter Power Modules view and review and resolve any pending error.

To verify that the hardware problem has been resolved, review Lenovo BladeCenter Power Modules view and verify Lenovo BladeCenter Power Modules are healthy.

The monitor generates an alert when its health state changes to either Critical or Warning. After the Lenovo BladeCenter Module problem is resolved, manually reset the health state of this monitor. However, any outstanding corresponding alerts will be automatically closed. See the "Reset Health" topic in the Operations Manager's Operations User's Guide for more information.

Additional

External

Links to Lenovo resources

Element properties:

TargetIBM.SystemX.BladeOOBHeathyReflection
Parent MonitorSystem.Health.AvailabilityState
CategoryCustom
EnabledTrue
Alert GenerateTrue
Alert SeverityError
Alert PriorityNormal
Alert Auto ResolveTrue
Monitor TypeIBM.IBOOB.3StateManualResetMonitorType
RemotableTrue
AccessibilityInternal
Alert Message
OOB Power health alert
Please see the alert context for details.
RunAsDefault

Source Code:

<UnitMonitor ID="IBM.SystemX.OOBPowerModule.Health" Accessibility="Internal" Enabled="true" Target="IBM.SystemX.BladeOOBHeathyReflection" ParentMonitorID="Health!System.Health.AvailabilityState" Remotable="true" Priority="Normal" TypeID="IBM.IBOOB.3StateManualResetMonitorType" ConfirmDelivery="true">
<Category>Custom</Category>
<AlertSettings AlertMessage="IBM.SystemX.OOBPowerModule.Health_AlertMessageResourceID">
<AlertOnState>Warning</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>Normal</AlertPriority>
<AlertSeverity>Error</AlertSeverity>
</AlertSettings>
<OperationalStates>
<OperationalState ID="OOBSuccessOpState" MonitorTypeStateID="SuccessEventRaised" HealthState="Success"/>
<OperationalState ID="OOBWarningOpState" MonitorTypeStateID="WarningEventRaised" HealthState="Warning"/>
<OperationalState ID="OOBErrorOpState" MonitorTypeStateID="ErrorEventRaised" HealthState="Error"/>
</OperationalStates>
<Configuration>
<EventID>375</EventID>
</Configuration>
</UnitMonitor>