This monitor watches for a BladeCenter event that indicates that the Blade Expansion Module of a blade server has exceeded a temperature threshold.
You can disable this monitor through the Operations Manager's Operations Console. See the "Disable monitors" topic in the Operations Manager's Operations User's Guide for more information.
The BladeCenter event is delivered to this monitor asynchronously. There is no monitoring interval to configure for this monitor.
The BladeCenter event is delivered to this monitor from the AMM (Advanced Management Module) of the BladeCenter via the SNMP (Simple Network Management Protocol) protocol. It also goes through the BladeCenter runtime support of the Hardware Management Pack installed on the management server that was designated to manage the BladeCenter during the Network Device Discovery process.
For the proper BladeCenter AMM SNMP settings that are required for the Hardware Management Pack to discover BladeCenter modules and report events, consult the Hardware Management Pack's User's Guide.
When the service processor on the specified blade server has detected that the Blade Expansion Module has reached or exceeded the fault threshold, the BladeCenter's AMM generates a hardware event. The health state of this monitor is then set to the Critical or Warning state.
For a particular incident, review the history in the State Changes tab. Consult the relevant hardware knowledge articles listed below, keeping in mind the relevant event data.
The relevant Lenovo hardware knowledge articles are available on a system with the Lenovo Hardware Management Pack package installed.
BEM temperature fault
BEM over recommended temperature
BEM Failure
Expansion Module fault
BEM 2.5V over recommended voltage
BEM 2.5V under recommended voltage
BEM 3.3V over recommended voltage
BEM 3.3V under recommended voltage
BEM 5V over recommended voltage
BEM 5V under recommended voltage
BEM 12V over recommended voltage
BEM 12V under recommended voltage
BEM 18V over recommended voltage
BEM 18V under recommended voltage
BEM 1.5V over recommended voltage
BEM 1.5V under recommended voltage
BEM 1V over recommended voltage
BEM 1V under recommended voltage
BEM 12V standby over recommended voltage
BEM 12V standby under recommended voltage
BEM 1.8V over recommended voltage
BEM 1.8V under recommended voltage
BEM "Instance Number" fault
BEM "Instance Number" fault
BSE RAID fault
BSE RAID battery failure
The "Instance Number" can be found as the numeric part of the name/description for an alert or event, and it reflects the number of the bay/module. For instance, 'BEM "Instance Number" fault' could actually mean "BEM 02 fault".
Review the relevant Lenovo hardware knowledge articles listed above for information about how to resolve the hardware problem for a particular incident.
After the hardware problem is resolved, manually reset the health state of this monitor. However, any outstanding corresponding alerts will be automatically closed. See the "Reset Health" topic in the Operations Manager's Operations User's Guide for more information.
To verify that the hardware problem has been resolved, refer to the most recent health state of the corresponding "regular health checkup monitor." Be sure to refer to a health state that was reported later than the hardware event.
For the proper AMM SNMP settings needed for the Hardware Management Pack, see the "Configuring BladeCenter SNMP settings" topic in the Lenovo Hardware Management Pack for Microsoft System Center Operations Manager Installation and User's Guide.
Links to Lenovo resources
Target | IBM.BladeCenter.BladeModule | ||
Parent Monitor | System.Health.ConfigurationState | ||
Category | Custom | ||
Enabled | True | ||
Alert Generate | True | ||
Alert Severity | MatchMonitorHealth | ||
Alert Priority | Normal | ||
Alert Auto Resolve | True | ||
Monitor Type | IBM.BladeCenter.SNMPTrap.3StateManualResetMonitorTypeForModule | ||
Remotable | True | ||
Accessibility | Public | ||
Alert Message |
| ||
RunAs | Default |
<UnitMonitor ID="IBM.BladeCenter.BladeExpansionModule.Failed" Accessibility="Public" Enabled="true" Target="IBM.BladeCenter.BladeModule" ParentMonitorID="Health!System.Health.ConfigurationState" Remotable="true" Priority="Normal" TypeID="IBM.BladeCenter.SNMPTrap.3StateManualResetMonitorTypeForModule" ConfirmDelivery="false">
<Category>Custom</Category>
<AlertSettings AlertMessage="IBM.BladeCenter.BladeExpansionModule.Failed.AlertMessageID">
<AlertOnState>Warning</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>Normal</AlertPriority>
<AlertSeverity>MatchMonitorHealth</AlertSeverity>
<AlertParameters>
<AlertParameter1>$Data/Context/SnmpVarBinds/SnmpVarBind[OID='.1.3.6.1.4.1.2.6.158.3.1.1.8']/Value$</AlertParameter1>
<AlertParameter2>$Data/Context/SnmpVarBinds/SnmpVarBind[OID='.1.3.6.1.4.1.2.6.158.3.1.1.14']/Value$</AlertParameter2>
</AlertParameters>
</AlertSettings>
<OperationalStates>
<OperationalState ID="ComponentSuccess" MonitorTypeStateID="SuccessEventRaised" HealthState="Success"/>
<OperationalState ID="ComponentWarning" MonitorTypeStateID="WarningEventRaised" HealthState="Warning"/>
<OperationalState ID="ComponentError" MonitorTypeStateID="ErrorEventRaised" HealthState="Error"/>
</OperationalStates>
<Configuration>
<EventIds>102876289|102880385|109051904|239175682|243467266|243468290|243475458|243476482|243483650|243484674|243491842|243492866|243516418|243517442|243532802|243533826|243598338|243599362|243663874|243664898|243770370|243771394|247463937|247463938|249561088</EventIds>
</Configuration>
</UnitMonitor>