Alert monitor for Lenovo BladeCenter SAS I/O module XML TCP communication failures

IBM.BladeCenter.IOXMLTCPCommunication.Failed (UnitMonitor)


Knowledge Base article:

Summary

This monitor watches for a BladeCenter event that indicates that the advanced management module (AMM) is unable to communicate with a SAS I/O module.

Configuration

You can disable this monitor through the Operations Manager's Operations Console. See the "Disable monitors" topic in the Operations Manager's Operations User's Guide for more information.
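When the monitor is disabled from the console, the change is typically stored as a property override in an unsealed management pack. The fragment below is a minimal sketch of such an override, not text taken from the Hardware Management Pack. The override ID and the reference alias `LenovoBC` are hypothetical, and the sketch assumes that the same referenced management pack defines both the monitor and its target class.

```xml
<!-- Sketch only: belongs under Monitoring/Overrides in an unsealed override management pack. -->
<!-- "LenovoBC" is a hypothetical alias declared in that pack's Manifest/References section. -->
<Overrides>
  <!-- The ID is a hypothetical name chosen for this example. -->
  <MonitorPropertyOverride ID="Example.Override.Disable.IOXMLTCPCommunication.Failed"
                           Context="LenovoBC!IBM.BladeCenter.IOModule"
                           Enforced="false"
                           Monitor="LenovoBC!IBM.BladeCenter.IOXMLTCPCommunication.Failed"
                           Property="Enabled">
    <!-- Setting Enabled to false turns the monitor off for every instance of the target class. -->
    <Value>false</Value>
  </MonitorPropertyOverride>
</Overrides>
```

Scoping the override to the whole class, as here, disables the monitor for all I/O modules. An override created in the console for a single module or group would additionally carry a context instance.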

The BladeCenter event is delivered to this monitor asynchronously. There is no monitoring interval to configure for this monitor.

The BladeCenter event is delivered to this monitor from the BladeCenter's AMM via the Simple Network Management Protocol (SNMP). It passes through the BladeCenter runtime support of the Hardware Management Pack installed on the management server that was designated to manage the BladeCenter during the Network Device Discovery process.

For the proper BladeCenter AMM SNMP settings that are required for the Hardware Management Pack to discover BladeCenter modules and report events, consult the Hardware Management Pack's User's Guide.

Causes

When the AMM is unable to communicate with the specified SAS I/O module, the BladeCenter's AMM generates a hardware event. The health state of this monitor is then set to the Critical or Warning state. The SAS I/O module may be functioning normally, but the AMM will not be able to generate any additional events for the SAS I/O module.

For a particular incident, review the history in the State Changes tab. Consult the relevant hardware knowledge articles listed below, keeping in mind the relevant event data.

The relevant Lenovo hardware knowledge articles are available on a system with the Lenovo Hardware Management Pack package installed.

The "Instance Number" is the numeric part of the name or description of an alert or event, and it identifies the number of the bay or module. For example, the message 'A communication failure has occurred on the XML TCP command/response port for I/O Module "Instance Number"' would appear for I/O module 01 as "A communication failure has occurred on the XML TCP command/response port for I/O Module 01".

Resolutions

Review the relevant Lenovo hardware knowledge articles listed above for information about how to resolve the hardware problem for a particular incident.

When the AMM is unable to communicate with the specified SAS I/O module, the health state of this monitor changes to either Critical or Warning, and the monitor generates an alert. After the AMM is able to communicate with the SAS I/O module again, manually reset the health state of this monitor. See the "Reset Health" topic in the Operations Manager's Operations User's Guide for more information. Any outstanding corresponding alerts are closed automatically when the health state of this monitor returns to the Healthy state.

Additional

For the proper AMM SNMP settings needed for the Hardware Management Pack, see the "Configuring BladeCenter SNMP settings" topic in the Lenovo Hardware Management Pack for Microsoft System Center Operations Manager Installation and User's Guide.

External

Links to Lenovo resources

Element properties:

Target: IBM.BladeCenter.IOModule
Parent Monitor: System.Health.ConfigurationState
Category: Custom
Enabled: True
Alert Generate: True
Alert Severity: MatchMonitorHealth
Alert Priority: Normal
Alert Auto Resolve: True
Monitor Type: IBM.BladeCenter.SNMPTrap.3StateMonitorTypeForModule
Remotable: True
Accessibility: Public
Alert Message:
    Lenovo BladeCenter SAS I/O module XML TCP communication failures
    {0} -- EventID = {1}
RunAs: Default

Source Code:

<UnitMonitor ID="IBM.BladeCenter.IOXMLTCPCommunication.Failed" Accessibility="Public" Enabled="true" Target="IBM.BladeCenter.IOModule" ParentMonitorID="Health!System.Health.ConfigurationState" Remotable="true" Priority="Normal" TypeID="IBM.BladeCenter.SNMPTrap.3StateMonitorTypeForModule" ConfirmDelivery="false">
  <Category>Custom</Category>
  <AlertSettings AlertMessage="IBM.BladeCenter.IOXMLTCPCommunication.Failed.AlertMessageID">
    <AlertOnState>Warning</AlertOnState>
    <AutoResolve>true</AutoResolve>
    <AlertPriority>Normal</AlertPriority>
    <AlertSeverity>MatchMonitorHealth</AlertSeverity>
    <AlertParameters>
      <AlertParameter1>$Data/Context/SnmpVarBinds/SnmpVarBind[OID='.1.3.6.1.4.1.2.6.158.3.1.1.8']/Value$</AlertParameter1>
      <AlertParameter2>$Data/Context/SnmpVarBinds/SnmpVarBind[OID='.1.3.6.1.4.1.2.6.158.3.1.1.14']/Value$</AlertParameter2>
    </AlertParameters>
  </AlertSettings>
  <OperationalStates>
    <OperationalState ID="ComponentSuccess" MonitorTypeStateID="SuccessEventRaised" HealthState="Success"/>
    <OperationalState ID="ComponentWarning" MonitorTypeStateID="WarningEventRaised" HealthState="Warning"/>
    <OperationalState ID="ComponentError" MonitorTypeStateID="ErrorEventRaised" HealthState="Error"/>
  </OperationalStates>
  <Configuration>
    <EventIds>245430529|24543053[0-8]</EventIds>
  </Configuration>
</UnitMonitor>
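
The AlertSettings element in the monitor source references the alert message only by ID. The message text itself lives elsewhere in the management pack, and its {0} and {1} placeholders are filled from AlertParameter1 and AlertParameter2 respectively. The fragment below is an illustrative reconstruction based on the Alert Message shown under Element properties, not a copy of the management pack's actual source; the language pack ID `ENU` is an assumption.

```xml
<!-- Sketch only: two fragments from different sections of a management pack. -->

<!-- Under Presentation/StringResources: declares the ID that AlertSettings points to. -->
<StringResource ID="IBM.BladeCenter.IOXMLTCPCommunication.Failed.AlertMessageID"/>

<!-- Under LanguagePacks: supplies the text for that ID. The "ENU" language pack is assumed. -->
<LanguagePack ID="ENU" IsDefault="true">
  <DisplayStrings>
    <DisplayString ElementID="IBM.BladeCenter.IOXMLTCPCommunication.Failed.AlertMessageID">
      <!-- Alert name as shown in the console. -->
      <Name>Lenovo BladeCenter SAS I/O module XML TCP communication failures</Name>
      <!-- {0} receives AlertParameter1 and {1} receives AlertParameter2 from the SNMP trap. -->
      <Description>{0} -- EventID = {1}</Description>
    </DisplayString>
  </DisplayStrings>
</LanguagePack>
```

Keeping the text in a language pack rather than in the monitor definition lets the same monitor present localized alert names and descriptions.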