Health monitor for memory CRC error

Lenovo.IMM.MemoryCRCAbnormal (UnitMonitor)

Knowledge Base article:

Summary

This monitor watches for an IMM event indicating that the IMM has detected a memory CRC error.

Configuration

You can disable this monitor through the Operations Manager's Operations Console. See the "Disable monitors" topic in the Operations Manager's Operations User's Guide for more information.
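As an alternative to the console, the same effect can be sketched as an override in a separate, unsealed management pack. The fragment below is only an illustration under stated assumptions: the override ID and the `LenovoIMM` alias are hypothetical names, and the alias must match a reference in that pack's manifest to the management pack that defines this monitor and its target class.

```xml
<!-- Sketch only; place inside the Monitoring section of an unsealed override management pack. -->
<!-- "LenovoIMM" is a hypothetical alias for the management pack that defines this monitor. -->
<!-- The override ID is likewise a made-up example name. -->
<Overrides>
  <MonitorPropertyOverride ID="Custom.Override.Disable.Lenovo.IMM.MemoryCRCAbnormal"
                           Context="LenovoIMM!Lenovo.HardwareMgmtPack.IMM2.ManagementModule"
                           Enforced="false"
                           Monitor="LenovoIMM!Lenovo.IMM.MemoryCRCAbnormal"
                           Property="Enabled">
    <!-- Setting Enabled to false disables the monitor for every instance of the context class. -->
    <Value>false</Value>
  </MonitorPropertyOverride>
</Overrides>
```

Using the target class as the override context disables the monitor for all IMM instances; scoping it to a group or a single instance would need a different context, which is not shown here.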

The IMM (Integrated Management Module) event is delivered to this monitor asynchronously, so there is no monitoring interval to configure.

The IMM event is delivered to this monitor from an authenticated IMM, so make sure the IMM is authenticated first. To authenticate an IMM, select it under Lenovo Integrated Management Module (IMM), then run the "Authenticate IMM" task.

The IMM event is delivered to this monitor from the server over network port 9500. Make sure that the firewall does not block this port. You can follow the steps below to create a firewall rule that allows this traffic:

Resolutions

Review the details of the hardware event. Contact Lenovo support (see links below) if the reports or relevant articles do not provide enough information to resolve the hardware problem.

After the hardware problem is resolved, manually reset the health state of this monitor. Any corresponding outstanding alerts are then closed automatically. See the "Reset Health" topic in the Operations Manager's Operations User's Guide for more information.

To verify that the hardware problem has been resolved, refer to the most recent health state of the corresponding "regular health checkup monitor." Be sure to refer to a health state that was reported later than the hardware event.

External

Links to Lenovo resources

Element properties:

Target: Lenovo.HardwareMgmtPack.IMM2.ManagementModule
Parent Monitor: Lenovo.IMM.MemoryMonitor.Rollup
Category: Custom
Enabled: True
Alert Generate: False
Alert Auto Resolve: False
Monitor Type: Lenovo.IMM.UMT.Plat3State
Remotable: True
Accessibility: Public
RunAs: Default

Source Code:

<UnitMonitor ID="Lenovo.IMM.MemoryCRCAbnormal" Accessibility="Public" Target="Lenovo.HardwareMgmtPack.IMM2.ManagementModule" Enabled="true" TypeID="Lenovo.IMM.UMT.Plat3State" ParentMonitorID="Lenovo.IMM.MemoryMonitor.Rollup">
  <Category>Custom</Category>
  <!--AlertSettings AlertMessage="Lenovo.IMM.Alert.Rule.AlertMessageID">
    <AlertOnState>Warning</AlertOnState>
    <AutoResolve>true</AutoResolve>
    <AlertPriority>Low</AlertPriority>
    <AlertSeverity>MatchMonitorHealth</AlertSeverity>
    <AlertParameters>
      <AlertParameter1>$Data/Context/EventDescription$</AlertParameter1>
      <AlertParameter2>$Data/Context/EventSourceName$</AlertParameter2>
      <AlertParameter3>$Data/Context/EventData/Data/MessageID[1]$</AlertParameter3>
      <AlertParameter4>$Data/Context/EventData/Data/MessageName[1]$</AlertParameter4>
    </AlertParameters>
  </AlertSettings-->
  <OperationalStates>
    <OperationalState HealthState="Success" MonitorTypeStateID="HealthyEventRaised" ID="Success"/>
    <OperationalState HealthState="Warning" MonitorTypeStateID="WarningEventRaised" ID="Warning"/>
    <OperationalState HealthState="Error" MonitorTypeStateID="ErrorEventRaised" ID="Error"/>
  </OperationalStates>
  <Configuration>
    <IMMIP>$Target/Property[Type="Lenovo.HardwareMgmtPack.IMM2.ManagementModule"]/IMMIP$</IMMIP>
    <UnhealthyPlat>PLAT0158</UnhealthyPlat>
    <HealthyPlat>Unrecoverable</HealthyPlat>
  </Configuration>
</UnitMonitor>