Health monitor for motherboard

Lenovo.IMM.MBFailure (UnitMonitor)

Knowledge Base article:

Summary

This monitor watches for a IMM event that indicates that IMM has detected that a fatal motherboard failure in the system.

Configuration

You can disable this monitor through the Operations Manager's Operations Console. See the "Disable monitors" topic in the Operations Manager's Operations User's Guide for more information.

The IMM(Integrated Management Module) event is delivered to this monitor asynchronously. There is no monitoring interval to configure for this monitor.

The IMM event is delivered to this monitor from the authenticated IMM. Please be sure the IMM is authenticated first. To authenticate an IMM, select an IMM from Lenovo Integrated Management Module (IMM), then run task "Authenticate IMM".

The IMM event is delivered to this monitor from the Server via network port "9500". Please be sure that this port is not blocked by firewall. You can follow below steps to create a rule to pass firewall:

Resolutions

This is a UEFI detected event. The UEFI diagnostic code for this event can be found in the logged IMM message text. Please refer to the UEFI diagnostic code in the "UEFI diagnostic code" section of the Info Center for the appropriate user response.

Review the details of the hardware event. Contact Lenovo support (see links below) if the reports or relevant articles do not provide enough information to resolve the hardware problem.

After the hardware problem is resolved, manually reset the health state of this monitor. However, any outstanding corresponding alerts will be automatically closed. See the "Reset Health" topic in the Operations Manager's Operations User's Guide for more information.

To verify that the hardware problem has been resolved, refer to the most recent health state of the corresponding "regular health checkup monitor." Be sure to refer to a health state that was reported later than the hardware event.

External

Links to Lenovo resources

Element properties:

TargetLenovo.HardwareMgmtPack.IMM2.ManagementModule
Parent MonitorLenovo.IMM.SystemMonitor.Rollup
CategoryCustom
EnabledTrue
Alert GenerateFalse
Alert Auto ResolveFalse
Monitor TypeLenovo.IMM.UMT.Plat3State
RemotableTrue
AccessibilityPublic
RunAsDefault

Source Code:

<UnitMonitor ID="Lenovo.IMM.MBFailure" Accessibility="Public" Target="Lenovo.HardwareMgmtPack.IMM2.ManagementModule" Enabled="true" TypeID="Lenovo.IMM.UMT.Plat3State" ParentMonitorID="Lenovo.IMM.SystemMonitor.Rollup">
<Category>Custom</Category>
<!--AlertSettings AlertMessage="Lenovo.IMM.Alert.Rule.AlertMessageID">
<AlertOnState>Warning</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>Low</AlertPriority>
<AlertSeverity>MatchMonitorHealth</AlertSeverity>
<AlertParameters>
<AlertParameter1>$Data/Context/EventDescription$</AlertParameter1>
<AlertParameter2>$Data/Context/EventSourceName$</AlertParameter2>
<AlertParameter3>$Data/Context/EventData/Data/MessageID[1]$</AlertParameter3>
<AlertParameter4>$Data/Context/EventData/Data/MessageName[1]$</AlertParameter4>
</AlertParameters>
</AlertSettings-->
<OperationalStates>
<OperationalState HealthState="Success" MonitorTypeStateID="HealthyEventRaised" ID="Success"/>
<OperationalState HealthState="Warning" MonitorTypeStateID="WarningEventRaised" ID="Warning"/>
<OperationalState HealthState="Error" MonitorTypeStateID="ErrorEventRaised" ID="Error"/>
</OperationalStates>
<Configuration>
<IMMIP>$Target/Property[Type="Lenovo.HardwareMgmtPack.IMM2.ManagementModule"]/IMMIP$</IMMIP>
<UnhealthyPlat>PLAT0795</UnhealthyPlat>
<HealthyPlat>Unrecoverable</HealthyPlat>
</Configuration>
</UnitMonitor>