Regular health checkup monitor for Lenovo hardware management software failures

IBM.WinSw.HwMgmt.Failed (UnitMonitor)


Regular health checkup monitor for Lenovo hardware management software failures

Knowledge Base article:

Summary

This monitor checks for system management software failure.

Configuration

You can disable this monitor through the Operations Manager's Operations Console. See the "Disable monitors" topic in the Operations Manager's Operations User's Guide for more information.

You can also change the interval between the health checkups by overriding the value of the "IntervalSeconds" parameter of the monitor. See the "Override" topic in the Operations Manager's Operations User's Guide.

Causes

When the system management software fails on a target system, an alert is generated to Operations Manager. The health state of this monitor is then set to the Critical or Warning state.

For a particular incident, review the history in the State Changes tab. Consult the relevant knowledge articles listed below, keeping in mind the relevant event data.

The relevant Lenovo knowledge articles are available on a system with the Lenovo Hardware Management Pack package installed.

Resolutions

Review the health checkup report's details about the system management software. Contact Lenovo support (see links below) if the reports or relevant articles do not provide enough information to help you resolve the problem.

After the problem is resolved, the overall health state of this monitor is automatically restored to the Healthy state. However, you must manually close any corresponding alerts that might have occurred.

Additional

External

Links to Lenovo resources

Element properties:

TargetIBM.WinSw.HwMgmt
Parent MonitorSystem.Health.AvailabilityState
CategoryCustom
EnabledTrue
Alert GenerateTrue
Alert SeverityMatchMonitorHealth
Alert PriorityNormal
Alert Auto ResolveTrue
Monitor TypeIBM.WinSw.HwMgmt.Failed.MonitorType
RemotableTrue
AccessibilityPublic
Alert Message
Management Software Failed

Management Software Failed
RunAsDefault

Source Code:

<UnitMonitor ID="IBM.WinSw.HwMgmt.Failed" Accessibility="Public" Enabled="true" Target="IBM.WinSw.HwMgmt" ParentMonitorID="Health!System.Health.AvailabilityState" Remotable="true" Priority="Normal" TypeID="IBM.WinSw.HwMgmt.Failed.MonitorType" ConfirmDelivery="false">
<Category>Custom</Category>
<AlertSettings AlertMessage="IBM.WinSw.HwMgmt.Failed.AlertMessageResourceID">
<AlertOnState>Warning</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>Normal</AlertPriority>
<AlertSeverity>MatchMonitorHealth</AlertSeverity>
</AlertSettings>
<OperationalStates>
<OperationalState ID="Healthy" MonitorTypeStateID="Healthy" HealthState="Success"/>
<OperationalState ID="Warning" MonitorTypeStateID="Warning" HealthState="Warning"/>
<OperationalState ID="Critical" MonitorTypeStateID="Error" HealthState="Error"/>
</OperationalStates>
<Configuration>
<IntervalSeconds>7200</IntervalSeconds>
<TimeoutSeconds>300</TimeoutSeconds>
</Configuration>
</UnitMonitor>