UNIX/Linux Heartbeat Monitor

Microsoft.Unix.WSMan.Heartbeat.Monitor (UnitMonitor)

UNIX/Linux Heartbeat Health

Knowledge Base article:

Summary

This monitor verifies that the CIM server daemon is running and reachable. If the state is unknown, either monitoring has not begun for this object or there are no monitors defined that monitor availability.

This heartbeat monitor allows configuration of a Missed Heartbeats threshold. See the Configuration section below for information on tuning this threshold.

Configuration

Default Configuration

Parameter                            Default Value
Missed Heartbeats                    2
Missed Heartbeat Window (seconds)    630

To adjust the number of heartbeats that must fail before an alert is generated, override the "Missed Heartbeats" parameter with a value of 1 or greater. The Missed Heartbeat Window (seconds) value must also be overridden so that it is greater than Interval * Missed Heartbeats. With the default Interval of 300 seconds and 2 missed heartbeats, the window must exceed 600 seconds, which the default of 630 satisfies.
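As a rough illustration of the rule above, the fragment below sketches what the effective configuration might look like if the monitor were tuned to tolerate 3 missed heartbeats at the default 300-second interval. The values 3 and 930 are hypothetical; the 30-second margin simply mirrors the gap between the default window (630) and Interval * Missed Heartbeats (600), and is an assumption rather than a documented requirement. In practice these values would normally be applied as overrides rather than by editing the monitor definition.

```xml
<!-- Illustrative values only, not a recommended setting -->
<Configuration>
  <Interval>300</Interval>
  <!-- Hypothetical override: alert after 3 missed heartbeats instead of 2 -->
  <MissedHeartbeats>3</MissedHeartbeats>
  <!-- Must be greater than Interval * MissedHeartbeats = 300 * 3 = 900 -->
  <MissedWindowSeconds>930</MissedWindowSeconds>
</Configuration>
```

Any window value above 900 seconds would satisfy the stated constraint for these settings.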

Causes

An unhealthy state for this monitor indicates that the CIM server daemon was unreachable with the WS-Management protocol.

This can happen when:

You may want to check the Operations Manager event log for errors, check the Operations Console for additional alerts, and confirm that the Run As accounts (with valid credentials) and Run As profiles are set up correctly.

View all current alerts from this object using this link:

Alerts

Resolutions

Recoveries may be enabled to restart the agent.

In the event that the recovery is disabled or was not successful, you can navigate to the "State Change Events" tab and run the recovery from the details pane.

Element properties:

Target: Microsoft.Unix.Computer
Parent Monitor: System.Health.AvailabilityState
Category: AvailabilityHealth
Enabled: True
Alert Generate: True
Alert Severity: Error
Alert Priority: Normal
Alert Auto Resolve: True
Monitor Type: Microsoft.Unix.WSMan.Heartbeat.MonitorType
Remotable: True
Accessibility: Public
Alert Message:
  Heartbeat failed
  The System is not responding to heartbeats.
RunAs: Default

Source Code:

<UnitMonitor ID="Microsoft.Unix.WSMan.Heartbeat.Monitor" Enabled="true" Accessibility="Public" Target="Microsoft.Unix.Computer" TypeID="Microsoft.Unix.WSMan.Heartbeat.MonitorType" ParentMonitorID="SystemHealth!System.Health.AvailabilityState">
  <Category>AvailabilityHealth</Category>
  <AlertSettings AlertMessage="Microsoft.Unix.WSMan.Heartbeat.AlertMessage">
    <AlertOnState>Error</AlertOnState>
    <AutoResolve>true</AutoResolve>
    <AlertPriority>Normal</AlertPriority>
    <AlertSeverity>Error</AlertSeverity>
  </AlertSettings>
  <OperationalStates>
    <OperationalState HealthState="Success" MonitorTypeStateID="Available" ID="Available"/>
    <OperationalState HealthState="Error" MonitorTypeStateID="NotAvailable" ID="NotAvailable"/>
  </OperationalStates>
  <Configuration>
    <Interval>300</Interval>
    <MissedHeartbeats>2</MissedHeartbeats>
    <MissedWindowSeconds>630</MissedWindowSeconds>
  </Configuration>
</UnitMonitor>