Health Service Heartbeat Failure

Microsoft.SystemCenter.HealthService.Heartbeat (AggregateMonitor)

This monitor checks the availability of each System Center Management Health Service in the management group.

Knowledge Base article:

Summary

The System Center Management Health Service has stopped on a computer.

This alert is generated by a Heath Service Watcher. This object is run on the All Management Servers Resource Pool and monitors the health of all System Center Management Health Services in a Management Group. When a System Center Management Health Service fails to heartbeat, as set of Diagnostics are run and Recoveries are then executed to attempt to fix this problem with the remote agent.

Causes

This can happen when:

Resolutions

Recoveries may be enabled to restart the agent.

In the event that the recovery is disabled or was not successful, you can navigate to the "State Change Events" tab and run the recovery from the details pane. To open the State Change Events tab, right-click the alert, point to Open, and then click Health Explorer. In Health Explorer, select the alert that is in a critical state, and then in the right-hand pane, click the State Change Events tab. In the details pane, click the recovery task.

Note: A diagnostic task for this monitor runs ping. The default firewall settings for computers running Windows Vista or Windows Server 2008 prevent the computers from providing a ping response. To configure these computers to allow incoming ICMP traffic, see "Create an Inbound ICMP Rule on Windows Vista or Windows Server 2008" ( http://go.microsoft.com/fwlink/?LinkId=161045).

Element properties:

TargetMicrosoft.SystemCenter.HealthServiceWatcher
Parent MonitorSystem.Health.AvailabilityState
AlgorithmWorstOf
CategoryStateCollection
EnabledTrue
Alert GenerateTrue
Alert SeverityError
Alert PriorityHigh
Alert Auto ResolveTrue
RemotableTrue
AccessibilityPublic
Alert Message
Health Service Heartbeat Failure
The System Center Management Health Service on computer {0} failed to heartbeat.

Source Code:

<AggregateMonitor ID="Microsoft.SystemCenter.HealthService.Heartbeat" Accessibility="Public" Enabled="true" Target="SCLibrary!Microsoft.SystemCenter.HealthServiceWatcher" ParentMonitorID="Health!System.Health.AvailabilityState" Remotable="true" Priority="High">
<Category>StateCollection</Category>
<AlertSettings AlertMessage="Microsoft.SystemCenter.HealthService.Heartbeat.AlertMessage">
<AlertOnState>Error</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>High</AlertPriority>
<AlertSeverity>Error</AlertSeverity>
<AlertParameters>
<AlertParameter1>$Target/Property[Type="SCLibrary!Microsoft.SystemCenter.HealthServiceWatcher"]/HealthServiceName$</AlertParameter1>
</AlertParameters>
</AlertSettings>
<Algorithm>WorstOf</Algorithm>
</AggregateMonitor>