Root Management Server Unavailable

Microsoft.SystemCenter.SDKService.PrincipleManagementServerDown.Alert (Rule)

Alert-generating rule for when the Root Management Server is not reachable from this Management Server's Data Access service.

Knowledge Base article:

Summary

Either the Root Management Server is down or the Root Management Server's HealthService is not available.

Causes

This can be caused by:

If these alerts are generated during or shortly after installation of the Data Warehouse or reporting components, the likely cause is that the flow of alerts and state transitions to the DW has been opened before the corresponding data structures exist, because the Data Warehouse has not yet deployed its components. Deployment happens within ~30 minutes of installation.

Until deployment finishes, the alert and state sync workflows retain data so that it can be written to the DW later. These workflows therefore do not put data back to the data sources, which in turn generates the alerts in question. Once deployment is complete, the data is written and the alerts clear automatically.

If alerts do not auto-resolve within 120 minutes of Data Warehouse installation, then another problem exists.

Resolutions

Check the availability of the Root Management Server, or start the HealthService on the Root Management Server.
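
As a rough sketch of the first check, the Python below queries the HealthService state on a remote server with the Windows `sc` command and parses the result. The helper names and the server name `RMS01` are hypothetical, and the sketch assumes it runs on a Windows machine whose account is allowed to query the remote Service Control Manager.

```python
import re
import subprocess
from typing import Optional


def parse_sc_state(sc_output: str) -> Optional[str]:
    """Return the state name (e.g. RUNNING, STOPPED, PAUSED) from `sc query` output."""
    match = re.search(r"STATE\s*:\s*\d+\s+(\w+)", sc_output)
    return match.group(1) if match else None


def query_health_service(server: str) -> Optional[str]:
    """Run `sc \\\\server query HealthService` and return the parsed state.

    Returns None when the output contains no STATE line, for example when
    the server is unreachable or the service is not installed.
    """
    result = subprocess.run(
        ["sc", rf"\\{server}", "query", "HealthService"],
        capture_output=True,
        text=True,
        check=False,
    )
    return parse_sc_state(result.stdout)


# Example (placeholder server name, Windows only):
# state = query_health_service("RMS01")
# if state != "RUNNING":
#     print(f"HealthService state: {state}")
```

A state other than RUNNING, or no state at all, suggests that the service needs to be started or that the server itself is unavailable.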

Additional

The table below explains the possible reason codes that you may see in the alert description:

Reason Code Description
17 The HealthService Windows service is paused.
25 The HealthService Action Account is misconfigured or has invalid credentials.
41 The HealthService failed to parse the new configuration.
42 The HealthService failed to load the new configuration.
43 A System Rule failed to load.
49 Collection of Object State Change Events is stalled.
50 Collection of Monitor State Change Events is stalled.
51 Collection of Alerts is stalled.
97 The HealthService is unable to register with the Event Log Service. The HealthService cannot log additional Heartbeat and Connector events.
98 The HealthService is unable to parse configuration XML.
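
For scripts that triage these alerts, the reason codes above can be kept in a simple lookup table. The following is only an illustrative sketch: the names `REASON_CODES` and `describe_reason` are hypothetical, and the codes and descriptions are copied from the table rather than taken from any Operations Manager API.

```python
# Reason codes and descriptions as listed in the table above.
REASON_CODES = {
    17: "The HealthService Windows service is paused.",
    25: "The HealthService Action Account is misconfigured or has invalid credentials.",
    41: "The HealthService failed to parse the new configuration.",
    42: "The HealthService failed to load the new configuration.",
    43: "A System Rule failed to load.",
    49: "Collection of Object State Change Events is stalled.",
    50: "Collection of Monitor State Change Events is stalled.",
    51: "Collection of Alerts is stalled.",
    97: (
        "The HealthService is unable to register with the Event Log Service. "
        "The HealthService cannot log additional Heartbeat and Connector events."
    ),
    98: "The HealthService is unable to parse configuration XML.",
}


def describe_reason(code: int) -> str:
    """Return the description for a reason code, or a fallback for unlisted codes."""
    return REASON_CODES.get(code, f"Unknown reason code {code}")
```

Codes that are not in the table fall through to a generic message instead of raising an error, which keeps a triage script running when an unlisted code appears.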

Element properties:

Target: Microsoft.SystemCenter.ManagementServer
Category: Alert
Enabled: True
Event_ID: 271828
Event Source: Health Service Watcher
Alert Generate: True
Alert Severity: Warning
Alert Priority: Normal
Remotable: True
Alert Message:
Health Service Not Running
{0}
Event Log: Operations Manager
Comment: Rule used by Data Access service to generate alerts from each Management Server when the RHS is down

Member Modules:

ID Module Type TypeId RunAs 
EventDS DataSource Microsoft.Windows.EventProvider Default
GenerateAlert WriteAction System.Health.GenerateAlert Default

Source Code:

<Rule ID="Microsoft.SystemCenter.SDKService.PrincipleManagementServerDown.Alert" Comment="Rule used by Data Access service to generate alerts from each Management Server when the RHS is down" Enabled="true" Target="SCLibrary!Microsoft.SystemCenter.ManagementServer" ConfirmDelivery="true" Remotable="true" Priority="High" DiscardLevel="100">
  <Category>Alert</Category>
  <DataSources>
    <DataSource ID="EventDS" TypeID="Windows!Microsoft.Windows.EventProvider">
      <ComputerName>$Target/Host/Property[Type="Windows!Microsoft.Windows.Computer"]/NetworkName$</ComputerName>
      <LogName>Operations Manager</LogName>
      <Expression>
        <And>
          <Expression>
            <SimpleExpression>
              <ValueExpression>
                <XPathQuery>PublisherName</XPathQuery>
              </ValueExpression>
              <Operator>Equal</Operator>
              <ValueExpression>
                <Value>Health Service Watcher</Value>
              </ValueExpression>
            </SimpleExpression>
          </Expression>
          <Expression>
            <SimpleExpression>
              <ValueExpression>
                <XPathQuery>EventDisplayNumber</XPathQuery>
              </ValueExpression>
              <Operator>Equal</Operator>
              <ValueExpression>
                <Value>271828</Value>
              </ValueExpression>
            </SimpleExpression>
          </Expression>
        </And>
      </Expression>
    </DataSource>
  </DataSources>
  <WriteActions>
    <WriteAction ID="GenerateAlert" TypeID="Health!System.Health.GenerateAlert">
      <Priority>1</Priority>
      <Severity>1</Severity>
      <AlertName/>
      <AlertDescription/>
      <AlertMessageId>$MPElement[Name='Microsoft.SystemCenter.SDKService.PrincipleManagementServerDown.Alert.AlertMessage']$</AlertMessageId>
      <AlertParameters>
        <AlertParameter1>$Data/EventDescription$</AlertParameter1>
      </AlertParameters>
      <Suppression>
        <SuppressionValue/>
      </Suppression>
    </WriteAction>
  </WriteActions>
</Rule>
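
The filter in the EventDS data source is an And of two equality tests, on PublisherName and on EventDisplayNumber. The Python below is a minimal sketch of that matching logic only. It assumes an event is represented as a plain dictionary keyed by those two names, which is a simplification for illustration, and the function name `matches_rule` is hypothetical. The real evaluation is performed by the Microsoft.Windows.EventProvider module.

```python
# Values taken from the <Expression> block of the rule above.
EXPECTED_PUBLISHER = "Health Service Watcher"
EXPECTED_EVENT_NUMBER = "271828"


def matches_rule(event: dict) -> bool:
    """Return True when both equality tests of the rule's And expression hold."""
    publisher_ok = event.get("PublisherName") == EXPECTED_PUBLISHER
    # Compare as strings so that an integer event number also matches.
    number_ok = str(event.get("EventDisplayNumber")) == EXPECTED_EVENT_NUMBER
    return publisher_ok and number_ok
```

An event from any other publisher, or with any other event number, fails the And expression and does not reach the GenerateAlert write action.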