Alert-generating rule for when the Root Management Server is not reachable from this Management Server's Data Access service.
Either the Root Management Server is down or the Root Management Server's HealthService is not available.
This can be caused by:
The Root Management Server is offline.
The HealthService on the Root Management Server is not running.
An administrative restart of the HealthService on the Root Management Server.
If these alerts are generated during or shortly after installation of the Data Warehouse (DW) or reporting components, the likely cause is that the flow of alerts and state transitions to the DW has been opened while the corresponding data structures do not exist yet, because the Data Warehouse has not yet deployed its components. Deployment happens within ~30 minutes of installation.
Until deployment is complete, the alert and state sync workflows retain the data so that it can be written to the DW later. As a result, these workflows do not put data back to data sources, which in turn generates the alerts in question. Once deployment is complete, the data is written and the alerts clear automatically.
If the alerts do not auto-resolve within 120 minutes of Data Warehouse installation, another problem exists.
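The timing guidance above can be sketched as a small triage helper. This is only an illustration in Python: the function name `triage_alert`, the returned labels, and the idea of passing the installation time in by hand are assumptions, and the 30-minute figure is the approximate deployment window quoted above, not a guaranteed bound.

```python
from datetime import datetime, timedelta

# Thresholds taken from the text above; the 30-minute value is approximate.
DEPLOYMENT_WINDOW = timedelta(minutes=30)
AUTO_RESOLVE_LIMIT = timedelta(minutes=120)


def triage_alert(dw_installed_at: datetime, now: datetime) -> str:
    """Classify an alert by the time elapsed since Data Warehouse installation.

    Hypothetical helper: returns "deploying", "wait", or "investigate".
    """
    elapsed = now - dw_installed_at
    if elapsed < DEPLOYMENT_WINDOW:
        # Data Warehouse components are probably still being deployed.
        return "deploying"
    if elapsed <= AUTO_RESOLVE_LIMIT:
        # Deployment should be done; alerts are expected to clear on their own.
        return "wait"
    # Past the 120-minute mark: another problem is likely.
    return "investigate"
```

For example, an alert seen three hours after installation would be classified as `"investigate"`, which corresponds to the resolution step that follows.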
Check the availability of the Root Management Server, or start the HealthService on the Root Management Server.
The following table lists the reason codes that can appear in the alert description:
Reason Code | Description |
---|---|
17 | The HealthService Windows service is paused. |
25 | The HealthService Action Account is misconfigured or has invalid credentials. |
41 | The HealthService failed to parse the new configuration. |
42 | The HealthService failed to load the new configuration. |
43 | A System Rule failed to load. |
49 | Collection of Object State Change Events is stalled. |
50 | Collection of Monitor State Change Events is stalled. |
51 | Collection of Alerts is stalled. |
97 | The HealthService is unable to register with the Event Log Service. The HealthService cannot log additional Heartbeat and Connector events. |
98 | The HealthService is unable to parse configuration XML. |
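When triaging many alerts, the reason codes above can be kept in a lookup table. The following is a minimal Python sketch; the dictionary contents are copied from the table, while the names `REASON_CODES` and `describe_reason` are hypothetical and not part of Operations Manager.

```python
# Reason codes and descriptions copied from the table above.
REASON_CODES = {
    17: "The HealthService Windows service is paused.",
    25: "The HealthService Action Account is misconfigured or has invalid credentials.",
    41: "The HealthService failed to parse the new configuration.",
    42: "The HealthService failed to load the new configuration.",
    43: "A System Rule failed to load.",
    49: "Collection of Object State Change Events is stalled.",
    50: "Collection of Monitor State Change Events is stalled.",
    51: "Collection of Alerts is stalled.",
    97: ("The HealthService is unable to register with the Event Log Service. "
         "The HealthService cannot log additional Heartbeat and Connector events."),
    98: "The HealthService is unable to parse configuration XML.",
}


def describe_reason(code: int) -> str:
    """Return the description for a reason code, or a fallback for codes not listed."""
    return REASON_CODES.get(code, f"Unknown reason code {code}")
```

Codes that are not in the table fall through to a generic message rather than raising an error, since the list above may not be exhaustive.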
Property | Value |
---|---|
Target | Microsoft.SystemCenter.ManagementServer |
Category | Alert |
Enabled | True |
Event_ID | 271828 |
Event Source | Health Service Watcher |
Alert Generate | True |
Alert Severity | Warning |
Alert Priority | Normal |
Remotable | True |
Alert Message | |
Event Log | Operations Manager |
Comment | Rule used by Data Access service to generate alerts from each Management Server when the RHS is down |
ID | Module Type | TypeId | RunAs |
---|---|---|---|
EventDS | DataSource | Microsoft.Windows.EventProvider | Default |
GenerateAlert | WriteAction | System.Health.GenerateAlert | Default |
<Rule ID="Microsoft.SystemCenter.SDKService.PrincipleManagementServerDown.Alert" Comment="Rule used by Data Access service to generate alerts from each Management Server when the RHS is down" Enabled="true" Target="SCLibrary!Microsoft.SystemCenter.ManagementServer" ConfirmDelivery="true" Remotable="true" Priority="High" DiscardLevel="100">
  <Category>Alert</Category>
  <DataSources>
    <DataSource ID="EventDS" TypeID="Windows!Microsoft.Windows.EventProvider">
      <ComputerName>$Target/Host/Property[Type="Windows!Microsoft.Windows.Computer"]/NetworkName$</ComputerName>
      <LogName>Operations Manager</LogName>
      <Expression>
        <And>
          <Expression>
            <SimpleExpression>
              <ValueExpression>
                <XPathQuery>PublisherName</XPathQuery>
              </ValueExpression>
              <Operator>Equal</Operator>
              <ValueExpression>
                <Value>Health Service Watcher</Value>
              </ValueExpression>
            </SimpleExpression>
          </Expression>
          <Expression>
            <SimpleExpression>
              <ValueExpression>
                <XPathQuery>EventDisplayNumber</XPathQuery>
              </ValueExpression>
              <Operator>Equal</Operator>
              <ValueExpression>
                <Value>271828</Value>
              </ValueExpression>
            </SimpleExpression>
          </Expression>
        </And>
      </Expression>
    </DataSource>
  </DataSources>
  <WriteActions>
    <WriteAction ID="GenerateAlert" TypeID="Health!System.Health.GenerateAlert">
      <Priority>1</Priority>
      <Severity>1</Severity>
      <AlertName/>
      <AlertDescription/>
      <AlertMessageId>$MPElement[Name='Microsoft.SystemCenter.SDKService.PrincipleManagementServerDown.Alert.AlertMessage']$</AlertMessageId>
      <AlertParameters>
        <AlertParameter1>$Data/EventDescription$</AlertParameter1>
      </AlertParameters>
      <Suppression>
        <SuppressionValue/>
      </Suppression>
    </WriteAction>
  </WriteActions>
</Rule>
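The data source filter in the rule above matches events whose `PublisherName` is `Health Service Watcher` and whose `EventDisplayNumber` is `271828`. The Python sketch below evaluates that same `<Expression>` against a plain dictionary, to make the matching logic concrete. It is an illustration only, not how the HealthService evaluates expressions: it assumes an event can be represented as a flat dictionary keyed by the `XPathQuery` names, and it handles only the `And` and `Equal` constructs used in this rule. The function names `resolve` and `evaluate` are hypothetical.

```python
import xml.etree.ElementTree as ET

# The <Expression> element copied from the rule's data source configuration.
EXPRESSION_XML = """
<Expression>
  <And>
    <Expression>
      <SimpleExpression>
        <ValueExpression><XPathQuery>PublisherName</XPathQuery></ValueExpression>
        <Operator>Equal</Operator>
        <ValueExpression><Value>Health Service Watcher</Value></ValueExpression>
      </SimpleExpression>
    </Expression>
    <Expression>
      <SimpleExpression>
        <ValueExpression><XPathQuery>EventDisplayNumber</XPathQuery></ValueExpression>
        <Operator>Equal</Operator>
        <ValueExpression><Value>271828</Value></ValueExpression>
      </SimpleExpression>
    </Expression>
  </And>
</Expression>
""".strip()


def resolve(value_expr, event):
    """Resolve a <ValueExpression> to a string: an event field or a literal."""
    query = value_expr.find("XPathQuery")
    if query is not None:
        # Assumption: the event is a flat dict keyed by the query name.
        return str(event.get(query.text, ""))
    return value_expr.find("Value").text


def evaluate(expr, event):
    """Evaluate an <Expression> element against an event dictionary."""
    child = expr[0]
    if child.tag == "And":
        # Every nested <Expression> must match.
        return all(evaluate(sub, event) for sub in child)
    if child.tag == "SimpleExpression":
        left, right = child.findall("ValueExpression")
        operator = child.find("Operator").text
        if operator != "Equal":
            raise ValueError(f"Operator not handled in this sketch: {operator}")
        return resolve(left, event) == resolve(right, event)
    raise ValueError(f"Element not handled in this sketch: {child.tag}")


expression = ET.fromstring(EXPRESSION_XML)
sample_event = {"PublisherName": "Health Service Watcher", "EventDisplayNumber": 271828}
matched = evaluate(expression, sample_event)
```

With the sample event shown, `matched` is `True`; changing either field makes the `And` fail, so no alert would be generated for that event.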