Work item statistics request process failure with no recovery state

Microsoft.SystemCenter.ManagementConfigurationServiceGroup.WorkItemStatsGet.SecondChanceMonitor (UnitMonitor)

Monitor to check whether the work item statistics request process recovered within a given period of time after a failure.

Knowledge Base article:

Summary

The System Center Management Configuration Service has failed to obtain work item statistics for a period of time.

The purpose of this monitor is to determine whether the Configuration Service is performing correctly. It detects the failure through Configuration Service event 29183, which is raised when the Configuration Service has been unable to retrieve information on the state of its work items from the database over a period of time.

Configuration

Green State: The Configuration Service is able to determine the state of each work item type.

Red State: The Configuration Service failed multiple consecutive times to determine the state of a work item. The monitor will retry; see the causes below. See the “Work item statistics request process state” monitor for more information. An alert is generated when the monitor enters this state.

Causes

This can happen when:

Resolutions

To further investigate the issue, consider the following:

Element properties:

Target: Microsoft.SystemCenter.ManagementConfigurationServiceGroup
Parent Monitor: Microsoft.SystemCenter.ManagementConfigurationServiceGroup.WorkItemsHealthQueryProcessMonitor
Category: StateCollection
Enabled: True
Alert Generate: True
Alert Severity: Error
Alert Priority: Normal
Alert Auto Resolve: True
Monitor Type: Microsoft.SystemCenter.ManagementConfigurationService.2SingleEventLogTimed2StateUnitMonitorType
Remotable: True
Accessibility: Public
Alert Message:
Management Configuration Service group failed to obtain work item statistics for a period of time
Management Configuration Service group failed to obtain work item statistics for a period of time. {0}
RunAs: Default
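
The `{0}` placeholder in the alert message is filled from the monitor's first alert parameter, which the source code maps to the event description. The following Python snippet is only an illustrative sketch of that substitution, not how Operations Manager performs it. The function name `render_alert_description` and the sample event text are hypothetical.

```python
# Alert text as listed in the element properties; {0} is the first alert parameter.
ALERT_DESCRIPTION = (
    "Management Configuration Service group failed to obtain work item "
    "statistics for a period of time. {0}"
)


def render_alert_description(event_description: str) -> str:
    """Substitute the event description for the {0} placeholder.

    Hypothetical helper: it only mimics positional substitution and is not
    part of any Operations Manager API.
    """
    return ALERT_DESCRIPTION.format(event_description)


if __name__ == "__main__":
    # The event text below is made up for illustration.
    print(render_alert_description("Sample event description text."))
```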

Source Code:

<UnitMonitor ID="Microsoft.SystemCenter.ManagementConfigurationServiceGroup.WorkItemStatsGet.SecondChanceMonitor" Accessibility="Public" Enabled="true" Target="Microsoft.SystemCenter.ManagementConfigurationServiceGroup" ParentMonitorID="Microsoft.SystemCenter.ManagementConfigurationServiceGroup.WorkItemsHealthQueryProcessMonitor" Remotable="true" Priority="Normal" TypeID="Microsoft.SystemCenter.ManagementConfigurationService.2SingleEventLogTimed2StateUnitMonitorType" ConfirmDelivery="true">
  <Category>StateCollection</Category>
  <AlertSettings AlertMessage="Microsoft.SystemCenter.ManagementConfigurationServiceGroup.WorkItemStatsGet.SecondChanceMonitor.AlertMessage">
    <AlertOnState>Error</AlertOnState>
    <AutoResolve>true</AutoResolve>
    <AlertPriority>Normal</AlertPriority>
    <AlertSeverity>Error</AlertSeverity>
    <AlertParameters>
      <AlertParameter1>$Data/Context/Item0Context/DataItem/EventDescription$</AlertParameter1>
    </AlertParameters>
  </AlertSettings>
  <OperationalStates>
    <OperationalState ID="Good" MonitorTypeStateID="SecondEventRaised" HealthState="Success"/>
    <OperationalState ID="Bad" MonitorTypeStateID="FirstEventRaisedAndNoSecondEventRaisedInTimeInterval" HealthState="Error"/>
  </OperationalStates>
  <Configuration>
    <FirstEventId>29183</FirstEventId>
    <SecondEventId>29182</SecondEventId>
    <TimeIntervalSeconds>300</TimeIntervalSeconds>
  </Configuration>
</UnitMonitor>
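
The configuration above pairs a first event (29183) with a second event (29182) and a 300-second interval. The Python sketch below shows one way such a timed two-event monitor could be reasoned about. Its semantics are inferred only from the state names `SecondEventRaised` and `FirstEventRaisedAndNoSecondEventRaisedInTimeInterval`, so treat it as an assumption rather than the actual monitor type implementation. The function `evaluate` and its handling of repeated first events are hypothetical.

```python
from typing import Iterable, List, Optional, Tuple

FIRST_EVENT_ID = 29183       # FirstEventId: statistics request failed
SECOND_EVENT_ID = 29182      # SecondEventId: assumed here to signal recovery
TIME_INTERVAL_SECONDS = 300  # TimeIntervalSeconds


def evaluate(events: Iterable[Tuple[int, int]], now: int) -> List[Tuple[int, str]]:
    """Return (timestamp, health_state) transitions for time-ordered events.

    events: (timestamp_in_seconds, event_id) pairs sorted by timestamp.
    now:    current time, used to expire a timer that is still running.

    Assumption: a first event starts the timer only if none is running, and
    the second event always reports Success.
    """
    pending_since: Optional[int] = None  # when the running timer started
    transitions: List[Tuple[int, str]] = []

    for timestamp, event_id in events:
        # If the timer ran out before this event arrived, report Error first.
        if pending_since is not None and timestamp - pending_since >= TIME_INTERVAL_SECONDS:
            transitions.append((pending_since + TIME_INTERVAL_SECONDS, "Error"))
            pending_since = None

        if event_id == FIRST_EVENT_ID:
            if pending_since is None:
                pending_since = timestamp
        elif event_id == SECOND_EVENT_ID:
            pending_since = None
            transitions.append((timestamp, "Success"))

    # A timer still running at `now` may also have expired.
    if pending_since is not None and now - pending_since >= TIME_INTERVAL_SECONDS:
        transitions.append((pending_since + TIME_INTERVAL_SECONDS, "Error"))

    return transitions


if __name__ == "__main__":
    # Failure at t=0 with no recovery event inside 300 seconds.
    print(evaluate([(0, FIRST_EVENT_ID)], now=400))
```

In this sketch, a failure followed by event 29182 inside the interval produces only a Success transition. A failure with no follow-up produces an Error transition at the moment the interval elapses.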