Monitor that checks whether the work item statistics request process recovered within a given period of time after a failure
The System Center Management Configuration Service has failed to obtain work item statistics for a period of time.
The purpose of this monitor is to determine whether the Configuration Service is performing correctly. The monitor detects this situation using the Configuration Service's event 29183, which is triggered when the Configuration Service has been unable to retrieve information on the state of its work items from the database over a period of time.
Green State: The Configuration Service is able to determine the state of each work item type.
Red State: The Configuration Service failed multiple consecutive times to determine the state of a work item. The monitor will retry; see the causes below. See the “Work item statistics request process state” monitor for more information. An alert is generated when the monitor enters this state.
This can happen when:
The database or database server is unavailable (networking issue, firewall, disk space, etc.).
The management server's Default Action Account no longer has the required access to the database.
To further investigate the issue, consider the following:
Review the Operations Manager event log for errors indicating problems with the System Center Management Configuration Service. Filter the event log by the source “OpsMgr Management Configuration” to search for errors.
Confirm that the Operations Manager event log on the management server shows no connection errors to the Operations Manager database.
Using the Operations Manager Console and SQL Server Management Studio, validate that the Default Action Account has the correct access to the Operations Manager database. For more information about configuring the Default Action Account, see the Operations Manager Security Guide.
Target | Microsoft.SystemCenter.ManagementConfigurationServiceGroup
Parent Monitor | Microsoft.SystemCenter.ManagementConfigurationServiceGroup.WorkItemsHealthQueryProcessMonitor
Category | StateCollection
Enabled | True
Alert Generate | True
Alert Severity | Error
Alert Priority | Normal
Alert Auto Resolve | True
Monitor Type | Microsoft.SystemCenter.ManagementConfigurationService.2SingleEventLogTimed2StateUnitMonitorType
Remotable | True
Accessibility | Public
Alert Message |
RunAs | Default
<UnitMonitor ID="Microsoft.SystemCenter.ManagementConfigurationServiceGroup.WorkItemStatsGet.SecondChanceMonitor"
             Accessibility="Public"
             Enabled="true"
             Target="Microsoft.SystemCenter.ManagementConfigurationServiceGroup"
             ParentMonitorID="Microsoft.SystemCenter.ManagementConfigurationServiceGroup.WorkItemsHealthQueryProcessMonitor"
             Remotable="true"
             Priority="Normal"
             TypeID="Microsoft.SystemCenter.ManagementConfigurationService.2SingleEventLogTimed2StateUnitMonitorType"
             ConfirmDelivery="true">
  <Category>StateCollection</Category>
  <AlertSettings AlertMessage="Microsoft.SystemCenter.ManagementConfigurationServiceGroup.WorkItemStatsGet.SecondChanceMonitor.AlertMessage">
    <AlertOnState>Error</AlertOnState>
    <AutoResolve>true</AutoResolve>
    <AlertPriority>Normal</AlertPriority>
    <AlertSeverity>Error</AlertSeverity>
    <AlertParameters>
      <AlertParameter1>$Data/Context/Item0Context/DataItem/EventDescription$</AlertParameter1>
    </AlertParameters>
  </AlertSettings>
  <OperationalStates>
    <OperationalState ID="Good" MonitorTypeStateID="SecondEventRaised" HealthState="Success"/>
    <OperationalState ID="Bad" MonitorTypeStateID="FirstEventRaisedAndNoSecondEventRaisedInTimeInterval" HealthState="Error"/>
  </OperationalStates>
  <Configuration>
    <FirstEventId>29183</FirstEventId>
    <SecondEventId>29182</SecondEventId>
    <TimeIntervalSeconds>300</TimeIntervalSeconds>
  </Configuration>
</UnitMonitor>
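The interaction of FirstEventId, SecondEventId and TimeIntervalSeconds in the configuration above can be illustrated with a small simulation. This is only a sketch of the behavior suggested by the operational state names (SecondEventRaised, FirstEventRaisedAndNoSecondEventRaisedInTimeInterval); it is not the monitor type's actual implementation. The function name evaluate_state, the (timestamp, event ID) tuple format, the "Good" starting state, and the choice not to restart a running timer on a repeated first event are all assumptions made for illustration.

```python
from typing import Iterable, Optional, Tuple

# Values taken from the <Configuration> element of the monitor.
FIRST_EVENT_ID = 29183         # failure event
SECOND_EVENT_ID = 29182        # recovery event
TIME_INTERVAL_SECONDS = 300    # time allowed for recovery


def evaluate_state(events: Iterable[Tuple[float, int]], now: float) -> str:
    """Return "Good" or "Bad" for a list of (timestamp_seconds, event_id).

    Hypothetical helper: assumes the monitor starts healthy and that a
    repeated first event does not restart a timer that is already running.
    """
    state = "Good"
    pending_since: Optional[float] = None  # when the failure timer started

    for timestamp, event_id in sorted(events):
        # A running timer that expired before this event turns the state Bad.
        if pending_since is not None and timestamp - pending_since >= TIME_INTERVAL_SECONDS:
            state = "Bad"
            pending_since = None

        if event_id == SECOND_EVENT_ID:
            # Recovery event: back to Good, cancel any running timer.
            state = "Good"
            pending_since = None
        elif event_id == FIRST_EVENT_ID and pending_since is None:
            # Failure event: start waiting for the recovery event.
            pending_since = timestamp

    # Check whether a timer that is still running has expired by "now".
    if pending_since is not None and now - pending_since >= TIME_INTERVAL_SECONDS:
        state = "Bad"
    return state


if __name__ == "__main__":
    # Failure at t=0, recovery at t=120: recovered inside the interval.
    print(evaluate_state([(0, 29183), (120, 29182)], now=600))  # Good
    # Failure at t=0 and no recovery within 300 seconds.
    print(evaluate_state([(0, 29183)], now=300))                # Bad
```

Under these assumptions, a single event 29183 does not change the health state immediately; the state only becomes Bad if event 29182 does not follow within 300 seconds, and a later event 29182 returns the monitor to Good.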