System Error Group Monitor

Microsoft.SystemCenter.CM.AEM.Views.Internal.SystemBucketThreshold (AggregateMonitor)

This monitor indicates detection of a system error via this system error group.

Knowledge Base article:

Summary

This monitor checks for System Errors. An alert is raised when a System Error is detected by the monitor.

Causes

The System Error reported to AEM within the sampling period. This may be caused due to several factors depending on the operating system that experienced this crash. It could also be due to hardware problems on the computer running the operating system.

Resolution

Additional steps that can be taken to diagnose the problem are as follows:

If transmission of Error reports has been turned on, verify if there is a Microsoft solution available. The solution, if available, can provide additional information to resolve the problem.

Review the Error Events view to obtain link to the cab file from the crash in the Persisted Cabs folder. If this is an old Error report, the cab file may have been groomed out.

For those applications that may not get a Microsoft Solution URL response, the crash information would be required by Application support or Application development team

A recent Service Pack, driver update or Quick Fix (QFE) that has been applied to AEM Managed computers could lead to a sudden rise in number of error across these computers. Please check to see if any such software update

Element properties:

TargetMicrosoft.SystemCenter.CM.AEM.SystemBucket
Parent MonitorSystem.Health.AvailabilityState
AlgorithmWorstOf
CategoryAlert
EnabledTrue
Alert GenerateTrue
Alert SeverityError
Alert PriorityHigh
Alert Auto ResolveTrue
RemotableTrue
AccessibilityPublic
Alert Message
System Error Group Monitor
The number of crashes for Machine {0} have exceeded the threshold. The total number of crashes recorded are {1}.

Source Code:

<AggregateMonitor ID="Microsoft.SystemCenter.CM.AEM.Views.Internal.SystemBucketThreshold" Target="AEMLib!Microsoft.SystemCenter.CM.AEM.SystemBucket" Accessibility="Public" Enabled="true" ParentMonitorID="Health!System.Health.AvailabilityState" Remotable="true" Priority="Normal">
<Category>Alert</Category>
<AlertSettings AlertMessage="Microsoft.SystemCenter.CM.AEM.Views.Internal.SystemBucketThreshold.AlertMessage">
<AlertOnState>Error</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>High</AlertPriority>
<AlertParameters>
<AlertParameter1>$Data/Context/Columns/Column[4]$</AlertParameter1>
<AlertParameter2>$Data/Context/Columns/Column[1]$</AlertParameter2>
</AlertParameters>
</AlertSettings>
<Algorithm>WorstOf</Algorithm>
</AggregateMonitor>