Service Monitoring: CCP Management service is not running

Service_Monitoring__CCP_Management_service_is_not_running_1_Rule.AdvancedAlertCriteriaMonitor (UnitMonitor)

Knowledge Base article:

Management Pack
Summary

This alert is generated when the Microsoft Compute Cluster Management service has stopped running.

When this service is not running on a head node, then the entire cluster management infrastructure will not function. No configuration changes will be propagated from the head node to the cluster, and no status updates or job statistics will be accepted from the nodes.

When this service is not running on a compute node, changes to the cluster will not be propagated to that node, and status and job statistics will not be reported to the Compute Cluster Server job scheduler on the cluster head node.

Jobs that are already running on compute nodes will continue to run as usual.

 
Causes

This error can be caused by any of the following:

  • The Microsoft Compute Cluster Management service encountered an error and had to stop running.
  • The Microsoft Compute Cluster Management service was intentionally or accidentally disabled.
  • Group Policy does not allow this service to start.
 
Resolutions

To troubleshoot and fix this problem:

  1. Service Control Manager produces an error event if a service is terminated unexpectedly. Start Event Viewer on the affected computer and check for any system events from Service Control Manager, or application events from the Microsoft Compute Cluster Management service (CcpManagement). Resolve any errors reported by this event.
  2. Restart the service on the affected node.
  3. If the service cannot be restarted, contact the network domain administrator to make sure this service is not disabled by domain policy.
  4. If none of above solves the problem, uninstall and install Compute Cluster Pack on the affected computer.
 
© 2006 Microsoft Corporation, all rights reserved.

Element properties:

TargetMicrosoft.Windows.Server.ComputeCluster.2003.Head_Node_Class
Parent MonitorCCPManagement
CategoryStateCollection
EnabledTrue
Alert GenerateTrue
Alert SeverityMatchMonitorHealth
Alert PriorityNormal
Alert Auto ResolveTrue
Monitor TypeSystem.Mom.BackwardCompatibility.StateAlert.RuleGenerated.AdvancedRuleCriteriaMonitor
RemotableTrue
AccessibilityInternal
Alert Message
Service Monitoring: CCP Management service is not running
{1}
RunAsDefault
CommentMom2005ID='{27EA9096-EDCC-4230-9A31-E0AC5698502C}'

Source Code:

<UnitMonitor ID="Service_Monitoring__CCP_Management_service_is_not_running_1_Rule.AdvancedAlertCriteriaMonitor" TypeID="MomBackwardCompatibility!System.Mom.BackwardCompatibility.StateAlert.RuleGenerated.AdvancedRuleCriteriaMonitor" Accessibility="Internal" Target="Microsoft.Windows.Server.ComputeCluster.2003.Head_Node_Class" Enabled="true" ParentMonitorID="CCPManagement" Comment="Mom2005ID='{27EA9096-EDCC-4230-9A31-E0AC5698502C}'">
<Category>StateCollection</Category>
<AlertSettings AlertMessage="Service_Monitoring__CCP_Management_service_is_not_running_1_Rule.AdvancedAlertCriteriaMonitor.StringResource">
<AlertOnState>Warning</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>Normal</AlertPriority>
<AlertSeverity>MatchMonitorHealth</AlertSeverity>
<AlertParameters>
<AlertParameter1>$Data/Context/Name$</AlertParameter1>
<AlertParameter2>$Data/Context/Description$</AlertParameter2>
</AlertParameters>
</AlertSettings>
<OperationalStates>
<OperationalState HealthState="Success" MonitorTypeStateID="Success" ID="AlertLevelSuccess"/>
<OperationalState HealthState="Warning" MonitorTypeStateID="Warning" ID="AlertLevelWarning"/>
<OperationalState HealthState="Error" MonitorTypeStateID="Error" ID="AlertLevelError"/>
</OperationalStates>
<Configuration>
<ServerRole>Head Node</ServerRole>
<Component>CCPManagement</Component>
<ServerRoleInstance>$Target/Property[Type="Microsoft.Windows.Server.ComputeCluster.2003.Head_Node_Class"]/Server_Name$</ServerRoleInstance>
<RuleId>$MPElement[Name="Service_Monitoring__CCP_Management_service_is_not_running_1_Rule"]$</RuleId>
<ServiceUnavailableExpression>
<And>
<Expression>
<SimpleExpression>
<ValueExpression>
<XPathQuery Type="String">AlertContext/DataItem/Params/Param[10]</XPathQuery>
</ValueExpression>
<Operator>Equal</Operator>
<ValueExpression>
<Value Type="Integer">2</Value>
</ValueExpression>
</SimpleExpression>
</Expression>
<Expression>
<Or>
<Expression>
<SimpleExpression>
<ValueExpression>
<XPathQuery Type="String">AlertContext/DataItem/Params/Param[9]</XPathQuery>
</ValueExpression>
<Operator>Equal</Operator>
<ValueExpression>
<Value Type="String">1</Value>
</ValueExpression>
</SimpleExpression>
</Expression>
<Expression>
<SimpleExpression>
<ValueExpression>
<XPathQuery Type="String">AlertContext/DataItem/Params/Param[9]</XPathQuery>
</ValueExpression>
<Operator>Equal</Operator>
<ValueExpression>
<Value Type="String">3</Value>
</ValueExpression>
</SimpleExpression>
</Expression>
</Or>
</Expression>
</And>
</ServiceUnavailableExpression>
<SecurityIssueExpression>
<Not>
<Expression/>
</Not>
</SecurityIssueExpression>
<CriticalErrorExpression>
<Not>
<Expression/>
</Not>
</CriticalErrorExpression>
<ErrorExpression>
<Not>
<Expression/>
</Not>
</ErrorExpression>
<WarningExpression>
<Not>
<Expression/>
</Not>
</WarningExpression>
<SuccessExpression>
<Not>
<Expression/>
</Not>
</SuccessExpression>
<InformationExpression>
<Not>
<Expression/>
</Not>
</InformationExpression>
</Configuration>
</UnitMonitor>