NameNode Memory Heap Usage

Ambari.SCOM.UnitMonitor.NameNodeMemoryHeapUsed (UnitMonitor)

Gives warning/critical alert if percentage of used NameNode memory heap is more than lower/upper threshold.

Knowledge Base article:

Summary

Gives warning/critical alert if percentage of used NameNode memory heap is more than lower/upper threshold.

Causes

TBD

Resolutions

TBD

Element properties:

TargetAmbari.SCOM.HostComponent.NameNode
Parent MonitorSystem.Health.PerformanceState
CategoryPerformanceHealth
EnabledTrue
Alert GenerateTrue
Alert SeverityMatchMonitorHealth
Alert PriorityNormal
Alert Auto ResolveTrue
Monitor TypeAmbari.SCOM.UnitMonitorType.HostComponentThreeStateThreshold
RemotableTrue
AccessibilityPublic
Alert Message
NameNode is working under high memory pressure.
There are {1}\% of committed heap memory used by NameNode of cluster "{0}".
RunAsDefault

Source Code:

<UnitMonitor ID="Ambari.SCOM.UnitMonitor.NameNodeMemoryHeapUsed" TypeID="Ambari.SCOM.UnitMonitorType.HostComponentThreeStateThreshold" Target="Ambari.SCOM.HostComponent.NameNode" ParentMonitorID="Health!System.Health.PerformanceState" Remotable="true" Priority="Normal" Accessibility="Public" Enabled="true" ConfirmDelivery="true">
<Category>PerformanceHealth</Category>
<AlertSettings AlertMessage="Ambari.SCOM.UnitMonitor.NameNodeMemoryHeapUsed.AlertMessage">
<AlertOnState>Warning</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>Normal</AlertPriority>
<AlertSeverity>MatchMonitorHealth</AlertSeverity>
<AlertParameters>
<AlertParameter1>$Target/Host/Host/Property[Type="Ambari.SCOM.Host.Private"]/ClusterName$</AlertParameter1>
<AlertParameter2>$Data/Context/Property[@Name='calculated.memheapusedpercent']$</AlertParameter2>
</AlertParameters>
</AlertSettings>
<OperationalStates>
<OperationalState ID="Healthy" MonitorTypeStateID="Healthy" HealthState="Success"/>
<OperationalState ID="Warning" MonitorTypeStateID="Warning" HealthState="Warning"/>
<OperationalState ID="Critical" MonitorTypeStateID="Critical" HealthState="Error"/>
</OperationalStates>
<Configuration>
<IntervalSeconds>900</IntervalSeconds>
<TimeoutSeconds>300</TimeoutSeconds>
<PropertyName>calculated.memheapusedpercent</PropertyName>
<TheGreaterTheBetter>false</TheGreaterTheBetter>
<WarningThreshold>80</WarningThreshold>
<CriticalThreshold>90</CriticalThreshold>
</Configuration>
</UnitMonitor>