Monitor Node Health

AP.Isilon.Monitor.Node.Health (UnitMonitor)

Monitors Node Health via SNMP

Knowledge Base article:

Summary

Uses SNMP to get health state of the Isilon Node.

Health States

SNMP Information

SNMP OIDs Used;

Element properties:

TargetAP.Isilon.Node
Parent MonitorSystem.Health.AvailabilityState
CategoryAvailabilityHealth
EnabledTrue
Alert GenerateTrue
Alert SeverityMatchMonitorHealth
Alert PriorityNormal
Alert Auto ResolveTrue
Monitor TypeAP.Isilon.Monitor.Node.Health.SNMP
RemotableTrue
AccessibilityPublic
Alert Message
Monitor Node Health
{0} has change HealthState, please log into the Isilon Cluster to view more info.
{0}
RunAsDefault

Source Code:

<UnitMonitor ID="AP.Isilon.Monitor.Node.Health" Accessibility="Public" Enabled="true" Target="AP.Isilon.Node" ParentMonitorID="Health!System.Health.AvailabilityState" Remotable="true" Priority="Normal" TypeID="AP.Isilon.Monitor.Node.Health.SNMP" ConfirmDelivery="false">
<Category>AvailabilityHealth</Category>
<AlertSettings AlertMessage="AP.Isilon.Monitor.Node.Health.AlertMessage">
<AlertOnState>Warning</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>Normal</AlertPriority>
<AlertSeverity>MatchMonitorHealth</AlertSeverity>
<AlertParameters>
<AlertParameter1>$Target/Property[Type="AP.Isilon.Node"]/Name$</AlertParameter1>
</AlertParameters>
</AlertSettings>
<OperationalStates>
<OperationalState ID="NodeOkay" MonitorTypeStateID="NodeOkay" HealthState="Success"/>
<OperationalState ID="NodeWarning" MonitorTypeStateID="NodeWarning" HealthState="Warning"/>
<OperationalState ID="NodeDown" MonitorTypeStateID="NodeDown" HealthState="Error"/>
</OperationalStates>
<Configuration>
<IntervalSeconds>300</IntervalSeconds>
<IP>$Target/Property[Type="AP.Isilon.Node"]/IPAddress$</IP>
<NoOfRetries>3</NoOfRetries>
<Timeout>120</Timeout>
<Community>$Target/Host/Property[Type="AP.Isilon.Cluster"]/Community$</Community>
</Configuration>
</UnitMonitor>