Cisco Nexus: Fan status Health Monitor

Jalasoft.Xian.SmartManagementPacks.CiscoNexus.FanStatus.1.1 (AggregateMonitor)

This monitor catches Xian events that notifies when the Fan status has changed its state.

Knowledge Base article:

Summary

Fan status has changed.

The fan status has changed to one of the following states: other, ok, minor fault, major fault.

External

www.jalasoft.com/KnowledgeBase

Causes

Ok indicates that the fan components are functional. Other means an unknown status. The other states are caused by any of the following fan failures: if the fan tray has not been reinstalled yet, the power subsystem is not functioning properly, or the fan assembly is not settled in the backplane or has malfunctioned. When a fault the system is present, performance can be reduced if operating with a fan fault or above 30ºC (86ºF).

Resolutions

No action is required for the ok status. For the other states: determine whether or not the power subsystem is functioning properly; to ensure that the fan assembly is properly settled, loosen the captive installation screws, remove the fan assembly and reinstall it; tighten all captive installation screws, and then restart the system. Individual fan failure might require replacement. Check the switch itself, or use the show env user EXEC command to check for a failed fan. If there is an overtemperature condition, make sure that the fan intake and exhaust areas are clear.

Additional

Monitors the status of the fan on a Cisco Nexus.

Element properties:

TargetJalasoft.Xian.Common.Elements.ThirdParty.Cisco.JsXCiscoNexusChassisElement
Parent MonitorSystem.Health.PerformanceState
AlgorithmWorstOf
CategoryStateCollection
EnabledTrue
Alert GenerateTrue
Alert SeverityMatchMonitorHealth
Alert PriorityLow
Alert Auto ResolveTrue
RemotableTrue
AccessibilityPublic
Alert Message
Fan status Alert
{0}

Source Code:

<AggregateMonitor ID="Jalasoft.Xian.SmartManagementPacks.CiscoNexus.FanStatus.1.1" Accessibility="Public" Enabled="true" Target="ThirdParty_Cisco!Jalasoft.Xian.Common.Elements.ThirdParty.Cisco.JsXCiscoNexusChassisElement" ParentMonitorID="SystemHealth!System.Health.PerformanceState" Remotable="true" Priority="Normal">
<Category>StateCollection</Category>
<AlertSettings AlertMessage="FanStatus_AlertMessage">
<AlertOnState>Warning</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>Low</AlertPriority>
<AlertSeverity>MatchMonitorHealth</AlertSeverity>
<AlertParameters>
<AlertParameter1>$Data/Context/Params/Param[3]$</AlertParameter1>
</AlertParameters>
</AlertSettings>
<Algorithm>WorstOf</Algorithm>
</AggregateMonitor>