Dell Remote Access : Server : Temperature is in critical state

Dell.iDRAC7.SNMPTrap.2161 (Rule)

Knowledge Base article:

Summary

Temperature critical state alert

Causes

A temperature reading has generated a critical alert. Probable causes and their corresponding resolutions are:

Cause: The system board <name> temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: The system board <name> temperature is greater than the upper critical threshold.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The system board <name> temperature is outside of range.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The memory module <number> temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: The memory module <number> temperature is greater than the upper critical threshold.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The memory module <number> temperature is outside of range.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The <name> temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: The <name> temperature is greater than the upper critical threshold.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The <name> temperature is outside of range.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The system inlet temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: The system inlet temperature is greater than the upper critical threshold.
Resolution: Check system operating environment.

Cause: The system inlet temperature is outside of range.
Resolution: Check system operating environment.

Cause: Disk drive bay temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: Disk drive bay temperature is greater than the upper critical threshold.
Resolution: Check system operating environment and review event log for fan failures.

Cause: Disk drive bay temperature is outside of range.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The control panel temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: The control panel temperature is greater than the upper critical threshold.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The control panel temperature is outside of range.
Resolution: Check system operating environment and review event log for fan failures.

Cause: CPU <number> temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: CPU <number> temperature is greater than the upper critical threshold.
Resolution: Check system operating environment, fans, and heatsinks.

Cause: CPU <number> temperature is outside of range.
Resolution: Check system operating environment, fans, and heatsinks.
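The cause and resolution pairs above follow a small pattern that can be sketched as a lookup. This is only an illustration of the listed pairs, not part of the management pack: the function name `suggest_resolution` and the constant names are hypothetical, and the sketch assumes the alert text uses the same wording as the causes listed here.

```python
import re

# Resolution strings copied from the cause/resolution pairs in this article.
ENVIRONMENT = "Check system operating environment."
FAN_LOG = "Check system operating environment and review event log for fan failures."
HEATSINK = "Check system operating environment, fans, and heatsinks."


def suggest_resolution(message: str) -> str:
    """Return the resolution this article pairs with a temperature alert message.

    Hypothetical helper: it assumes the message wording matches the listed causes.
    """
    text = message.lower()
    # Every "less than the lower critical threshold" cause has the same resolution.
    if "less than the lower critical threshold" in text:
        return ENVIRONMENT
    # The remaining causes (upper threshold, outside of range) depend on the sensor.
    if "system inlet" in text:
        return ENVIRONMENT
    if re.match(r"\s*cpu\b", text):
        return HEATSINK
    # All other listed sensors share the fan-failure resolution.
    return FAN_LOG


if __name__ == "__main__":
    print(suggest_resolution("CPU 1 temperature is outside of range."))
```

Text that matches none of the listed causes also falls through to the fan-failure resolution, so a caller would need to confirm first that the message is a temperature alert.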

Resolutions

Additional information on this issue may be available. Launch the DRAC or OMSA Console to debug further.

Element properties:

Target: Dell.RemoteAccess.iDRAC7
Category: AvailabilityHealth
Enabled: True
Alert Generate: True
Alert Severity: Error
Alert Priority: Normal
Remotable: True
Alert Message:
Dell Remote Access : Server : Temperature is in critical state
{0}
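The `{0}` placeholder in the alert message is filled at alert time. A minimal sketch of that substitution follows, assuming `{0}` maps to `AlertParameter1`, which the rule source sets to the fourth SNMP variable binding. The function name `render_alert` and the sample values are hypothetical.

```python
# Alert message text from this article; {0} is the placeholder for the trap message.
ALERT_TEMPLATE = (
    "Dell Remote Access : Server : Temperature is in critical state\n{0}"
)


def render_alert(varbinds: list) -> str:
    """Fill the alert template from a list of varbind values in trap order.

    Assumption: {0} is AlertParameter1, i.e. SnmpVarBind[4]. The rule's
    expressions count from 1, so that value is varbinds[3] in Python.
    """
    return ALERT_TEMPLATE.format(varbinds[3])


if __name__ == "__main__":
    # Hypothetical varbind values, for illustration only.
    sample = [
        "vb1",
        "vb2",
        "message-id",
        "The system inlet temperature is greater than the upper critical threshold.",
    ]
    print(render_alert(sample))
```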

Member Modules:

ID     Module Type  TypeId                                          RunAs
DS     DataSource   System.NetworkManagement.SnmpTrapEventProvider  Default
Alert  WriteAction  System.Health.GenerateAlert                     Default

Source Code:

<Rule ID="Dell.iDRAC7.SNMPTrap.2161" Enabled="true" Target="DAD!Dell.RemoteAccess.iDRAC7" ConfirmDelivery="false" Remotable="true" Priority="Normal" DiscardLevel="100">
  <Category>AvailabilityHealth</Category>
  <DataSources>
    <DataSource ID="DS" TypeID="Node!System.NetworkManagement.SnmpTrapEventProvider">
      <IP>$Target/Property[Type="DAD!Dell.RemoteAccess.RAC"]/IPAddress$</IP>
      <OIDProps>
        <OIDProp>.1.3.6.1.4.1.674.10892.5.3.2.1.0.2161</OIDProp>
      </OIDProps>
      <EventOriginId>$Target/Id$</EventOriginId>
      <PublisherId>$Target/Id$</PublisherId>
      <PublisherName>iDRAC</PublisherName>
      <Channel>SnmpEvent</Channel>
      <LoggingComputer/>
      <EventNumber>2161</EventNumber>
      <EventCategory>5</EventCategory>
      <EventLevel>10</EventLevel>
      <UserName/>
      <Params/>
    </DataSource>
  </DataSources>
  <WriteActions>
    <WriteAction ID="Alert" TypeID="SystemHealth!System.Health.GenerateAlert">
      <Priority>1</Priority>
      <Severity>2</Severity>
      <AlertName/>
      <AlertDescription/>
      <AlertOwner/>
      <AlertMessageId>$MPElement[Name="Dell.iDRAC7.SNMPTrap.2161.Rule"]$</AlertMessageId>
      <AlertParameters>
        <AlertParameter1>$Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[4]/Value$</AlertParameter1>
      </AlertParameters>
      <Suppression>
        <SuppressionValue>$Data/EventDisplayNumber$</SuppressionValue>
        <SuppressionValue>$Data/Channel$</SuppressionValue>
        <SuppressionValue>$Data/PublisherName$</SuppressionValue>
        <SuppressionValue>$Data/LoggingComputer$</SuppressionValue>
        <SuppressionValue>$Data/EventCategory$</SuppressionValue>
        <SuppressionValue>$Data/EventLevel$</SuppressionValue>
        <SuppressionValue>$Data/UserName$</SuppressionValue>
        <SuppressionValue>$Data/EventNumber$</SuppressionValue>
        <SuppressionValue>$Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[3]/Value$</SuppressionValue>
        <SuppressionValue>$Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[4]/Value$</SuppressionValue>
        <SuppressionValue>$Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[6]/Value$</SuppressionValue>
        <SuppressionValue>$Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[8]/Value$</SuppressionValue>
      </Suppression>
      <Custom1>Alert Message ID = $Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[3]/Value$ </Custom1>
      <Custom2>Alert Message = $Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[4]/Value$ </Custom2>
      <Custom3>Alert Status = $Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[5]/Value$ </Custom3>
      <Custom4>Alert Service Tag = $Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[6]/Value$ </Custom4>
      <Custom5>Alert FQDN = $Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[7]/Value$ </Custom5>
      <Custom6>Alert FQDD = $Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[8]/Value$ </Custom6>
      <Custom7/>
      <Custom8/>
      <Custom9/>
      <Custom10/>
    </WriteAction>
  </WriteActions>
</Rule>
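The `Suppression` block in the rule lists the values used to decide whether a new trap repeats an existing alert. The sketch below models that as a grouping key in Python. It is a simplified illustration, not the Operations Manager implementation: the event dictionary layout and the names `suppression_key` and `count_alerts` are hypothetical, while the field names and varbind positions are taken from the rule above.

```python
from collections import Counter

# Event fields listed as SuppressionValue entries in the rule.
EVENT_FIELDS = (
    "EventDisplayNumber", "Channel", "PublisherName", "LoggingComputer",
    "EventCategory", "EventLevel", "UserName", "EventNumber",
)
# 1-based varbind positions used for suppression: message ID, message,
# service tag, and FQDD (labels taken from the Custom1..Custom6 fields).
SUPPRESSION_VARBINDS = (3, 4, 6, 8)


def suppression_key(event: dict) -> tuple:
    """Build the tuple of values that identifies a repeat of the same alert.

    Hypothetical event layout: plain fields plus a 'varbinds' list in trap order.
    """
    fields = tuple(event.get(name, "") for name in EVENT_FIELDS)
    binds = tuple(event["varbinds"][i - 1] for i in SUPPRESSION_VARBINDS)
    return fields + binds


def count_alerts(events: list) -> Counter:
    """Group events by suppression key; each key stands for one alert."""
    return Counter(suppression_key(e) for e in events)
```

Under this model, two traps that differ only in varbind 5 (status) or varbind 7 (FQDN) share a key and would count as repeats of one alert, while a different FQDD in varbind 8 produces a separate alert.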