Dell OM : Temperature is in critical state (1)

Dell.ManagedServer.Alert.2161 (Rule)

Knowledge Base article:

Summary

Temperature critical state alert

Causes

A temperature probe has generated a critical alert. The probable causes of this condition and their corresponding resolutions are:

Cause: The system board <name> temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: The system board <name> temperature is greater than the upper critical threshold.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The system board <name> temperature is outside of range.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The memory module <number> temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: The memory module <number> temperature is greater than the upper critical threshold.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The memory module <number> temperature is outside of range.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The <name> temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: The <name> temperature is greater than the upper critical threshold.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The <name> temperature is outside of range.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The system inlet temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: The system inlet temperature is greater than the upper critical threshold.
Resolution: Check system operating environment.

Cause: The system inlet temperature is outside of range.
Resolution: Check system operating environment.

Cause: Disk drive bay temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: Disk drive bay temperature is greater than the upper critical threshold.
Resolution: Check system operating environment and review event log for fan failures.

Cause: Disk drive bay temperature is outside of range.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The control panel temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: The control panel temperature is greater than the upper critical threshold.
Resolution: Check system operating environment and review event log for fan failures.

Cause: The control panel temperature is outside of range.
Resolution: Check system operating environment and review event log for fan failures.

Cause: CPU <number> temperature is less than the lower critical threshold.
Resolution: Check system operating environment.

Cause: CPU <number> temperature is greater than the upper critical threshold.
Resolution: Check system operating environment, fans, and heatsinks.

Cause: CPU <number> temperature is outside of range.
Resolution: Check system operating environment, fans, and heatsinks.

Cause: The system is automatically turned off because of insufficient cooling.
Resolution: Replace the fans that have stopped functioning, or add the minimum number of fans recommended. For more information, see the system Owner's Manual available at the support site.

Resolutions

Additional information on this issue may be available. Launch the iDRAC Console to debug further.

Element properties:

Target: Dell.ManagedServer
Category: Alert
Enabled: True
Event_ID: 2161
Event Source: LifeCycle Controller Log
Alert Generate: True
Alert Severity: Error
Alert Priority: Normal
Remotable: True
Alert Message:
    Dell OM : Temperature is in critical state (1)
    Event Description: {0}
Event Log: System

Member Modules:

ID         Module Type  TypeId                                                 RunAs
DS         DataSource   Microsoft.Windows.EventProvider                        Default
Alert      WriteAction  System.Health.GenerateAlert                            Default
WriteToDW  WriteAction  Microsoft.SystemCenter.DataWarehouse.PublishEventData  Default
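
The DS data source only passes an event to the two write actions when it satisfies the filter shown in the rule's source code: the event number must be 2161 and the publisher must be "LifeCycle Controller Log". The following is a minimal Python sketch of that two-condition check, not the agent's actual implementation. The `Event` class and `matches_rule` function are hypothetical names introduced for illustration, and the sketch uses exact string comparison, which may differ from how the monitoring agent compares strings.

```python
from dataclasses import dataclass

# Values taken from the rule's <Expression> block.
EVENT_ID = 2161
PUBLISHER = "LifeCycle Controller Log"


@dataclass(frozen=True)
class Event:
    """Hypothetical stand-in for a record read from the System event log."""
    event_display_number: int
    publisher_name: str
    description: str = ""


def matches_rule(event: Event) -> bool:
    """Mirror the rule's <And>: both simple expressions must be true."""
    return (
        event.event_display_number == EVENT_ID
        and event.publisher_name == PUBLISHER
    )


if __name__ == "__main__":
    hit = Event(2161, "LifeCycle Controller Log", "CPU 1 temperature is outside of range.")
    miss = Event(2161, "Some Other Source")
    print(matches_rule(hit), matches_rule(miss))
```

An event with the right number but a different publisher is rejected, which is why the same event ID logged by another source does not raise this alert.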

Source Code:

<Rule ID="Dell.ManagedServer.Alert.2161" Enabled="true" Target="DellManagedServer!Dell.ManagedServer" ConfirmDelivery="false" Remotable="true" Priority="Normal" DiscardLevel="100">
  <Category>Alert</Category>
  <DataSources>
    <DataSource ID="DS" TypeID="Windows!Microsoft.Windows.EventProvider">
      <ComputerName>$Target/Property[Type="DellManagedServer!Dell.ManagedServer"]/HostName$</ComputerName>
      <LogName>System</LogName>
      <Expression>
        <And>
          <Expression>
            <SimpleExpression>
              <ValueExpression>
                <XPathQuery Type="UnsignedInteger">EventDisplayNumber</XPathQuery>
              </ValueExpression>
              <Operator>Equal</Operator>
              <ValueExpression>
                <Value Type="UnsignedInteger">2161</Value>
              </ValueExpression>
            </SimpleExpression>
          </Expression>
          <Expression>
            <SimpleExpression>
              <ValueExpression>
                <XPathQuery Type="String">PublisherName</XPathQuery>
              </ValueExpression>
              <Operator>Equal</Operator>
              <ValueExpression>
                <Value Type="String">LifeCycle Controller Log</Value>
              </ValueExpression>
            </SimpleExpression>
          </Expression>
        </And>
      </Expression>
    </DataSource>
  </DataSources>
  <WriteActions>
    <WriteAction ID="Alert" TypeID="Health!System.Health.GenerateAlert">
      <Priority>1</Priority>
      <Severity>2</Severity>
      <AlertMessageId>$MPElement[Name="Dell.ManagedServer.Alert.2161.Rule"]$</AlertMessageId>
      <AlertParameters>
        <AlertParameter1>$Data/EventDescription$</AlertParameter1>
      </AlertParameters>
      <Suppression>
        <SuppressionValue>$Data/EventDisplayNumber$</SuppressionValue>
        <SuppressionValue>$Data/Channel$</SuppressionValue>
        <SuppressionValue>$Data/PublisherName$</SuppressionValue>
        <SuppressionValue>$Data/LoggingComputer$</SuppressionValue>
        <SuppressionValue>$Data/EventCategory$</SuppressionValue>
        <SuppressionValue>$Data/EventLevel$</SuppressionValue>
        <SuppressionValue>$Data/UserName$</SuppressionValue>
        <SuppressionValue>$Data/EventNumber$</SuppressionValue>
        <SuppressionValue>$Data/EventDescription$</SuppressionValue>
      </Suppression>
      <Custom1/>
      <Custom2/>
      <Custom3/>
      <Custom4/>
      <Custom5/>
      <Custom6/>
      <Custom7/>
      <Custom8/>
      <Custom9/>
      <Custom10/>
    </WriteAction>
    <WriteAction ID="WriteToDW" TypeID="SCDW!Microsoft.SystemCenter.DataWarehouse.PublishEventData"/>
  </WriteActions>
</Rule>
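
The `<Suppression>` block in the Alert write action lists nine event fields. As a rough mental model, events whose values match on every listed field are folded into one alert rather than raising a new one each time. The Python sketch below illustrates that grouping under stated assumptions: events are represented as plain dictionaries keyed by the field names from the rule, and `suppression_key` and `group_alerts` are hypothetical helpers, not part of any Operations Manager API.

```python
from collections import Counter

# Field names copied from the rule's <SuppressionValue> entries.
SUPPRESSION_FIELDS = (
    "EventDisplayNumber",
    "Channel",
    "PublisherName",
    "LoggingComputer",
    "EventCategory",
    "EventLevel",
    "UserName",
    "EventNumber",
    "EventDescription",
)


def suppression_key(event: dict) -> tuple:
    """Build the tuple of values that identifies a duplicate event."""
    return tuple(event.get(field) for field in SUPPRESSION_FIELDS)


def group_alerts(events: list) -> Counter:
    """Map each distinct suppression key to the number of matching events."""
    return Counter(suppression_key(event) for event in events)


if __name__ == "__main__":
    base = {"EventDisplayNumber": 2161, "PublisherName": "LifeCycle Controller Log"}
    cpu = {**base, "EventDescription": "CPU 1 temperature is outside of range."}
    inlet = {**base, "EventDescription": "The system inlet temperature is outside of range."}
    alerts = group_alerts([cpu, cpu, inlet])
    print(len(alerts))  # number of distinct alerts in this model
```

Because EventDescription is one of the suppression fields, events that describe different probes fall into separate groups in this model, while repeats of the same description collapse into a single group with a higher count.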