Dell OM : SEL is in critical state

Dell.ManagedServer.Alert.2377 (Rule)

Knowledge Base article:

Summary

SEL critical state alert

Causes

SEL has generated critical alert. Probable causes and corresponding resolutions for this condition are:

Cause

Resolutions

Logging is disabled.

If unintended, re-enable logging.

All event logging is disabled.

If unintended, re-enable logging.

Log is full.

Backup and clear log.

Could not create or initialize the system event log.

Reboot the management controller or iDRAC. Cycle system input power. If problem persists call support.

An unknown system hardware failure detected.

Reduce system to minimum supported configuration. If issues persists, contact support.

Chassis management controller (CMC) redundancy is lost.

Check CMC network cables and network connections. Check that CMC firmware versions match.

The chassis management controller (CMC) is not redundant. Insufficient resources to maintain normal operations.

Check overall power consumption and power status.

The Chassis Controller is not responding or is not inserted properly. The status of Chassis Controller is critical.

Do one of the following: 1) Reinsert Chassis Controller, if removed. 2) If Chassis Controller is present but is not responding, reset the Chassis Controller. To reset the Chassis Controller using the graphical user interface (GUI), click Chassis Overview > Power > Control > Reset CMC. To reset by running the RACADM command at the command line interface (CLI), enter: "racadm racreset".

The sensor indicating the inlet temperature is not responding either because the sensor is damaged, or because of damaged circuit lines for I2C bus, or a faulty sensor state.

Do the following: 1) Reset iDRAC: a) By using graphical user interface (GUI), click Overview > Server > Quick Launch Tasks > Reset iDRAC. b) by running RACADM command at the command line interface (CLI), enter racadm racreset. 2) Change the corresponding I2C sensor by contacting your service provider.

An I2C sensor is not responding either because it is damaged, or because of damaged circuit lines for I2C bus, or a faulty sensor state.

To find the sensor number that is not responding, type "racadm getsel -E" and check the Event Data Byte 1. To recover the sensor from the faulty state, do one of the following: 1) Reset iDRAC: a) By using graphical user interface (GUI), click Overview > Server > Quick Launch Tasks > Reset iDRAC. b) by running RACADM command at the command line interface (CLI), enter racadm racreset. 2) Change the corresponding I2C sensor by contacting your service provider.

Resolutions

Additional information on this issue may be available. Launch the iDRAC Console to debug further.

Element properties:

TargetDell.ManagedServer
CategoryAlert
EnabledTrue
Event_ID2377
Event SourceLifeCycle Controller Log
Alert GenerateTrue
Alert SeverityError
Alert PriorityNormal
RemotableTrue
Alert Message
Dell OM : SEL is in critical state
Event Description: {0}
Event LogSystem

Member Modules:

ID Module Type TypeId RunAs 
DS DataSource Microsoft.Windows.EventProvider Default
Alert WriteAction System.Health.GenerateAlert Default
WriteToDW WriteAction Microsoft.SystemCenter.DataWarehouse.PublishEventData Default

Source Code:

<Rule ID="Dell.ManagedServer.Alert.2377" Enabled="true" Target="DellManagedServer!Dell.ManagedServer" ConfirmDelivery="false" Remotable="true" Priority="Normal" DiscardLevel="100">
<Category>Alert</Category>
<DataSources>
<DataSource ID="DS" TypeID="Windows!Microsoft.Windows.EventProvider">
<ComputerName>$Target/Property[Type="DellManagedServer!Dell.ManagedServer"]/HostName$</ComputerName>
<LogName>System</LogName>
<Expression>
<And>
<Expression>
<SimpleExpression>
<ValueExpression>
<XPathQuery Type="UnsignedInteger">EventDisplayNumber</XPathQuery>
</ValueExpression>
<Operator>Equal</Operator>
<ValueExpression>
<Value Type="UnsignedInteger">2377</Value>
</ValueExpression>
</SimpleExpression>
</Expression>
<Expression>
<SimpleExpression>
<ValueExpression>
<XPathQuery Type="String">PublisherName</XPathQuery>
</ValueExpression>
<Operator>Equal</Operator>
<ValueExpression>
<Value Type="String">LifeCycle Controller Log</Value>
</ValueExpression>
</SimpleExpression>
</Expression>
</And>
</Expression>
</DataSource>
</DataSources>
<WriteActions>
<WriteAction ID="Alert" TypeID="Health!System.Health.GenerateAlert">
<Priority>1</Priority>
<Severity>2</Severity>
<AlertMessageId>$MPElement[Name="Dell.ManagedServer.Alert.2377.Rule"]$</AlertMessageId>
<AlertParameters>
<AlertParameter1>$Data/EventDescription$</AlertParameter1>
</AlertParameters>
<Suppression>
<SuppressionValue>$Data/EventDisplayNumber$</SuppressionValue>
<SuppressionValue>$Data/Channel$</SuppressionValue>
<SuppressionValue>$Data/PublisherName$</SuppressionValue>
<SuppressionValue>$Data/LoggingComputer$</SuppressionValue>
<SuppressionValue>$Data/EventCategory$</SuppressionValue>
<SuppressionValue>$Data/EventLevel$</SuppressionValue>
<SuppressionValue>$Data/UserName$</SuppressionValue>
<SuppressionValue>$Data/EventNumber$</SuppressionValue>
<SuppressionValue>$Data/EventDescription$</SuppressionValue>
</Suppression>
<Custom1/>
<Custom2/>
<Custom3/>
<Custom4/>
<Custom5/>
<Custom6/>
<Custom7/>
<Custom8/>
<Custom9/>
<Custom10/>
</WriteAction>
<WriteAction ID="WriteToDW" TypeID="SCDW!Microsoft.SystemCenter.DataWarehouse.PublishEventData"/>
</WriteActions>
</Rule>