Dell Remote Access : Server : Watchdog Timer is in critical state

Dell.iDRAC7.SNMPTrap.2233 (Rule)

Knowledge Base article:

Summary

Watchdog Timer critical state alert

Causes

Watchdog Timer has generated critical alert. Probable causes and corresponding resolutions for this condition are:

Cause

Resolutions

The watchdog timer expired.

Check the operating system, application, hardware, and system event log for exception events.

The watchdog timer reset the system.

Check the operating system, application, hardware, and system event log for exception events.

The watchdog timer powered off the system.

Check the operating system, application, hardware, and system event log for exception events.

The watchdog timer power cycled the system.

Check the operating system, application, hardware, and system event log for exception events.

The watchdog timer interrupt was initiated.

Check the operating system, application, hardware, and system event log for exception events.

The BIOS watchdog timer reset the system.

Check the operating system, application, hardware, and system event log for exception events.

The OS watchdog timer reset the system.

Check the operating system, application, hardware, and system event log for exception events.

The OS watchdog timer shutdown the system.

Check the operating system, application, hardware, and system event log for exception events.

The OS watchdog timer powered down the system.

Check the operating system, application, hardware, and system event log for exception events.

The OS watchdog timer powered cycle the system.

Check the operating system, application, hardware, and system event log for exception events.

The OS watchdog timer powered off the system.

Check the operating system, application, hardware, and system event log for exception events.

The OS watchdog timer expired.

Check the operating system, application, hardware, and system event log for exception events.

The OS watchdog timer pre-timeout interrupt was initiated.

Check the operating system, application, hardware, and system event log for exception events.

Resolutions

Additional information on this issue may be available. Launch the DRAC or OMSA Console to debug further.

Element properties:

TargetDell.RemoteAccess.iDRAC7
CategoryAvailabilityHealth
EnabledTrue
Alert GenerateTrue
Alert SeverityError
Alert PriorityNormal
RemotableTrue
Alert Message
Dell Remote Access : Server : Watchdog Timer is in critical state
{0}

Member Modules:

ID Module Type TypeId RunAs 
DS DataSource Dell.SNMPTrap.DSMT Default
Alert WriteAction System.Health.GenerateAlert Default

Source Code:

<Rule ID="Dell.iDRAC7.SNMPTrap.2233" Enabled="true" Target="DAD!Dell.RemoteAccess.iDRAC7" ConfirmDelivery="false" Remotable="true" Priority="Normal" DiscardLevel="100">
<Category>AvailabilityHealth</Category>
<DataSources>
<DataSource ID="DS" TypeID="Dell.SNMPTrap.DSMT">
<IP>$Target/Property[Type="DAD!Dell.RemoteAccess.RAC"]/IPAddress$</IP>
<CommunityString>$Target/Property[Type="DAD!Dell.RemoteAccess.RAC"]/CommunityString$</CommunityString>
<AllTraps>false</AllTraps>
<OIDProps>
<OIDProp>.1.3.6.1.4.1.674.10892.5.3.2.1.0.2233</OIDProp>
</OIDProps>
<EventOriginId>$Target/Id$</EventOriginId>
<PublisherId>$Target/Id$</PublisherId>
<PublisherName>iDRAC</PublisherName>
<Channel>SnmpEvent</Channel>
<LoggingComputer/>
<EventNumber>2233</EventNumber>
<EventCategory>5</EventCategory>
<EventLevel>10</EventLevel>
<UserName/>
<Params/>
</DataSource>
</DataSources>
<WriteActions>
<WriteAction ID="Alert" TypeID="SystemHealth!System.Health.GenerateAlert">
<Priority>1</Priority>
<Severity>2</Severity>
<AlertName/>
<AlertDescription/>
<AlertOwner/>
<AlertMessageId>$MPElement[Name="Dell.iDRAC7.SNMPTrap.2233.Rule"]$</AlertMessageId>
<AlertParameters>
<AlertParameter1>$Data/EventData/DataItem/SnmpVarBinds/SnmpVarBind[5]/Value$</AlertParameter1>
</AlertParameters>
<Suppression>
<SuppressionValue>$Data/EventDisplayNumber$</SuppressionValue>
<SuppressionValue>$Data/Channel$</SuppressionValue>
<SuppressionValue>$Data/PublisherName$</SuppressionValue>
<SuppressionValue>$Data/LoggingComputer$</SuppressionValue>
<SuppressionValue>$Data/EventCategory$</SuppressionValue>
<SuppressionValue>$Data/EventLevel$</SuppressionValue>
<SuppressionValue>$Data/UserName$</SuppressionValue>
<SuppressionValue>$Data/EventNumber$</SuppressionValue>
<SuppressionValue>$Data/EventData/DataItem/Property[@Name="drsAlertMessageID"]$</SuppressionValue>
<SuppressionValue>$Data/EventData/DataItem/Property[@Name="drsAlertFQDD"]$</SuppressionValue>
<SuppressionValue>$Data/EventData/DataItem/Property[@Name="drsAlertCurrentStatus"]$</SuppressionValue>
<SuppressionValue>$Data/EventData/DataItem/Property[@Name="drsSystemServiceTag"]$</SuppressionValue>
</Suppression>
<Custom1>Alert Message ID = $Data/EventData/DataItem/Property[@Name="drsAlertMessageID"]$ </Custom1>
<Custom2>Alert Message = $Data/EventData/DataItem/Property[@Name="drsAlertMessage"]$ </Custom2>
<Custom3>Alert Status = $Data/EventData/DataItem/Property[@Name="drsAlertCurrentStatus"]$ </Custom3>
<Custom4>Alert Service Tag = $Data/EventData/DataItem/Property[@Name="drsSystemServiceTag"]$ </Custom4>
<Custom5>Alert FQDN = $Data/EventData/DataItem/Property[@Name="drsAlertFQDN"]$ </Custom5>
<Custom6>Alert FQDD = $Data/EventData/DataItem/Property[@Name="drsAlertFQDD"]$ </Custom6>
<Custom7/>
<Custom8/>
<Custom9/>
<Custom10/>
</WriteAction>
</WriteActions>
</Rule>