Temperature Sensor Unit Monitor

Dell.WindowsServer.ServerTemperatureSensorUnitMonitor (UnitMonitor)

Unit monitor that reports the health of the temperature sensor

Knowledge Base article:

Summary

This health monitor retrieves the state of the Dell server temperature sensor. The status can be one of the following three types:

Success: The temperature sensor has returned to a normal state.

Warning: The temperature sensor is reporting a warning state.

Critical: The temperature sensor has failed.

Causes

Warning: A Warning status may display if a temperature sensor in the specified system is in a throttled state.

Critical: A Critical status may display if a temperature sensor in the specified system has failed, is disabled, has a configuration error, or has experienced a thermal trip. See the product knowledge of the last modified Dell OpenManage alert that caused the state change for additional details.

Resolutions

Perform the following steps for the Warning and Critical messages:

Warning: A Warning message indicates a possible future problem. Based on the impact to operations, plan to replace the temperature sensor in case its status changes to a critical state.

Critical: A Critical message indicates actual or imminent loss of data or loss of function. Replace the failed temperature sensor. See the product knowledge of the last modified Dell OpenManage alert that caused the state change for additional details.

When the issue has been resolved, the status changes to Success in the next polling cycle.
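As a rough illustration of how long "the next polling cycle" can take: the monitor's configuration sets IntervalSeconds to 21600. Assuming that value is the effective polling interval (an override or the shared cook-down data source could change it), a minimal Python sketch of the worst-case wait looks like this. The function name is hypothetical and used only for illustration.

```python
from datetime import timedelta

# IntervalSeconds from this monitor's configuration.
# Assumption: no override changes the effective polling interval.
INTERVAL_SECONDS = 21600


def max_wait_for_state_change(interval_seconds: int = INTERVAL_SECONDS) -> timedelta:
    """Upper bound on the delay before a resolved sensor is re-evaluated."""
    return timedelta(seconds=interval_seconds)


print(max_wait_for_state_change())  # 6:00:00
```

Under that assumption, a resolved sensor can take up to six hours to show Success again.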

Element properties:

Target: Dell.WindowsServer.TemperatureSensor
Parent Monitor: System.Health.AvailabilityState
Category: Custom
Enabled: True
Alert Generate: False
Alert Auto Resolve: False
Monitor Type: Dell.WindowsServer.Detailed.SensorHealthCookDownUMT
Remotable: True
Accessibility: Public
RunAs: Default

Source Code:

<UnitMonitor ID="Dell.WindowsServer.ServerTemperatureSensorUnitMonitor" Accessibility="Public" Enabled="true" Target="Dell.WindowsServer.TemperatureSensor" ParentMonitorID="SystemHealth!System.Health.AvailabilityState" Remotable="true" Priority="Normal" TypeID="Dell.WindowsServer.Detailed.SensorHealthCookDownUMT" ConfirmDelivery="false">
  <Category>Custom</Category>
  <OperationalStates>
    <OperationalState ID="Success" MonitorTypeStateID="Success" HealthState="Success"/>
    <OperationalState ID="Critical" MonitorTypeStateID="Error" HealthState="Error"/>
    <OperationalState ID="Warning" MonitorTypeStateID="Warning" HealthState="Warning"/>
  </OperationalStates>
  <Configuration>
    <IntervalSeconds>21600</IntervalSeconds>
    <LogLevel>0</LogLevel>
    <Computer>$Target/Host/Host/Host/Property[Type="Windows!Microsoft.Windows.LogicalDevice"]/DeviceID$</Computer>
    <SensorType>2</SensorType>
    <ElementName>$Target/Property[Type="Dell.WindowsServer.TemperatureSensor"]/ElementName$</ElementName>
  </Configuration>
</UnitMonitor>
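The definition above can also be inspected programmatically, for example to check how the operational states map to health states or what the polling interval is. The following is a minimal sketch using only the Python standard library. It works on a trimmed copy of the XML (attributes and elements not needed here are omitted), and the helper name summarize_monitor is hypothetical, not part of any Dell or Operations Manager tooling.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the UnitMonitor definition (unused attributes/elements omitted).
MONITOR_XML = """
<UnitMonitor ID="Dell.WindowsServer.ServerTemperatureSensorUnitMonitor" Enabled="true">
  <OperationalStates>
    <OperationalState ID="Success" MonitorTypeStateID="Success" HealthState="Success"/>
    <OperationalState ID="Critical" MonitorTypeStateID="Error" HealthState="Error"/>
    <OperationalState ID="Warning" MonitorTypeStateID="Warning" HealthState="Warning"/>
  </OperationalStates>
  <Configuration>
    <IntervalSeconds>21600</IntervalSeconds>
    <SensorType>2</SensorType>
  </Configuration>
</UnitMonitor>
"""


def summarize_monitor(xml_text):
    """Return the monitor ID, its state mapping, and the interval in hours."""
    root = ET.fromstring(xml_text)
    # Map each operational state ID to (monitor type state, health state).
    states = {
        s.get("ID"): (s.get("MonitorTypeStateID"), s.get("HealthState"))
        for s in root.iter("OperationalState")
    }
    interval_hours = int(root.findtext("Configuration/IntervalSeconds")) / 3600
    return {"id": root.get("ID"), "states": states, "interval_hours": interval_hours}


summary = summarize_monitor(MONITOR_XML)
print(summary["states"]["Critical"])  # ('Error', 'Error')
print(summary["interval_hours"])      # 6.0
```

Note that the operational state named Critical corresponds to the monitor type state Error and the health state Error, and that 21600 seconds is a six-hour interval.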