Dell Server Memory Health
Dell Server Memory Unit Monitor
If Memory is in a warning state, the possible causes and resolutions are:
Cause | Resolutions |
Persistent correctable memory errors detected on a memory device at location(s) <location>. | Re-install the memory component. If the problem persists, contact technical support. Refer to the product documentation to choose a convenient contact method. |
Memory device at location <location> is throttled. | If unexpected, review system logs for power or thermal exceptions. |
Memory device at location <location> is absent. | If unexpected, confirm that the memory device is present, then re-install it. |
Correctable memory error rate exceeded for <location>. | Re-install the memory component. If the problem continues, contact support. |
Memory device at location <location> failed to transition to in test. | Re-install the memory component. If the problem continues, contact support. |
Memory device at location <location> is in a degraded state. | Re-install the memory component. If the problem continues, contact support. |
Memory RAID redundancy is degraded. Check memory device at location(s) <location>. | Re-install the memory component. If the problem continues, contact support. |
Memory mirror redundancy is degraded. Check memory device at location <location>. | Review system logs for memory exceptions. Re-install memory at location <location>. |
Memory redundancy is degraded. | Review system logs for memory exceptions. Re-install memory at location <location>. |
If Memory is in a critical state, the possible causes and resolutions are:
Cause | Resolutions |
Multi-bit memory errors detected on a memory device at location(s) <location>. | Re-install the memory component. If the problem persists, contact technical support. Refer to the product documentation to choose a convenient contact method. |
Parity memory errors detected on a memory device at location <location>. | Re-install the memory component. If the problem persists, contact technical support. Refer to the product documentation to choose a convenient contact method. |
Stuck bit memory error detected on a memory device at location <location>. | Re-install the memory component. If the problem persists, contact technical support. Refer to the product documentation to choose a convenient contact method. |
Memory device at location <location> is disabled. | Re-install the memory component. Review product documentation for supported memory configurations. If the problem continues, contact support. |
Persistent correctable memory error limit reached for a memory device at location(s) <location>. | Re-install the memory component. If the problem persists, contact technical support. Refer to the product documentation to choose a convenient contact method. |
Unsupported memory configuration; check memory device at location <location>. | Review product documentation for supported memory configurations. |
Memory device at location <location> is overheating. | If unexpected, review system logs for power or thermal exceptions. |
Correctable memory error rate exceeded for <location>. | Re-install the memory component. If the problem continues, contact support. |
Memory device at location <location> failed to transition to a running state. | Re-install the memory component. If the problem continues, contact support. |
Memory device at location <location> failed to power off. | Re-attempt the memory removal process. |
Memory device at location <location> failed to transition to online. | Re-attempt the memory removal process. |
Memory device at location <location> failed to transition to offline. | Re-attempt the memory removal process. |
Memory device at location <location> is not installed correctly. | Re-install the memory component. If the problem continues, contact support. |
Memory RAID redundancy is lost. Check memory device at location(s) <location>. | Re-install the memory component. If the problem continues, contact support. |
Memory mirror redundancy is lost. Check memory device at location(s) <location>. | Review system logs for memory exceptions. Re-install memory at location <location>. |
Memory spare redundancy is lost. Check memory device at location <location>. | Review system logs for memory exceptions. Re-install memory at location <location>. |
Memory redundancy is lost. | Review system logs for memory exceptions. Re-install memory at location <location>. |
A hardware mismatch detected for memory riser. | Review product documentation for proper memory riser installation and configuration. |
Correctable memory error logging disabled for a memory device at location <location>. | Review system logs for memory exceptions. Re-install memory at location <location>. |
Additional information on this issue may be available. Launch the iDRAC Console to debug further.
Target | Dell.ManagedServer.MemoryUnit |
Parent Monitor | System.Health.AvailabilityState |
Category | StateCollection |
Enabled | False |
Alert Generate | False |
Alert Auto Resolve | False |
Monitor Type | Dell.ManagedServer.ServerHealthCookDownUMT |
Remotable | True |
Accessibility | Public |
RunAs | Default |
<UnitMonitor ID="Dell.ManagedServer.MemoryUnitHealth" Accessibility="Public" Enabled="false" Target="DellManagedServer!Dell.ManagedServer.MemoryUnit" ParentMonitorID="Health!System.Health.AvailabilityState" Remotable="true" TypeID="Dell.ManagedServer.ServerHealthCookDownUMT" Priority="Normal" ConfirmDelivery="false">
  <Category>StateCollection</Category>
  <OperationalStates>
    <OperationalState HealthState="Success" MonitorTypeStateID="Success" ID="Success"/>
    <OperationalState HealthState="Error" MonitorTypeStateID="Error" ID="Critical"/>
    <OperationalState HealthState="Warning" MonitorTypeStateID="Warning" ID="Warning"/>
  </OperationalStates>
  <Configuration>
    <IntervalSeconds>21600</IntervalSeconds>
    <SyncTime/>
    <TimeoutSeconds>1200</TimeoutSeconds>
    <InstanceIndex>$Target/Property[Type="DellManagedServer!Dell.ManagedServer.MemoryUnit"]/ID$</InstanceIndex>
    <ComponentType>Dell.ManagedServer.MemoryUnit</ComponentType>
    <LogLevel>0</LogLevel>
  </Configuration>
</UnitMonitor>
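The monitor ships disabled (Enabled="false"), so it reports no state until an override turns it on. A minimal sketch of such an override, placed in a separate override management pack, might look like the following. The override ID and the DellMonitors alias (standing in for a reference to whichever management pack defines this monitor) are hypothetical. The DellManagedServer alias is reused from the monitor definition and would need a matching entry in the override pack's references.

```xml
<!-- Sketch only: the override ID and the DellMonitors alias are assumptions, not names from the Dell management packs. -->
<Monitoring>
  <Overrides>
    <!-- Enables the memory unit monitor for all Dell.ManagedServer.MemoryUnit instances. -->
    <MonitorPropertyOverride ID="Example.Override.EnableMemoryUnitHealth" Context="DellManagedServer!Dell.ManagedServer.MemoryUnit" Enforced="false" Monitor="DellMonitors!Dell.ManagedServer.MemoryUnitHealth" Property="Enabled">
      <Value>true</Value>
    </MonitorPropertyOverride>
  </Overrides>
</Monitoring>
```

Enabling the monitor this way does not turn on alerting. Alert Generate remains False unless it is overridden separately.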