A support CRU (also
called a Power-Fan canister) has failed and must be replaced.
What Caused the Problem?
A power-fan canister has failed and must be replaced. The Recovery Guru Details area provides specific information you will need as you follow the recovery steps.
Additional Information about the Problem
Removing a power-fan canister affects the cooling of the storage array in addition to the supply of power to the storage array.
Most trays have more than one power-fan canister. Refer to the applicable hardware manual for power-fan canister locations and detailed replacement procedures.
The storage array will remain in a Needs Attention state until the problem is resolved.
Alerts will be sent if they have been configured.
ATTENTION | Risk of unsafe operating temperature — A failed power-fan canister may cause the tray to reach an unsafe operating temperature and shut down (you may already see an over-temperature failure being reported). Always resolve a failed power-fan canister problem as soon as possible. There is a time limit (in the 5 to 15 minute range) beyond which you should not operate the affected tray with a failed power-fan canister. This time limit depends on the model of the tray. Refer to the applicable hardware manual for the specific time limit. |
ATTENTION | Possible loss of data access — Do not remove a component when either (1) the Service action (removal) allowed ( SAA ) field in the Details area of this recovery procedure is NO, or (2) the SAA LED on the affected component is OFF (note that some products do not have SAA LEDs). Removing a component while its SAA LED is OFF may result in temporary loss of access to your data. Refer to the following Important Notes for more detail. |
ATTENTION | Electrostatic discharge can damage sensitive components — Always use proper antistatic protection when handling components. Touching components without using a proper ground can damage the equipment. |
Service Action Allowed:
The
Service action (removal) allowed
field in the Details area indicates whether or not you can safely remove the component. If the SAA field is NO, then the affected component must remain in place until you service another component first.
The
Service action LED on Component
field in the Details area indicates whether or not a physical SAA LED is present on the hardware component. This field does NOT indicate whether the LED is ON or OFF (that indication is provided by the Service action (removal) allowed field).
If a component does not have an SAA LED, then it is OK to remove the component when its fault LED is lit and the
Service action (removal) allowed
field = YES in the Details area.
The
Service action (removal) allowed
field shown in the Details area and the physical SAA LED on the hardware component (if supported) MUST match before you remove the affected component. In rare cases (such as multiple problems), the status of the LED and the SAA field may not match. If there is a mismatch, then you should NOT remove the component until these indications match.
Recovery Steps
1 | Check the Service Action Allowed status for the Component requiring service in the Recovery Guru Details area.
| ||||||||||||||||||||||||||||
2 | Remove the component listed in the Component requiring service field in the Recovery Guru Details area. | ||||||||||||||||||||||||||||
3 | Check the model of the removed power-fan canister or the remaining power-fan canister and ensure that the new canister is the same model.
| ||||||||||||||||||||||||||||
4 | Insert the new power-fan canister securely into place. | ||||||||||||||||||||||||||||
5 | Click Recheck to run the Recovery Guru again.
|
Target | IBMStorageSubsystem.StorageSubsystem | ||
Parent Monitor | IBMStorageSubsystem.StorageSubsystemAvailability | ||
Category | Custom | ||
Enabled | True | ||
Alert Generate | True | ||
Alert Severity | Error | ||
Alert Priority | Normal | ||
Alert Auto Resolve | True | ||
Monitor Type | IBMStorageSubsystem.FailureUnitMonitorType | ||
Remotable | True | ||
Accessibility | Internal | ||
Alert Message |
| ||
RunAs | Default | ||
Comment | Machine generated entity |
<UnitMonitor ID="IBMStorageSubsystem.FailureID_0090_Monitor" Accessibility="Internal" Enabled="true" Target="IBMStorageSubsystem.StorageSubsystem" ParentMonitorID="IBMStorageSubsystem.StorageSubsystemAvailability" Remotable="true" Priority="Normal" TypeID="IBMStorageSubsystem.FailureUnitMonitorType" ConfirmDelivery="true" Comment="Machine generated entity">
<Category>Custom</Category>
<AlertSettings AlertMessage="IBMStorageSubsystem.REC_FAILED_SUPPORT_CRU_AlertMessageResourceID">
<AlertOnState>Error</AlertOnState>
<AutoResolve>true</AutoResolve>
<AlertPriority>Normal</AlertPriority>
<AlertSeverity>Error</AlertSeverity>
<AlertParameters>
<AlertParameter1>$Data/Context/Property[@Name='FailureDescription']$</AlertParameter1>
</AlertParameters>
</AlertSettings>
<OperationalStates>
<OperationalState ID="IBMStorageSubsystem.StateId493E93C9DDF85A4AD4A67F4143E98977" MonitorTypeStateID="NoIssue" HealthState="Success"/>
<OperationalState ID="IBMStorageSubsystem.StateId33E5CDA362909241E7267FD28FC81565" MonitorTypeStateID="IssueFound" HealthState="Error"/>
</OperationalStates>
<Configuration>
<FailureID>90</FailureID>
<IntervalSeconds>59</IntervalSeconds>
<TimeoutSeconds>300</TimeoutSeconds>
<Trace>0</Trace>
</Configuration>
</UnitMonitor>