Monitor REC_LOST_REDUNDANCY_ESM (34)

ESM Canister - Loss of Communication

What Caused the Problem?

Communication has been lost to one of the dual ESM canisters in a drive enclosure. The drive enclosure has only one I/O path available. The Recovery Guru Details area provides specific information you will need as you follow the recovery steps.

Caution: Electrostatic discharge can damage sensitive components. Always use proper antistatic protection when handling components. Touching components without using a proper ground may damage the equipment.

Important Notes

Correct this failure as soon as possible. Although the drive enclosure is still operational, a level of redundancy has been lost. If the other ESM canister or its communication path fails, all I/O to that drive enclosure will fail.
The Recovery Guru will report separate problems for any drive enclosures it cannot reach through the particular loop (channel) containing the affected drive enclosure.
The Recovery Guru Details area will report the affected drive enclosure and the working channel over which it can communicate with the drive enclosure.
The fault LED on an ESM canister glows only while the canister is performing its power-up self-test or after it has failed the self-test. The fault LED turns off after the self-test completes successfully.
Both ESM canisters in the same drive enclosure must have the same firmware version and be of the same hardware type.
Each ESM canister should carry a label that identifies its hardware type. If you need to replace one of the ESM canisters, make sure that the labels on both ESMs match.
Depending on the ESM problem, the controller may set the associated drive channel to a degraded state. This condition is covered in the recovery steps.
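
The matching requirement in the notes above can be written down as a small check to run before swapping a canister. This is only a minimal sketch and not part of the storage management software: the EsmCanister record, its field names, and the sample values are hypothetical, and the values are assumed to be read by hand from the canister labels and the firmware inventory.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class EsmCanister:
    """Hypothetical record describing one ESM canister."""
    hardware_type: str      # assumed to be copied from the canister label
    firmware_version: str   # assumed to be copied from the firmware inventory


def esm_mismatches(remaining: EsmCanister, replacement: EsmCanister) -> List[str]:
    """Return the names of the properties that differ between two canisters.

    An empty list means the pair meets the 'same hardware type and same
    firmware version' requirement. Surrounding whitespace is ignored.
    """
    problems = []
    if remaining.hardware_type.strip() != replacement.hardware_type.strip():
        problems.append("hardware type")
    if remaining.firmware_version.strip() != replacement.firmware_version.strip():
        problems.append("firmware version")
    return problems


# Made-up sample values, for illustration only.
in_enclosure = EsmCanister(hardware_type="TYPE-A", firmware_version="1.00")
spare = EsmCanister(hardware_type="TYPE-B", firmware_version="1.00")
print(esm_mismatches(in_enclosure, spare))
```

A non-empty result means the spare should not be installed until the listed differences are resolved.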

Recovery Steps

1. Fix any other problems reported by the Recovery Guru before attempting to fix this problem.

2. To determine the non-working channel, start at the drive channel on the controller enclosure corresponding to the working channel (refer to the labels on the back of the controller enclosure if needed).

3. Trace the cable from the working channel to the optimal ESM canister in the affected drive enclosure.

Caution: Possible loss of data accessibility. Do not disconnect any cables on the working channel. Disconnecting any cables on the working channel will result in loss of access to your data.

4. Locate the other ESM canister in the affected drive enclosure (this is the canister on the non-working channel).

5. Reseating the ESM canister on the non-working channel may clear the failure being reported. Reseat the canister as follows:

a. While the cables are still connected, remove the ESM canister from the drive enclosure.

Note: The Service Action Allowed status in the Details area is always NO for this problem because the component is not failed. In this situation, it is acceptable to remove the ESM canister even though the Service Action Allowed status is NO.

b. Wait 10 seconds, and then re-insert the canister firmly.

c. Wait 40 seconds, and then click the Recheck button to rerun the Recovery Guru and confirm that the failure has been fixed.

If	Then
This failure is no longer reported in the Summary area	If a "Degraded Drive Channel" problem is also being reported in the Summary area, then go to step 10; otherwise, you are finished with this procedure.
This failure is still reported in the Summary area	Go to step 6.
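
The decision table above can also be read as a two-input rule. The following minimal sketch encodes it exactly as the table states it. The function name and the returned strings are hypothetical, and the two flags are assumed to be filled in by the operator after reading the Summary area.

```python
def next_action_after_reseat(failure_still_reported: bool,
                             degraded_channel_reported: bool) -> str:
    """Map the Recheck result after reseating the canister to the next action."""
    if failure_still_reported:
        # Reseating did not help: continue with the replacement path.
        return "go to step 6"
    if degraded_channel_reported:
        # The failure cleared but a "Degraded Drive Channel" problem remains.
        return "go to step 10"
    return "finished"


# Example: the failure cleared and no degraded channel is reported.
print(next_action_after_reseat(False, False))
```

The failure check comes first because, per the table, a failure that is still reported always leads to step 6 regardless of the drive channel state.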

6. Disconnect the ESM canister on the non-working channel according to how its cable is attached:

If	Then
The cable is connected directly to the ESM canister on the non-working channel	Remove the cable. Go to step 7.
The cable is connected to an interface transceiver such as a GBIC or SFP	Leave the cable connected and remove the interface transceiver from the ESM canister. Go to step 7.

7. Repeat step 5 to check the Service action (removal) allowed status for the ESM, and then replace the ESM canister on the non-working channel using the following notes:

Make sure that the replacement ESM canister matches the ESM that remains in the drive enclosure (same hardware type and firmware version).
If the affected drive enclosure uses ESM canisters with external switches, verify that they are set to the same values as the switches on the remaining ESM.
Wait 40 seconds for the storage subsystem to detect and register the ESM canister.
Go to step 8.

8. Reconnect the cable or interface transceiver to the replaced ESM canister and wait 40 seconds. If a "Degraded Drive Channel" problem is also being reported in the Summary area, then go to step 9; otherwise, go to step 10.

9. Select the Advanced >> Troubleshooting >> Drive Channels pull-down menu option, highlight the degraded drive channel, and then click the Set Optimal button.

Note: The drive channel status may automatically be reset to Degraded if the controller again determines that the channel is not operating properly.

10. Click the Recheck button to rerun the Recovery Guru. The failure should no longer appear in the Summary area. If the failure appears again, contact your technical support representative.

IBMStorageSubsystem.FailureID_0034_Monitor (UnitMonitor)

Knowledge Base article:

ESM Canister - Loss of Communication

Element properties:

Source Code: