Dell MD Array Failed Virtual Disk

Dell.MDStorageArray.ABBXMLEvent17 (Rule)

Knowledge Base article:

Summary

Failed Virtual Disk

The causes and resolutions refer to the Dell Modular Disk Storage Manager recovery guru. Launch Dell Modular Disk Storage Manager to diagnose and fix the recovery failure as follows:

Causes

One or more virtual disks on the storage array have failed. The Recovery Guru Details area provides specific information you will need as you follow the recovery steps.

Caution: Possible loss of data accessibility. Do not remove a component when either (1) the Service Action (removal) Allowed (SAA) field in the Details area of this recovery procedure is NO, or (2) the SAA LED on the affected component is OFF (note that some products do not have SAA LEDs). Removing a component while its SAA LED is OFF may result in temporary loss of access to your data. Refer to the following Important Notes for more detail.

Caution: Electrostatic discharge can damage sensitive components. Always use proper antistatic protection when handling components. Touching components without using a proper ground may damage the equipment.

Important Notes

Resolutions

1

In the Array Management Window (AMW), verify the status of the physical disks associated with the failed virtual disks:

If...

Then...

The status of the associated physical disks is Optimal

It may be possible to recover data from the failed virtual disks. Do not continue with the remaining recovery steps and contact your Technical Support Representative. Performing any recovery actions before contacting your Technical Support Representative could jeopardize any chance of recovering data.

The status of the associated physical disks is NOT Optimal

Go to step 2.

2

If...

Then...

You mistakenly removed the wrong physical disk while performing a degraded virtual disk recovery procedure

You can return the virtual disk back to the Degraded state by replacing the physical disk you removed. After the virtual disks return to the Degraded state, click the Recheck button and perform the recovery steps listed for a "Degraded Virtual Disk" procedure. You are finished with this procedure.

You did NOT mistakenly remove the wrong physical disk while performing a degraded virtual disk recovery procedure

It may be possible to recover data from the failed virtual disks. If you wish to attempt a data recovery, do not continue with the remaining recovery steps and contact your Technical Support Representative. Performing any recovery actions before contacting your Technical Support Representative could jeopardize any chance of recovering data.

If you prefer to recover from an existing backup, go to step 3.

3

Important: You will delete the disk pool or disk group later in these recovery steps. If you wish to re-create the disk pool or disk group later using the same configuration, select the Monitor > Reports > Storage Array Profile menu option and then click the Save As button to save a copy of the Storage Array Profile. Make sure the "Storage" option is selected in the Save As dialog.

There are several different types of virtual disks that can exist in a disk pool or disk group. Use the Recovery Guru details area to determine the affected disk pool or disk group. Then, find the disk pool or disk group on the Storage and Copy Services tab in the AMW. Use the information provided by the AMW to determine the types of virtual disks on the affected disk pool or disk group. Step through every entry in the following table and perform all procedures associated with the virtual disk type combination for the affected disk pool or disk group.

If...

Then...

The affected disk pool or disk group contains one or more source or target virtual disks in a copy operation

Perform the following steps:

a

Select the Copy Services > Virtual Disk Copy > Manage Copies... menu option from the AMW.

b

Check to see if any of the copy operations involving the affected virtual disks have a copy status of Pending, In Progress, or Failed.

c

Highlight the copy pair that contains the affected virtual disk and select the Copy > Stop menu option.

d

Check to see if any of the Target virtual disks have read-only enabled.

e

Disable read-only by selecting ALL target virtual disks that have read-only enabled and then selecting the Change > Target Virtual Disk Permissions > Disable Read-Only menu option.

f

Go to step 4.

One or more snapshot virtual disks exist on the affected disk pool or disk group

The information on the snapshots is no longer valid and cannot be retrieved.

Delete all snapshot virtual disks associated with the affected disk pool or disk group by highlighting the snapshot virtual disk and selecting the Copy Services > Snapshot (Legacy) > Delete menu option. You will be able to create any needed snapshot virtual disks after this procedure has been completed.

One or more snapshot repository virtual disks exist on the affected disk pool or disk group

The information on the snapshot virtual disks associated with the affected snapshot repository virtual disks is no longer valid and cannot be retrieved, even if the associated snapshot virtual disks exist on a different disk pool or disk group.

Delete all snapshot virtual disks associated with the snapshot repositories on the affected disk pool or disk group by highlighting the associated snapshot virtual disks and selecting the Copy Services > Snapshot (Legacy) > Delete menu option. You will be able to create any needed snapshots after this procedure has been completed.

The remote replication repository virtual disks exist on the affected disk pool or disk group

Perform the following steps:

a

Save the Storage Array Profile by selecting the Monitor > Reports > Storage Array Profile menu option and then selecting the Save As button. The profile will give you a roadmap of any replication relationships you may want to recreate after re-activating Synchronous Replication.

b

Remove all replication relationships on this storage array by highlighting any primary virtual disk and selecting the Copy Services > Replication > Synchronous Replication > Remove Replication Relationship menu option. You can then select all replication relationships on the storage array.

c

Deactivate the Synchronous Replication feature by selecting the Copy Services > Replication > Deactivate... menu option.

d

Re-activate the Synchronous Replication feature by selecting the Copy Services > Replication > Activate... menu option. Your remote replication repository virtual disks will now reside on a different disk pool or disk group. Once the affected disk group or disk pool is restored, step 9 will help you restore your replication relationships.

One or more SynchronousReplication primary or secondary virtual disks exist on the affected disk group, but the remote replication repository virtual disks exist on a different disk pool or disk group

Perform the following steps:

a

Save the Storage Array Profile by selecting the Monitor > Reports > Storage Array Profile menu option and then selecting the Save As button. The profile will give you a roadmap of any replication relationships you may want to recreate after re-activating Synchronous Replication.

b

Remove the replication relationships for only those primary or secondary virtual disks on the affected disk pool or disk group by highlighting any of the affected primary virtual disks and selecting the Copy Services > Replication > Synchronous Replication > Remove Replication Relationship menu option . You can then select only those replication relationships that exist on the affected disk pool or disk group.

c

Once the affected disk pool or disk group is restored, step 9 will help you restore your replication relationships for the virtual disks on the affected disk pool or disk group. The replication relationships that use secondary virtual disks on the affected disk pool or disk group will synchronize automatically once the affected disk pool or disk group is restored.

d

Go to step 4.

Only standard virtual disks exist on the affected disk pool or disk group

Go to step 4.

4

Locate all failed physical disks associated with this disk pool or disk group (the fault indicator lights on the failed physical disks should be lit).

Note: To determine the associated physical disks, select one of the affected virtual disks (identified in the Details area) on the Storage and Copy Services tab in the AMW. Each associated physical disk will have an association dot underneath it.

5

Remove each failed physical disk.

6

Wait 30 seconds, then insert the new physical disks into the same slots (if you want to keep the disk pool or disk group on physical disks in the same slot locations). The fault indicator light on the new physical disks may become lit for a short time (one minute or less).

Note: Wait until the replaced physical disks are ready (fault indicator light off) before going to step 7.

7

Important: All data on the disk pool or disk group will be lost once you complete this step. Be sure that you have an adequate backup, or go back to step 1 if you want to attempt data recovery.

If...

Then...

The affected object is a disk pool

Perform the following steps:

a

On the Storage and Copy Services tab in the AMW, highlight the affected disk pool.

b

Select the Storage > Disk Pool > Delete... menu option.

c

Follow the instructions in the dialog, and then type "yes" to confirm the operation.

d

Go to step 8.

The affected object is a disk group

Perform the following steps:

a

On the Storage and Copy Services tab in the AMW, highlight the affected disk pool.

b

Select the Storage > Disk Group > Delete... menu option.

c

Follow the instructions in the dialog, and then type "yes" to confirm the operation.

d

Go to step 8.

8

If...

Then...

You want to re-create the disk pool or disk group with the same configuration before the failure

If...

Then...

The affected object is a disk pool

Perform the following steps:

a

On the Storage and Copy Services tab in the AMW, select a Free Capacity node in the tree.

b

Select the Storage > Disk Pool > Create... menu option.

c

While creating the new disk pool, refer to the Storage Array Profile you saved in step 3 for information about how the previous disk pool was configured.

d

Go to step 9.

The affected object is a disk group

Perform the following steps:

a

On the Storage and Copy Services tab in the AMW, select a Free Capacity node in the tree.

b

Select the Storage > Disk Group > Create... menu option.

c

While creating the new disk group, refer to the Storage Array Profile you saved in step 3 for information about how the previous disk group was configured.

d

Go to step 9.

You do not want to re-create the disk pool or disk group

Go to step 12.

9

If...

Then...

You deleted one or more snapshot virtual disks or snapshot repository virtual disks in step 3.

If desired, create new snapshot virtual disks to replace those you deleted.

You stopped one or more copy operations in step 3.

If desired, re-create any copies you stopped by highlighting the copy pairs in the Copy Manager and selecting Copy > Re-Copy.

You disabled read-only on any target virtual disks instep 3.

Restore that data on those virtual disks from backup.

You removed replication relationships during step 3

Re-create any desired replication relationships by selecting any disk you wish to be a primary (note that some of these virtual disks may reside on remote storage arrays) and selecting the Storage > Virtual Disk > Create > Synchronous Replicated Pair... menu option.

10

Add the virtual disks in the new disk pool or disk group back to the operating system. You may need to reboot the system to see the virtual disks.

Note: Do not start I/O to these virtual disks until after you restore from backup.

11

Restore the data for the new virtual disks from backup.

12

Click the Recheck button to rerun the Recovery Guru. The failure should no longer appear in the Summary area. If the failure appears again, contact your Technical Support Representative.

Element properties:

TargetMicrosoft.SystemCenter.ManagementServer
CategoryAlert
EnabledTrue
Alert GenerateTrue
Alert SeverityWarning
Alert PriorityNormal
RemotableTrue
Alert Message
Dell MD Array Failed Virtual Disk
{0}

Member Modules:

ID Module Type TypeId RunAs 
DS DataSource Microsoft.Windows.ScriptGenerated.EventProvider Default
Alert WriteAction System.Health.GenerateAlert Default
WriteToDW WriteAction Microsoft.SystemCenter.DataWarehouse.PublishEventData Default

Source Code:

<Rule ID="Dell.MDStorageArray.ABBXMLEvent17" Enabled="onEssentialMonitoring" Target="SystemCenter!Microsoft.SystemCenter.ManagementServer" ConfirmDelivery="true" Remotable="true" Priority="Normal" DiscardLevel="100">
<Category>Alert</Category>
<DataSources>
<DataSource ID="DS" TypeID="Windows!Microsoft.Windows.ScriptGenerated.EventProvider">
<ComputerName>$Target/Host/Property[Type="Windows!Microsoft.Windows.Computer"]/NetworkName$</ComputerName>
<ScriptName>RBODEventGenerator</ScriptName>
<EventNumber>17</EventNumber>
</DataSource>
</DataSources>
<WriteActions>
<WriteAction ID="Alert" TypeID="SystemHealth!System.Health.GenerateAlert">
<Priority>1</Priority>
<Severity>1</Severity>
<AlertMessageId>$MPElement[Name="Dell.MDStorageArray.ABBXMLEvent17.StringResource"]$</AlertMessageId>
<AlertParameters>
<AlertParameter1>$Data/EventDescription$</AlertParameter1>
</AlertParameters>
<Suppression>
<SuppressionValue>$Data/EventDisplayNumber$</SuppressionValue>
<SuppressionValue>$Data/Channel$</SuppressionValue>
<SuppressionValue>$Data/PublisherName$</SuppressionValue>
<SuppressionValue>$Data/LoggingComputer$</SuppressionValue>
<SuppressionValue>$Data/EventCategory$</SuppressionValue>
<SuppressionValue>$Data/EventLevel$</SuppressionValue>
<SuppressionValue>$Data/UserName$</SuppressionValue>
<SuppressionValue>$Data/EventNumber$</SuppressionValue>
<SuppressionValue>$Data/EventDescription$</SuppressionValue>
</Suppression>
<Custom1/>
<Custom2/>
<Custom3/>
<Custom4/>
<Custom5/>
<Custom6/>
<Custom7/>
<Custom8/>
<Custom9/>
<Custom10/>
</WriteAction>
<WriteAction ID="WriteToDW" TypeID="SCDW!Microsoft.SystemCenter.DataWarehouse.PublishEventData"/>
</WriteActions>
</Rule>