Dell MD Array Failed I/O Host Card

Dell.MDStorageArray.ABBXMLEvent150 (Rule)

Knowledge Base article:

Summary

Failed I/O Host Card

The causes and resolutions refer to the Dell Modular Disk Storage Manager recovery guru. Launch Dell Modular Disk Storage Manager to diagnose and fix the recovery failure as follows:

Causes

An I/O host card in one of the RAID controller modules is not functioning properly. The Recovery Guru Details area provides specific information you will need as you follow the recovery steps.

Caution:Electrostatic discharge can damage sensitive components. Always use proper antistatic protection when handling components. Touching components without using a proper ground may damage the equipment.

Important Notes

Resolutions

If...

Then...

Your storage array has one RAID controller module

Go to 'Procedure for Storage Arrays with One RAID Controller Module.'

Your storage array has two RAID controller modules

If there are any hosts connected to this storage array that are NOT running a host-based, multi-path failover driver, stop I/O to the storage array from each of these hosts.

Go to 'Procedure for Storage Arrays with Two RAID Controller Modules.'

Procedure for Storage Arrays with One RAID Controller Module

1

Check the replacement part number of the affected RAID controller module to ensure that the new RAID controller module has the same replacement part number.

a

On the Hardware tab in the Array Management Window (AMW), select the affected RAID controller module.

b

Identify the "Replacement part number" in the Properties pane.

If...

Then...

The replacement RAID controller module has the same part number

Go to step 2.

The replacement RAID controller module does NOT have the same part number

Do not continue with the remaining recovery steps and contact your Technical Support Representative.

2

Stop all I/O to this storage array.

3

Turn off power to all power-fan canisters in the enclosure containing the failed RAID controller module.

4

Remove the affected RAID controller module. Refer to the Enterprise Management Window to view which management method you are using to manage this storage array.

If...

Then...

You are using In-Band management for ALL hosts attached to this storage array

Go to step 5.

You are using Out-of-Band management for ANY host attached to this storage array

Before you insert a new RAID controller module into the storage array, you must update the DHCP/BOOTP server for each Out-of-Band managed host so that it will associate the new RAID controller module's hardware Ethernet (MAC) address with the DNS/network name and IP address previously assigned to the removed RAID controller module.

To update the DHCP/BOOTP server, find the entry associated with the removed RAID controller module and replace its Ethernet (MAC) address with the new RAID controller module's Ethernet (MAC) address. The RAID controller module's Ethernet (MAC) address is located on an Ethernet ID label on the RAID controller module in the form xx.xx.xx.xx.xx.xx.

When you are finished, go to step 5.

5

If necessary, insert the battery from the old RAID controller module into the new replacement RAID controller module. Make sure at least 1 minute has elapsed and then insert the new (compatible) RAID controller module firmly into place.

6

Turn on power to all power-fan canisters in the enclosure. Wait until all physical disks have completed the spin-up process, and then go to step 7.

7

On the Hardware tab in the AMW, select the affected RAID controller module and view the status of the RAID controller module in the Properties pane.

8

Click the Recheck button to rerun the Recovery Guru. The failure should no longer appear in the Summary area. If the failure appears again, contact your Technical Support Representative.

Procedure for Storage Arrays with Two RAID Controller Modules

1

Place the affected RAID controller module offline.

a

Select the RAID controller module on the Hardware tab in the Array Management Window.

b

Select the Hardware > RAID Controller Module > Advanced > Place > Offline... menu option.

c

Follow the instructions in the dialog, then click the Yes button.

2

Read all of the following steps before taking any action. The remaining recovery steps will no longer be accessible from the Recovery Guru dialog after you complete step a.

a

Click the Recheck button to rerun the Recovery Guru.

b

Select the "Offline RAID Controller Module" problem that is being reported in the Summary area.

c

Complete the recovery steps in the "Offline RAID Controller Module" recovery procedure to replace the affected RAID controller module.

Element properties:

TargetMicrosoft.SystemCenter.ManagementServer
CategoryAlert
EnabledTrue
Alert GenerateTrue
Alert SeverityWarning
Alert PriorityNormal
RemotableTrue
Alert Message
Dell MD Array Failed I/O Host Card
{0}

Member Modules:

ID Module Type TypeId RunAs 
DS DataSource Microsoft.Windows.ScriptGenerated.EventProvider Default
Alert WriteAction System.Health.GenerateAlert Default
WriteToDW WriteAction Microsoft.SystemCenter.DataWarehouse.PublishEventData Default

Source Code:

<Rule ID="Dell.MDStorageArray.ABBXMLEvent150" Enabled="onEssentialMonitoring" Target="SystemCenter!Microsoft.SystemCenter.ManagementServer" ConfirmDelivery="true" Remotable="true" Priority="Normal" DiscardLevel="100">
<Category>Alert</Category>
<DataSources>
<DataSource ID="DS" TypeID="Windows!Microsoft.Windows.ScriptGenerated.EventProvider">
<ComputerName>$Target/Host/Property[Type="Windows!Microsoft.Windows.Computer"]/NetworkName$</ComputerName>
<ScriptName>RBODEventGenerator</ScriptName>
<EventNumber>150</EventNumber>
</DataSource>
</DataSources>
<WriteActions>
<WriteAction ID="Alert" TypeID="SystemHealth!System.Health.GenerateAlert">
<Priority>1</Priority>
<Severity>1</Severity>
<AlertMessageId>$MPElement[Name="Dell.MDStorageArray.ABBXMLEvent150.StringResource"]$</AlertMessageId>
<AlertParameters>
<AlertParameter1>$Data/EventDescription$</AlertParameter1>
</AlertParameters>
<Suppression>
<SuppressionValue>$Data/EventDisplayNumber$</SuppressionValue>
<SuppressionValue>$Data/Channel$</SuppressionValue>
<SuppressionValue>$Data/PublisherName$</SuppressionValue>
<SuppressionValue>$Data/LoggingComputer$</SuppressionValue>
<SuppressionValue>$Data/EventCategory$</SuppressionValue>
<SuppressionValue>$Data/EventLevel$</SuppressionValue>
<SuppressionValue>$Data/UserName$</SuppressionValue>
<SuppressionValue>$Data/EventNumber$</SuppressionValue>
<SuppressionValue>$Data/EventDescription$</SuppressionValue>
</Suppression>
<Custom1/>
<Custom2/>
<Custom3/>
<Custom4/>
<Custom5/>
<Custom6/>
<Custom7/>
<Custom8/>
<Custom9/>
<Custom10/>
</WriteAction>
<WriteAction ID="WriteToDW" TypeID="SCDW!Microsoft.SystemCenter.DataWarehouse.PublishEventData"/>
</WriteActions>
</Rule>