Sun Microsystems, Inc.  Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1003876.1
Update Date:2010-10-29
Keywords:

Solution Type  Problem Resolution Sure

Solution  1003876.1 :   Sun StorEdge[TM] A1000/A3500 array: Failure of a single LUN can render the entire array inaccessible.  


Related Items
  • Sun Storage A3000 Array
  •  
  • Sun Storage A1000 Array
  •  
  • Sun Storage A3500 SCSI Array
  •  
  • Sun Netra st A1000 Array
  •  
  • Sun Storage A3500 FC Array
  •  
Related Categories
  • GCS>Sun Microsystems>Storage - Disk>Modular Disk - Other
  •  

PreviouslyPublishedAs
205440


Symptoms

A LUN failure on a Sun StorEdge[TM] A1000/A3500 array, may cause volumes on
optimal LUNs located on the same controller, to become inaccessible. As a
result, application downtime may occur. In the case of a Sun StorEdge A1000
array, which has a single controller, all the LUNs on it are effected.

Note: For RAID 0, a single drive failure could result in a LUN failure. For RAID
1/5, 2 or more disks would have to fail.

If the described issue occurs, messages similar to the following will be seen
upon LUN failure:

Aug 7 03:40:52 xxx unix: WARNING: /sbus@3,0/QLGC,isp@2,10000/sd@4,1 (sd60):
Aug 7 03:40:52 xxx unix: Error for Command: write(10) Error Level: Retryable
Aug 7 03:40:52 xxx unix: Requested Block: 5222672 Error Block: 5222672
Aug 7 03:40:52 xxx unix: Vendor: Symbios Serial Number: =[
Aug 7 03:40:52 xxx unix: Sense Key: Hardware Error
Aug 7 03:40:52 xxx unix: ASC: 0x84 (<vendor unique code 0x84>), ASCQ: 0x0, FRU: 0x0

In the above example, "sd60" is the RAID0 volume which is unavailable upon a
disk failure. "0x84/0x0" indicates the requested command or "Mode Select"
operation is not allowed with the logical unit in this state.

When this occurs, the target driver used to keep re-setting the target, which is
the controller in this case. As a result, all the LUNs hosted on the controller,
get re-set, and this sequence continues, causing the host to lose access to all
the LUNs on that controller.



Resolution

This is a known issue(bug 4511840) in the target(sd/ssd) driver, which was
fixed in the following patch levels:

Solaris[TM] Operating System(OS) OS patch
Solaris 2.6 OS 105356-22
Solaris 7 OS 107458-14
Solaris 8 OS 108974-18
Solaris 9 OS fix integrated

This is an old bug, so it is recommended to apply the latest rev of these
patches.

NOTE: The fix for this bug was integrated in Solaris 9 OS.



Product
Sun StorageTek A3500 Array
Sun StorageTek A3000
Netra st A1000 Array
Sun StorageTek A3500 FC Array
Sun StorageTek A1000 Array

Internal Comments

Service Request ID: 10732362

Escalation ID: 1-10982527

Solution ID: 1-10981876

Owner GEO: APAC

Customer GEO: APAC


A1000, A3500, A3500FC, 0x84, Hardware Error
Previously Published As
82441

Change History
Date: 2005-09-08
User Name: 18392
Action: Approved
Comment: Did some additional formatting.
Version: 4
Date: 2005-09-07
User Name: 86700
Action: Approved
Comment: Hi Geoff,

I have added what you requested in the document. It is in the parenthesis where we mention LUN failure. I havnt changed anything else. I didnt feel it was necessary as no matter what RAID we have, if the LUN is inaccessible, we would face this problem but as it was requested, I have done the needful.

Am putting it back in your queue so please let me know if everything is ok now.

Thanks and regards,

-Sailesh
Version: 0
Date: 2005-09-06
User Name: 18392
Action: Rejected
Comment: Added product name, product noun, trademarking, added STM, did some re-wording and formatting.
The bug report seems to indicate that there are different results depending on the RAID level. This is not made at all clear in the document. Please check the bug report as to what should be added.
Version: 0
Date: 2005-09-06
User Name: 18392
Action: Accept
Comment:
Version: 0
Date: 2005-09-06
User Name: 84756
Action: Approved
Comment: Accurate document for the A1000/A3500 storage arrays.

Thierry
Version: 0
Date: 2005-08-24
User Name: 84756
Action: Accept
Comment:
Version: 0
Date: 2005-08-24
User Name: 86700
Action: Approved
Comment: Hi KE,

This issue is old but unfortunately, we are still getting escalations on it and since it causes the whole array to come down, I thought of making it a SRDB.

Please do the review and let me know if it requires modification.

-Sailesh
Version: 0
Date: 2005-08-23
User Name: 86700
Action: Migrated
Comment: apollo 1-10981876 2005-08-08
Version: 0
Product_uuid
2a8022d4-0a18-11d6-8043-ee5a180fdb7f|Sun StorageTek A3500 Array
2a7ca41a-0a18-11d6-82f2-e96014c515ea|Sun StorageTek A3000
49f7ad4a-aa28-47c7-935a-b971312469ea|Netra st A1000 Array
b648cdf0-efb8-4d4f-93d4-b17c1baf1935|Sun StorageTek A3500 FC Array
2a792916-0a18-11d6-8d0a-c3d03933af3c|Sun StorageTek A1000 Array

Attachments
This solution has no attachment
  Copyright © 2011 Sun Microsystems, Inc.  All rights reserved.
 Feedback