Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition | |||
|
|
Solution Type FAB (standard) Sure Solution 1001189.1 : Early versions of StorADE health monitoring software may misrepresent disk errors from Sun StorEdge 3910/3960 and 6910/6960 Arrays and Sun StorEdge 6320 Systems.
PreviouslyPublishedAs 201584 Product Sun StorageTek 3960 Sun StorageTek 6320 System Sun StorageTek 6910 Sun StorageTek 6960 Sun StorageTek 3910 Bug Id <SUNBUG: 4988579> <SUNBUG: 4977589> <SUNBUG: 4986165> <SUNBUG: 4899903> <SUNBUG: 4976847> <SUNBUG: 4983823> <SUNBUG: 4992191> Impact Early versions of Sun StorADE health monitoring software embedded in the service processors of Sun StorEdge 3910/3960 and Sun StorEdge 6910/6960 Arrays and Sun StorEdge 6320 Systems do not correctly handle soft error messages coming from disk drives. This can lead to unnecessary replacement of disk drives or possible double device failures when an incorrectly diagnosed disk drive is pulled from a working array. Contributing Factors Affected systems/arrays include:
To determine the service processor image version, run the following command on SE3910/3960 and SE6910/6960 Arrays. # cat /etc/motd Sun Microsystems Inc. NWS Service Processor Image Revision 2.3.4 Oct 7, 2004 For SE6320 Systems, run this command. # cat /etc/release Sun StorEdge(tm) 6320 Service Processor Version 1.2.5 Patch Released March 14, 2005 Note that these commands are also run as part of "se_extract". See the results in the extractor output. The enabling of the volVerify functions in StorADE health monitoring software and the array disk_scrubber may result in an increased level of soft error messages. This occurs because the entire media is being read during the disk_scrubber cycle. During normal operation (w/o disk_scrubber enabled) only a percentage of the media is read and many of the soft errors may not be encountered. Symptoms There are several indications that this issue has been encountered. 1. The StorADE GUI Alarm view page shows ERRORs for recoverable soft errors on disk drives. Errors are indicated by a red symbol in the GUI alarm view page. Look for LogEvents with the red Error symbol. 2. LogEvent notification receipts with no apparent state or status changes in the disk drives operational behavior. Look for the absence of orange drive failure lights on the array. Example: (this is an example of a possible email message from an SE3910. The message from an SE6320 would be similar.) You requested the following events be forwarded to you from 'sp0 Site : sitename City State Agent : msp0 Severity : Error (Actionable) Category : StorEdge T3 DeviceId : t3b0 EventType: LogEvent.M.disk.u1d8.senseKey EventCode: 21.20.138 EventTime: 2005/03/18 05:50:05 DESCRIPTION: 1 array_warning(s) found in logfile /var/adm/messages.t3 (indicating problem with T3 msp0-t3b0/192.168.0.40): Mar 18 05:45:34 t3b0 ISR1[1]: N: u1d8 SCSI Disk Error Occurred (path = 0x0): Mar 18 05:45:34 t3b0 ISR1[1]: N: u1d8 SCSI Disk Error Occurred (path = 0x0) Mar 18 05:45:34 t3b0 ISR1[1]: N: u1d8 Sense Key = 0x1, Asc = 0x3, Ascq = 0x0 Mar 18 05:45:34 t3b0 ISR1[1]: N: u1d8 Sense Data Description = Peripheral Device Write Fault Mar 18 05:45:34 t3b0 ISR1[1]: N: u1d8 Valid Information = 0x3e8 PROBABLE-CAUSE: Sense Key information indicating a potential problem condition. RECOMMENDED-ACTION: 1. Replace Drive. 2. Contact support provider. Root Cause This issue occurs because policies in the StorADE health monitoring software were deficient with respect to soft disk errors. The issue is addressed with new service processor images now available for these arrays/systems. Resolution Upgrade the StorADE health monitoring software and the array firmware by installing the latest available Service Processor image. Follow these general upgrade steps.
SE6320 System Patch 1.2.5 Date: February 28, 2005 Software Components: SP Software 115589-10 FBR Patch 117106-02 PatchPro Patch 113193-05 SUNWstads Support Patch 114591-20 Array Firmware 3.1.6 115179-13 Fujitsu (MAP3735F) 72GB 10k 1601 116514-07 Fujitsu (MAP3147F) 146GB 10k 1601 116815-05 Fujitsu (MAS3367F) 36GB 15k 0801 116816-02 Fujitsu (MAS3735F) 72GB 15k 0801 116817-02 This image may be downloaded from http://edist.central or https://spe.sun.com
SE3910/3960 and SE6910/6960 Patch 2.3.4 Date: 17 Dec 2004 Software Components: SA 2.2 Release matrix patch T116393-05 Array Firmware 3.1.5 115180-07 SUNWsesp2.3.4 This image may be downloaded from http://webhome.sfbay/SIG/indy/Downloads.shtml or https://spe.sun.com IMPORTANT NOTE: To upgrade a 3910/3960 and 6910/6960, a specific upgrade path must be followed. Each has its own CD. The Field Engineer will have to determine the correct path, then get the CDs for each upgrade. The paths are:
Comments Please keep in mind that with every new release there are fixes to known issues and these fixes are making the product better. Be proactive and make it your mission to ensure all your customers storage products are running with the best possible chance of working at peak performance. Always be sure to research the current released version and obtain it for the upgrade. Refer to the README docs to ensure success. Related Information http://webhome.sfbay/SIG/indy/Downloads.shtml Previously Published As 101788 Internal Contributor/submitter Anthony Mullen Internal Eng Business Unit Group KE Authors Internal Eng Responsible Engineer Anthony Mullen Internal Services Knowledge Engineer Pete Stauffer Internal Kasp FAB Legacy ID 101788 Internal Sun Alert & FAB Admin Info Critical Category: Significant Change Date: Avoidance: Upgrade Responsible Manager: null Original Admin Info: null Product_uuid 05b0b61b-e1ba-4e07-a932-a782f3b92213|Sun StorageTek 3960 4de60cc2-a08e-4610-b8bf-6a1881cb59c6|Sun StorageTek 6320 System 681b08b4-d683-4f9e-b8ba-cbbb87b01d05|Sun StorageTek 6910 86b6cc47-00d7-43f1-8efd-81690a8d5b6f|Sun StorageTek 6960 f651ba09-f2d9-4dbc-918c-ad530528e4dc|Sun StorageTek 3910 Attachments This solution has no attachment |
||||||||||||
|