Sun Microsystems, Inc.  Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition

Asset ID: 1-72-1164893.1
Update Date:2010-08-06
Keywords:

Solution Type  Problem Resolution Sure

Solution  1164893.1 :   Copy back not starting after replacing a faulty drive in a Sun StorageTek[TM] 2500; 6140; 6540; 6580; 6780 and Flexline 380  


Related Items
  • Sun Storage 6540 Array
  • Sun Storage 2540 Array
  • Sun Storage 6580 Array
  • Sun Storage 6780 Array
  • Sun Storage Flexline 380 Array
  • Sun Storage 6140 Array
  • Sun Storage 2510 Array
  • Sun Storage 2530 Array
Related Categories
  • GCS>Sun Microsystems>Storage - Disk>Modular Disk - 6xxx Arrays


When a Global Hot Spare (GHS) takes over for a failed disk drive, the array copies the data back from the GHS to the replacement drive once that drive has been inserted into the storage system. Certain conditions prevent this copy back from starting; this document describes those conditions and how to trigger the copy back manually.

In this Document
  Symptoms
  Cause
  Solution


Applies to:

Sun Storage 6140 Array - Version: Not Applicable and later   [Release: NA and later ]
Sun Storage 6540 Array - Version: Not Applicable to Not Applicable   [Release: NA to NA]
Sun Storage 2510 Array - Version: Not Applicable to Not Applicable   [Release: NA to NA]
Sun Storage 2530 Array - Version: Not Applicable to Not Applicable   [Release: NA to NA]
Sun Storage 2540 Array - Version: Not Applicable to Not Applicable   [Release: NA to NA]
Information in this document applies to any platform.

Symptoms

Use case 1
  1. Drive failed by SYSTEM or USER.
  2. Reconstruction completes to GHS successfully.
  3. Failed drive is replaced in the enclosure.
Results:
  • For firmware 6.xx.xx.xx:
    The copy back operation starts, provided that no more than two (2) reconstruction or copy back operations, in any combination, are already in progress on the system. Otherwise, it is queued.
  • For firmware 7.10.xx.xx (all revisions) through 7.35.xx.xx (all revisions):
    The copy back operation starts, provided that no more than two (2) reconstruction or copy back operations, in any combination, are already in progress on the system. Otherwise, user intervention is required to trigger the copy back. (See the Solution below.)
  • For firmware 7.50.xx.xx (all revisions):
    The copy back operation starts, provided that no more than two (2) reconstruction or copy back operations, in any combination, are already in progress on the system. Otherwise, it is queued.
Use case 2
  1. Drive failed by SYSTEM or USER.
  2. Reconstruction to GHS starts.
  3. Failed drive is removed from the system and replaced before the reconstruction completes.
  4. Reconstruction completes successfully.
Results:
  • For firmware 6.xx.xx.xx:
    The copy back operation starts, provided that no more than two (2) reconstruction or copy back operations, in any combination, are already in progress on the system. Otherwise, it is queued.
  • For firmware 7.10.xx.xx (all revisions) through 7.35.xx.xx (all revisions):
    The copy back operation is neither queued nor started automatically. User intervention is required to trigger the copy back. (See the Solution below.)
  • For firmware 7.50.xx.xx (all revisions):
    The copy back operation starts, provided that no more than two (2) reconstruction or copy back operations, in any combination, are already in progress on the system. Otherwise, it is queued.
Use case 3
  1. Drive is pulled from system.
  2. Reconstruction to GHS starts and completes.
  3. Failed drive is replaced.
Results:
  • For firmware 6.xx.xx.xx:
    The copy back operation starts, provided that no more than two (2) reconstruction or copy back operations, in any combination, are already in progress on the system. Otherwise, it is queued.
  • For firmware 7.xx.xx.xx:
    The copy back operation is neither queued nor started automatically. User intervention is required to trigger the copy back. (See the Solution below.)
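
The Results listed for the three use cases can be condensed into a small lookup. The shell sketch below is illustrative only: the function name copyback_behavior, its arguments, and the labels starts, queued, manual and unknown are assumptions made for this example and are not part of CAM or the array firmware. It assumes the firmware revision is given in numeric dotted form, and it takes an optional third argument, busy, meaning that the limit of concurrent reconstruction or copy back operations has already been reached. 7.xx releases that use cases 1 and 2 do not describe are reported as unknown.

```shell
#!/bin/sh
# Illustrative sketch: encodes the Results of use cases 1-3 from this document.
# All names and labels here are hypothetical, not a CAM or firmware interface.

# Strip leading zeros so that "07" is treated as 7 (and "00" as 0).
strip_zeros() {
    n=$(printf '%s' "$1" | sed 's/^0*//')
    printf '%s' "${n:-0}"
}

# usage: copyback_behavior <firmware> <use_case> [busy]
# prints: starts | queued | manual | unknown
copyback_behavior() {
    fw=$1
    use_case=$2
    load=${3:-idle}
    rest=${fw#*.}
    major=$(strip_zeros "${fw%%.*}")
    minor=$(strip_zeros "${rest%%.*}")

    case $use_case in
        1|2|3) ;;
        *) echo unknown; return ;;
    esac

    # Outcome when the array follows the "start, or queue if busy" rule.
    if [ "$load" = busy ]; then auto=queued; else auto=starts; fi

    case $major in
        6) echo "$auto"; return ;;   # 6.xx: same result in all three use cases
        7) ;;
        *) echo unknown; return ;;
    esac

    # Firmware 7.xx from here on.
    if [ "$use_case" = 3 ]; then
        echo manual                  # use case 3: all 7.xx releases
    elif [ "$minor" -ge 10 ] && [ "$minor" -le 35 ]; then
        if [ "$use_case" = 2 ] || [ "$load" = busy ]; then
            echo manual
        else
            echo starts
        fi
    elif [ "$minor" -eq 50 ]; then
        echo "$auto"
    else
        echo unknown                 # release not described in this document
    fi
}
```

For example, copyback_behavior 07.15.00.00 2 prints manual, which corresponds to the case where the Solution below has to be applied. The revision string in that call is a placeholder, not a specific release.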

Cause

The copy back function has changed slightly between firmware revisions. Depending on the firmware revision and the circumstances, the copy back from the GHS to the replacement drive may not start without manual intervention.

Solution

Sun StorageTek Common Array Manager
  1. Use CAM to confirm that all reconstruction jobs have completed before proceeding to the next step.
  2. Use the following CAM command line to list the drives needing replacement:

    service -d <array-name> -c replace -q list

    Location of the 'service' command:

    Solaris: /opt/SUNWsefms/bin/
    Linux:  /opt/sun/cam/private/fms/bin/
    Windows: C:\Program Files\Sun\Common Array Manager\Component\fms\bin\

    Example:
    /opt/SUNWsefms/bin/service -d st6140c -c replace -q list
    Executing the replace command on st6140c
    Drives needing replacment:
    Tray.85.Drive.02
    In use hot spares:
    Tray.85.Drive.16
    Unassigned drives available for replacment:
    Tray.85.Drive.10
    Tray.85.Drive.05
    Tray.85.Drive.11
    Tray.85.Drive.06
    Tray.85.Drive.04
    Tray.85.Drive.03
    Tray.85.Drive.08
    Tray.85.Drive.12
    Tray.85.Drive.07
    Tray.85.Drive.09
    Tray.85.Drive.13
    Tray.85.Drive.14
    Tray.85.Drive.02


    The above example shows that drive 85,02 (Tray.85.Drive.02) needs to be replaced. This drive has already been physically replaced, but the copy back did not start.

  3. Use the following CAM command line to manually trigger the copy back:

    service -d <array-name> -c replace -t <drive_needing_replacement> -q <drive_to_be_used_for_the_replacement>

    Example:
    /opt/SUNWsefms/bin/service -d st6140c -c replace -t t85d02 -q t85d02
    Executing the replace command on st6140c
    Completion Status: Success

    In the above example, we manually trigger the copy back by replacing the drive 85,02 with itself. This drive has already been physically replaced.

  4. Use CAM to confirm that the copy back from the in-use GHS has started.
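
Steps 2 and 3 can be combined in a small helper that reads the list output and prints the matching replace command for review. This is a minimal sketch, not a CAM feature: the function names to_short_id, drives_needing_replacement and print_replace_commands are hypothetical, and the Tray.NN.Drive.NN to tNNdNN mapping is inferred from the example above (Tray.85.Drive.02 and t85d02). The helper only prints commands and does not run them. Replacing a drive with itself is appropriate only after confirming that the drive has already been physically replaced and that reconstruction has completed.

```shell
#!/bin/sh
# Illustrative sketch: turn "replace -q list" output into self-replace commands.
# Function names are hypothetical; nothing here is executed against the array.

# Tray.85.Drive.02 -> t85d02 (naming inferred from the example in this document).
to_short_id() {
    printf '%s\n' "$1" |
        sed -n 's/^Tray\.\([0-9][0-9]*\)\.Drive\.\([0-9][0-9]*\)$/t\1d\2/p'
}

# Read list output on stdin; print the drives under the "Drives needing ..." heading.
drives_needing_replacement() {
    awk '
        { sub(/^[ \t]+/, ""); sub(/[ \t\r]+$/, "") }  # trim indentation and trailing blanks
        /^Drives needing/ { in_section = 1; next }    # start of the section of interest
        /:$/              { in_section = 0; next }    # any other heading ends it
        in_section && /^Tray\./ { print }
    '
}

# usage: <list output> | print_replace_commands <array-name>
print_replace_commands() {
    array=$1
    drives_needing_replacement | while read -r drive; do
        id=$(to_short_id "$drive")
        if [ -n "$id" ]; then
            printf 'service -d %s -c replace -t %s -q %s\n' "$array" "$id" "$id"
        fi
    done
}
```

A possible invocation on Solaris, using the array name from the example above, is:

    /opt/SUNWsefms/bin/service -d st6140c -c replace -q list | print_replace_commands st6140c

Review each printed command before running it by hand, then continue with step 4.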
Sun StorageTek SANtricity Storage Manager
  1. In the Array Management Window, select the Volume Group which contains the replacement drive.
  2. Select Volume Group -> Replace Drives.
  3. Select the replacement drive and replace it with itself.

  Copyright © 2011 Sun Microsystems, Inc.  All rights reserved.