Sun Microsystems, Inc.  Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1334150.1
Update Date:2012-09-06
Keywords:

Solution Type  Technical Instruction Sure

Solution  1334150.1 :   How to remove and replace a SPARC T3-4 Server Hard Drive:ATR:1334150.1:0  


Related Items
  • SPARC T3-4
  •  
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: SPARC-CAP VCAP
  •  




In this Document
Goal
Fix


Applies to:

SPARC T3-4 - Version Not Applicable and later
Information in this document applies to any platform.

Goal

How to remove and replace a T3-4 Hard Drive

****************************************************************
To report errors or request improvements on this procedure,
please Add a Comment on Doc ID: 1334150.1
****************************************************************

Fix

DISPATCH INSTRUCTIONS

WHAT SKILLS DOES THE ENGINEER NEED:(IS A SITE ENGINEER AVAILABLE?)
This is a CRU procedure.

TIME ESTIMATE: 60 minutes

TASK COMPLEXITY: 0

FIELD ENGINEER INSTRUCTIONS

PROBLEM OVERVIEW: failed hard drive

WHAT STATE SHOULD THE SYSTEM BE IN TO BE READY TO PERFORM THE RESOLUTION ACTIVITY? :

Note: This is a hot service procedure that can be performed by customers while the server is running. The drive must be taken offline before it is removed. If it cannot be taken offline, the OS must be shut down to prevent running programs from attempting to use it.
NOTE: Please be aware that Disks could be in hardware raid volumes. The "cfgadm" commands in this document (and in the Service manual) are for individual drives that are not part of RAID volumes. When a drive is part of a RAID volume, you do not need to unconfigure it before swapping it with a new drive. When replacing a failed drive in a hardware RAID volume, follow the T3-4 RAID Drive Replacement Strategies at: http://docs.oracle.com/cd/E19417-01/html/E20812/z40000081535370.html#scrolltoc

WHAT ACTION DOES THE ENGINEER NEED TO TAKE:

Hard Drive Configuration Reference

This link shows the hard drive slot configuration: http://docs.oracle.com/cd/E19417-01/E20812/z40019011299944.html#scrolltoc

Depending on the configuration of the data on a particular drive, the drive might also be removable while the server is online. However, to hot-plug a drive while the server is online you must take the drive offline before you can safely remove it. Taking a drive offline prevents any applications from accessing it, and removes the logical software links to it.

The following situations inhibit your ability to hot-plug a drive:

- If the drive contains the operating system, and the operating system is not mirrored on another drive.

- If the drive cannot be logically isolated from the online operations of the server.

If either of these conditions apply to the drive being serviced, you must take the server offline (shut down the operating system) before you replace the drive.

A) Remove a Hard Drive

A hard drive is a hot-service component that can be replaced by a customer.

1. Locate the drive in the server that you want to remove.

2. Determine if you need to shut down the OS to replace the drive, and perform one of the following actions:

- If the drive cannot be taken offline without shutting down the OS, follow instructions in Power Off the Server (SP Command) then go to Step 4.

- If the drive can be taken offline without shutting down the OS, go to Step 3.

3. Take the drive offline:

a. At the Oracle Solaris prompt, type the cfgadm -al command to list all drives in the device tree, including drives that are not configured:

# cfgadm -al

This command lists dynamically reconfigurable hardware resources and shows their operational status. In this case, look for the status of the drive you plan to remove. This information is listed in the Occupant column.

Example:

Ap_id Type Receptacle Occupant Condition
...
c2 scsi-sas connected configured unknown
c2::w5000cca00a76d1f5,0 disk-path connected configured unknown
c3 scsi-sas connected configured unknown
c3::w5000cca00a772bd1,0 disk-path connected configured unknown
c4 scsi-sas connected configured unknown
c4::w5000cca00a59b0a9,0 disk-path connected configured unknown
...

You must unconfigure any drive whose status is listed as configured, as described in Step b.

b. Unconfigure the drive using the cfgadm -c unconfigure command.

Example:

# cfgadm -c unconfigure c2::w5000cca00a76d1f5,0

Replace c2::w5000cca00a76d1f5,0 with the drive name that applies to your situation.

c. Verify that the drives blue Ready-to-Remove LED is lit.

4. Press the drive release button to unlock the drive and pull on the latch to remove the drive.
Caution - The latch is not an ejector. Do not force the latch too far to the right. Doing so can damage the latch.

5. Install the replacement drive or a filler tray.

B) Install a Hard Drive

1. Align the replacement drive to the drive slot and slide the drive in until it is seated.

Drives are physically addressed according to the slot in which they are installed. If you are replacing a drive, install the replacement drive in the same slot as the drive that was removed.

2. Close the latch to lock the drive in place.

3. Verify the drive functionality.

a. Determine if you replaced or installed a hard drive in a running server or not.

- If you replaced or installed a hard drive in a server that is running (if you hot-plugged the hard drive), then no further action is necessary. The Solaris OS will auto-configure your hard drive.

- If you replaced or installed a hard drive in a powered-down server, then continue with these procedures to configure the hard drive.

b. If the OS is shut down, and the drive you replaced was not the boot device, boot the OS.

Depending on the nature of the replaced drive, you might need to perform administrative tasks to reinstall software before the server can boot.
Refer to the Oracle Solaris OS administration documentation for more information.

c. At the Oracle Solaris prompt, type the cfgadm -al command to list all drives in the device tree, including any drives that are not configured:

# cfgadm -al

This command helps you identify the drive you installed. Example:

Ap_id Type Receptacle Occupant Condition
...
c2 scsi-sas connected configured unknown
c2::w5000cca00a76d1f5,0 disk-path connected configured unknown
c3 scsi-sas connected configured unknown
c3::sd2 disk-path connected unconfigured unknown
c4 scsi-sas connected configured unknown
c4::w5000cca00a59b0a9,0 disk-path connected configured unknown
...

d. Configure the drive using the cfgadm -c configure command.

Example:

# cfgadm -c configure c2::w5000cca00a76d1f5,0

Replace c2::w5000cca00a76d1f5,0 with the drive name for your configuration.

e. Verify that the blue Ready-to-Remove LED is no longer lit on the drive that you installed.

f. At the Oracle Solaris prompt, type the cfgadm -al command to list all drives in the device tree, including any drives that are not configured:

# cfgadm -al

The replacement drive is now listed as configured. Example:

Ap_id Type Receptacle Occupant Condition
...
c2 scsi-sas connected configured unknown
c2::w5000cca00a76d1f5,0 disk-path connected configured unknown
c3 scsi-sas connected configured unknown
c3::w5000cca00a772bd1,0 disk-path connected configured unknown
c4 scsi-sas connected configured unknown
c4::w5000cca00a59b0a9,0 disk-path connected configured unknown
...

OBTAIN CUSTOMER ACCEPTANCE
WHAT ACTION DOES THE CUSTOMER NEED TO TAKE TO RETURN THE SYSTEM TO AN OPERATIONAL STATE:
PARTS NOTE:
REFERENCE INFORMATION:
SPARC T3-4 Server Service Guide: http://docs.oracle.com/cd/E19417-01/E20812/gentextid-6532.html#scrolltoc
Detailed SVM instructions can be found at  http://download.oracle.com/docs/cd/E19253-01/816-4520/troubleshoottasks-96/index.html.
See Document 1010946.1 for Detailed Veritas Volume Manager instructions.
See Document 1002753.1 for Detailed ZFS instructions

Attachments
This solution has no attachment
  Copyright © 2012 Sun Microsystems, Inc.  All rights reserved.
 Feedback