Sun Microsystems, Inc.  Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-75-1010039.1
Update Date:2012-07-17
Keywords:

Solution Type  Troubleshooting Sure

Solution  1010039.1 :   Analyzing Internal RAID/non-RAID Disk Failures for x64 Sun Blade [TM] 6000/8000 server platforms  


Related Items
  • Sun Blade X6220 Server Module
  •  
  • Sun Blade X6420 Server Module
  •  
  • Sun Blade X8450 Server Module
  •  
  • Sun Blade X8420 Server Module
  •  
  • Sun Blade 8000 P System
  •  
  • Sun Blade 8000 System
  •  
  • Sun Blade X6250 Server Module
  •  
  • Sun Blade X8440 Server Module
  •  
  • Sun Blade X6450 Server Module
  •  
Related Categories
  • PLA-Support>Sun Systems>x64>Blades>SN-x64: BLADE
  •  

PreviouslyPublishedAs
213788


Applies to:

Sun Blade 8000 System - Version Not Applicable to Not Applicable [Release N/A]
Sun Blade 8000 P System - Version Not Applicable to Not Applicable [Release N/A]
Sun Blade X6220 Server Module - Version Not Applicable to Not Applicable [Release N/A]
Sun Blade X6250 Server Module - Version Not Applicable to Not Applicable [Release N/A]
Sun Blade X6420 Server Module - Version Not Applicable to Not Applicable [Release N/A]
All Platforms

Purpose

This document addresses failures of internal disks in Solaris[TM] x64, Red Hat , SuSE/Novell and Windows platforms. Failures under hardware RAID and JBOD (non-RAID) are discussed in this document.

Troubleshooting Steps

Symptoms:

- Disk service LED illuminated
- Disk errors in system messages files
- Disk errors on console
- Disk SMART errors during the boot process

Step 1. Verify a supported platform disk and part number

The following link references a support document that assists in the identification of a disk part number. In addition, the document provides the public web location of the Sun systems handbook to confirm the disk in question is a supported disk for your platform:

<Document: 1010055.1> Identifying Oracle Supported Platform Disks

Disks that are not listed on a platforms documentation and deemed unsupported. This is because they have not been tested and therefore have unknown properties and as such may produce unknown errors.
Even if an unsupported disk appears to work correctly, it is recommended to always use supported disks for contracted platforms.

Step 2. Verify disk is or is not a member of a RAID array

The following links reference support documents that assist in identifying if your Solaris, Linux or Windows operating environments are installed as part of a RAID array or not. The Windows instructions are in line:

Solaris: <Document: 1017961.1> How to Identify if a Solaris[TM] Operating Environment is Installed on a Hardware RAID

Linux: <Document: 1013003.1> How to Identify if a Linux Operating Environment is Installed on a Hardware RAID Controller

Windows:
Click on the following:

Right Click on My Computer and select Properties

Select the Hardware tab from the window that appears

Click on Device Manager

Click on Disk Drives. Installed disk(s) are listed

If the disk drive(s) listed display the name Adaptec, LSI, NVIDIA or StorageTek then your platforms disks are under the control of a RAID device.
If however the disk drive(s) listed display the name(s) Fujitsu, Hitachi or Seagate then your platform is not configured under the control of a RAID device therefore is a JBOD only (Just a Bunch Of Disks).

Troubleshooting steps differ for platforms that are installed under the control or a RAID management device. This is because disks under RAID control are hidden from the operating environment and are referenced as a pseudo or meta-device.

Step 3. Verify disk is online has has not been going offline and no physical disk hardware problem

The following links reference support documents that assists in identifying the online/offline status of directly attached platform disks. This document also discusses the location of your operating system error logs and the format in which disk errors should appear:

Solaris: <Document: 1005530.1> How to Check for Solaris[TM] x64 Disk Errors and Online/Offline Status

Linux: <Document: 1002936.1> How to Check for Linux Platform Disk Errors and Online/Offline Status

Windows:  <Document: 1011590.1> How to check for Windows platform disk errors and online/offline status

Disks that are not directly attached to the platform (for example installed in an external storage array), are not discussed in this document.


Storage array disks may have different properties when connected to and behind an external controller and as such change the error syntax and tools used for collection and configuration.

Step 4. Verify disk firmware revision and known applicable issues

The following link references a support document that assists in identifying the disk model number and firmware revision to check for known issues and if applicable patch updates:

<Document: 1008396.1> How to Identify Optical and Hard Disk Firmware Revisions for Checking of Known Issues

Patches and firmware updates are often available for disks under multiple operating systems.

Checking for known issues and updates results in decreased downtime.

Step 5. Run information gathering programs and raise an Oracle service request

The following links reference support documents that assist in the gathering of information from your Solaris, Red Hat, Novell/SuSE and Windows platforms using their own information gathering tools.

Solaris: <Document: 1312847.1> Solaris Explorer PIC

Novell/SuSE Enterprise Linux: <Document: 1010057.1> How to gather information on SuSE Linux Enterprise Systems

Red Hat Enterprise Linux: <Document: 1010058.1> How to Gather Information on Red Hat Enterprise Linux Systems

Windows msinfo32:


Click on the following:

Click on Start and select Run

Type "msinfo32" in the text box that appears

Select the File menu and then select Export

Provide a file name and send this file to Oracle

This is necessary if the resolution steps above did not resolve your issue and Oracle needs to be engaged to continue diagnosis for you. Information gathering programs gather operating system parameters and configuration information from your platform.

At this point, if you have validated that each troubleshooting step above is true for your environment, and the issue still exists, further troubleshooting is required. For additional support contact Oracle Support.

 


 Previously Published As 91617

References

<NOTE:1002936.1> - How to Check for Linux Platform Disk Errors and Online/Offline Status
<NOTE:1005530.1> - How to Check for Solaris[TM] X86 / X64 Disk Errors and Online/Offline Status
<NOTE:1008396.1> - How to Identify Optical and Hard Disk Firmware Revisions for Checking of Known Issues
<NOTE:1010055.1> - Identifying Oracle Supported Platform Disks
<NOTE:1010057.1> - How to gather information on SuSE Linux Enterprise Systems
<NOTE:1010058.1> - How to Gather Information on Red Hat Enterprise Linux Systems
<NOTE:1011590.1> - How to check for Windows platform disk errors and online/offline status
<NOTE:1013003.1> - How to Identify if a Linux Operating Environment is Installed on a Hardware RAID Controller
<NOTE:1017961.1> - How to Identify if a Solaris[TM] Operating Environment is Installed on a Hardware RAID Controller
@<NOTE:1018748.1> - How to Run Oracle Explorer and Forward the Data to an Oracle Service Engineer

Attachments
This solution has no attachment
  Copyright © 2012 Sun Microsystems, Inc.  All rights reserved.
 Feedback