Sun Microsystems, Inc.  Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-75-1392919.1
Update Date:2012-02-27
Keywords:

Solution Type  Troubleshooting Sure

Solution  1392919.1 :   Troubleshooting Sun Storage[TM] Array non-SMART Battery Faults  


Related Items
  • Sun Storage Flexline 240 Array
  •  
  • Sun Storage Flexline 380 Array
  •  
  • Sun Storage 6540 Array
  •  
  • Sun Storage 6130 Array
  •  
  • Sun Storage Flexline 280 Array
  •  
  • Sun Storage Flexline 210 Array
  •  
  • Sun Storage 6140 Array
  •  
Related Categories
  • PLA-Support>Sun Systems>DISK>Arrays>SN-DK: 6130
  •  
  • .Old GCS Categories>Sun Microsystems>Storage - Disk>Modular Disk - 6xxx Arrays
  •  
  • .Old GCS Categories>Sun Microsystems>Storage - Disk>Modular Disk - Flexline FLX FLA FLC Arrays
  •  




In this Document
  Purpose
  Last Review Date
  Instructions for the Reader
  Troubleshooting Details


Applies to:

Sun Storage 6130 Array - Version: Not Applicable to Not Applicable - Release: N/A to N/A
Sun Storage 6540 Array - Version: Not Applicable to Not Applicable   [Release: N/A to N/A]
Sun Storage 6140 Array - Version: Not Applicable to Not Applicable   [Release: N/A to N/A]
Sun Storage Flexline 280 Array - Version: Not Applicable to Not Applicable   [Release: N/A to N/A]
Sun Storage Flexline 210 Array - Version: Not Applicable to Not Applicable   [Release: N/A to N/A]
Information in this document applies to any platform.

Purpose

The purpose of this document is to help users identify common problems with batteries in the Sun Storage Flexline 210, 240, 280, 380, StorageTek 6130, StorageTek 6140 or StorageTek 6540 arrays. These batteries provide power to the controller's data cache in the event of a power outage. If you have a StorageTek 2500, Sun Storage 2500-M2, 6x80 array, please refer to <Document 1021054.1> Troubleshooting Sun Storage[TM] Array SMART Battery Faults.

Symptoms Include

The following table contains the most common faults seen by Common Array Manager (CAM) or SANtricity:

Grid ID
CAM Critical Fault
SANtricity Critical Fault
xx.66.1005
Battery Near Expiration
BATTERY_NEAR_EXPIRATION
xx.66.1006
Critical Fault for a Cache Backup Battery has Failed
FAILED_BATTERY
xx.66.1039
Controller Cache Battery Near Expiration
NON_FRU_BATTERY_NEAR_EXPIRATION INTEGRATED_BATTERY_NEAR_EXPIRATION
xx.66.1040
A Controller Cache Backup Battery has Failed
NON_FRU_FAILED_BATTERY or FAILED_INTEGRATED_BATTERY
xx.66.1091
Battery Tray.xx.Battery.xx has transitioned to an Unknown State
BATTERY_UNKNOWN_STATE
xx.66.1101
There has been a failure in the ICC Battery pack
FAILED_BATTERY_SYSTEM
xx.66.1254
Critical Fault for Battery has expired
EXPIRED_BATTERY
xx.66.1255
Critical Fault for Battery has expired
EXPIRED_INTEGRATED_BATTERY


Other possible conditions include:
  • You just replaced the battery, but it still shows failed.
  • Amber LED lit on battery.
  • Amber LED lit on array.
Batteries are monitored by two methods, an Expiration Timer and SMART (Self-Monitoring, Analysis and Reporting Technology) battery technology.  Within the scope of this document only the Expiration Timer is used.

The Expiration Timer is a simple counter whereas the newer SMART battery technology internally tests the ability of the batteries to hold a charge.  Both are used to determine battery replacement but the Expiration Timer is also susceptible to outside conditions which can lead to reports of premature failures.

Last Review Date

January 10, 2012

Instructions for the Reader

A Troubleshooting Guide is provided to assist in debugging a specific issue. When possible, diagnostic tools are included in the document to assist in troubleshooting.

Troubleshooting Details

  1. Verify the Array Type, Firmware Version and Fault.

    Since the steps to resolve battery issues will differ based on the Hardware and Firmware involved, it is necessary to gather this information in order to determine the proper troubleshooting steps.

    • To determine the Array type, see <Document 1021066.1> Verify Sun Storage [TM] Array Array type via the User Interface.
    • To determine the Firmware version, see <Document 1021067.1> Verify Storage[TM] Array Firmware via the user Interface.
    • To determine the faults, see <Document 1021057.1> Verify Sun StorageTek[TM] 2500 and Sun Storage[TM] Critical Faults via the User Interface.

    If there are no faults, go to step 5.

    The following table lists the most common faults associated with batteries.  If you have an array with redundant batteries and both batteries have a fault, each fault should be evaluated on it's own.  Sometimes a single remedy will fix multiple faults.  If you have a single battery with multiple faults, go to step 6, contact Oracle support.

    Critical Fault
    Array Type
    Firmware Version
    Remedy
    Battery Near Expiration
    6130/FLXxx0/6x40
    Any
    Go to step 2.
    Battery Expired
    6130/FLXxx0/6x40AnyGo to step 2.
    Over Temperature
    6130/FLXxx0/6x40AnyGo to step 6.
    Replacement Required
    6130/FLXxx0/6x40AnyGo to step 6.
    Battery Failed
    6130/FLXxx0/6x40AnyGo to step 2.
    Unknown
    6130/FLXxx0/6x40AnyGo to step 6.

  2. Confirm Array System Time is correct.

    Batteries that have the Expiration Timer active are subject to premature failures if the array system time gets improperly set. Typically this is the result of a rogue NTP server.  Use <Document 1021108.1> Verifying and Setting Sun Storage[TM] Array System Time, to verify the array system time.  If the system time is incorrect, search the majorEventLog.txt file (from supportdata bundle) to see if a rogue NTP server is the cause.

    Example:

        # grep NTP majorEventLog.txt
        Description: Controller clocks set via NTP or SNTP
        Description: Controller clocks set via NTP or SNTP
        #

    If you find any instances of the above, you can reset the array system time but the problem is likely to return unless the rogue NTP server is addressed.  Isolating the array(s) and management host on their own private network is the best way to avoid a rogue NTP server.  Reset the array system time and wait 5 minutes.

    • If the critical fault clears, no further action is needed.
    • If the critical fault remains and you recently replaced the battery, go to step 3.
    • If the critical fault remains, the array system time is correct and the battery has not been recently replaced, it is likely time to replace the battery.  Go to step 6.

  3. Reset the Battery Age.

    Go to <Document 1021695.1> Resetting the Battery Age for a StorageTek[TM] 2500 and Sun Storage[TM] 6000 Array.  If after resetting the battery age the problem is still unresolved, it may be necessary to reseat the battery to clear the failed state.  Proceed to step 4.

  4. Reseating the Battery.

    The 6x40 and FLXx80 arrays have batteries that are externally accessible and this can be done without any impact to data access.  Flexline 210, 240 and 6130 arrays will require a controller reboot which means a temporary loss of a data path.  Users may want to schedule an outage for this but it is not a requirement. 

    To reseat the battery follow the service steps for battery replacement to remove and reinsert the battery. 

     

    • For CAM users see Service Advisor
    • For SANtricity users see Recovery Guru

    If you have to reboot the controller

    • For CAM users select the array from the Storage System Summary page, then expand the Physical Devices and select Controllers in the Navigation Tree.  Scroll down to the proper controller and select Reset Controller.  Confirm to execute the reboot.
    • For SANtricity users select the Logical/Physical View tab in the Array Management Window and highlight/select the controller in the Physical window.  From the menu bar, select Advanced then Recovery, then Reset and finally Controller.  Confirm to execute the reboot.

    If the problem still is not resolved, go to step 6.

  5. Battery in an Unknown State.

    Go to <Document 1283914.1> Troubleshooting Sun Storage[TM] Array Unknown Battery Status for further troubleshooting.

  6. Contact Oracle Support.

    Collect a supportdata:

    • For CAM users, reference <Document 1002514.1> Collection Support Data for Arrays Using Sun StorageTek[TM] Common Array Manager.
    • For SANtricity users, reference <Document 1014074.1> Collecting Support Data for Arrays Using Sun StorageTek[TM] SANtricity Storage Manager.

    Log a Service Request



Attachments
This solution has no attachment
  Copyright © 2012 Sun Microsystems, Inc.  All rights reserved.
 Feedback