Sun Microsystems, Inc.  Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1452873.1
Update Date:2012-09-28
Keywords:

Solution Type  Technical Instruction Sure

Solution  1452873.1 :   How to Recognize and Diagnose a Sun Fire T2000/Sun Blade T6300 Service Processor Battery Failure  


Related Items
  • Sun Fire T2000 Server
  •  
  • Sun Blade T6300 Server Module
  •  
  • Sun Netra T2000 Server
  •  
  • Sun SPARC Enterprise T2000 Server
  •  
Related Categories
  • PLA-Support>Sun Systems>SPARC>CMT>SN-SPARC: Tx000
  •  


The following document provides instructions on recognizing and properly diagnosing a Sun Fire T2000 Service Processor Battery Failure

Applies to:

Sun Fire T2000 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun SPARC Enterprise T2000 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun Netra T2000 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun Blade T6300 Server Module - Version Not Applicable to Not Applicable [Release N/A]
Information in this document applies to any platform.

Goal

 The following document provides instructions on recognizing and properly diagnosing a Sun Fire T2000 Service Processor Battery Failure

Fix

 Troubleshooting Details:
Symptoms-

The following error messages may be seen on the System Controller (SP),  or lom console logs:

/var/adm/messages or showlogs -v:

APR 02 13:34:22: 00040068: "BATTERY at SC/BAT/V_BAT has exceeded low warning threshold."



Also, you may show faults reported on the SP via ALOM command "showfaults" and will see the faulted reported similar to:

sc> showfaults -v
Last POST run: TUE MAY 22 14:26:07 2007
POST status: Passed all devices

 ID Time              FRU               Fault
2627 FEB 03 21:48:56   SC/BAT            BATTERY at SC/BAT/V_BAT has exceeded low warning threshold.



When connected to ALOM on the Tx000 machine, running the "showenvironment" command at the sc prompt will provide you with greater detail of the environmental statuses of the basic system components installed in the machine such as:
System Temperatures, System Indicator Status, Fans Status, Voltage sensors (in Volts), etc., and will appear similar to the following:

sc> showenvironment


=============== Environmental Status ===============


--------------------------------------------------------------------------------
System Temperatures (Temperatures in Celsius):
--------------------------------------------------------------------------------
Sensor           Status  Temp LowHard LowSoft LowWarn HighWarn HighSoft HighHard
--------------------------------------------------------------------------------
PDB/T_AMB        OK        25    -10      -5       0      45       50       55
MB/T_AMB         OK        24    -10      -5       0      50       55       60
MB/CMP0/T_TCORE  OK        42    -10      -5       0      85       90       95
MB/CMP0/T_BCORE  OK        41    -10      -5       0      85       90       95
IOBD/IOB/TCORE   OK        39    -10      -5       0      95      100      105
IOBD/T_AMB       OK        29    -10      -5       0      52       57       62

--------------------------------------------------------
System Indicator Status:
--------------------------------------------------------
SYS/LOCATE           SYS/SERVICE          SYS/ACT             
OFF                  ON                   ON                  
--------------------------------------------------------
SYS/REAR_FAULT       SYS/TEMP_FAULT       SYS/TOP_FAN_FAULT   
ON                   OFF                  OFF                 

~snip~


--------------------------------------------------------------------------------
Voltage sensors (in Volts):
--------------------------------------------------------------------------------
Sensor          Status      Voltage LowSoft LowWarn HighWarn HighSoft
--------------------------------------------------------------------------------
MB/V_+1V5       OK            1.48    1.36    1.39    1.60     1.63
MB/V_VMEML      OK            1.79    1.63    1.67    1.92     1.98
MB/V_VMEMR      OK            1.79    1.63    1.67    1.92     1.98
MB/V_VTTL       OK            0.89    0.81    0.83    0.96     0.99
MB/V_VTTR       OK            0.87    0.81    0.83    0.96     0.99
MB/V_+3V3STBY   OK            3.34    3.13    3.16    3.53     3.59
MB/V_VCORE      OK            1.31    1.20    1.24    1.36     1.39
IOBD/V_+1V5     OK            1.48    1.36    1.39    1.60     1.63
IOBD/V_+1V8     OK            1.79    1.63    1.67    1.92     1.96
IOBD/V_+3V3MAIN OK            3.34    3.06    3.10    3.49     3.53
IOBD/V_+3V3STBY OK            3.36    3.13    3.16    3.53     3.59
IOBD/V_+1V      OK            1.18    1.09    1.11    1.28     1.30
IOBD/V_+1V2     OK            1.16    1.09    1.11    1.28     1.30
IOBD/V_+5V      OK            5.12    4.55    4.75    5.35     5.45
IOBD/V_-12V     OK          -12.11  -13.08  -12.84  -11.16   -10.92
IOBD/V_+12V     OK           12.00   10.92   11.16   12.84    13.08
SC/BAT/V_BAT    WARNING       0.57      --    2.25      --       --        <--- This is the voltage sensor reporting the current voltage status of the Service Processor Battery



As you will notice above under "Voltage sensors (in Volts):" section, for the service processor battery, we see the following key indicators of a battery failure:

Sensor          Status      Voltage LowSoft LowWarn HighWarn HighSoft
SC/BAT/V_BAT    WARNING       0.57      --    2.25      --       --

1- The current voltage of the SP battery is currently low (at only .57 Volts),
2- the "LowWarn" warning voltage is set to 2.25 (Volts), and when the voltage of the battery falls below the 2.25V threshold, it will generate a warning message which will be logged in the console logs and appear similar to the following:

APR 02 13:34:22: 00040068: "BATTERY at SC/BAT/V_BAT has exceeded low warning threshold."



Also, we will find the fault reported on the SP via ALOM command "showfaults" and will see the fault reported similar to:

sc> showfaults -v
Last POST run: TUE MAY 22 14:26:07 2007
POST status: Passed all devices

 ID Time              FRU               Fault
2627 FEB 03 21:48:56   SC/BAT            BATTERY at SC/BAT/V_BAT has exceeded low warning threshold.



Cause
The root cause of this issue is dropping voltage of the battery over time, per normal usage. The battery is located on the System Controller (SC), or System Processor (SP)

Solution
Replace the failing Service Processor Battery as soon as available.

Please Log an Oracle Service Request*** for this issue to request a new battery to be shipped for the replacement of the failing SP Battery of the server. When opening the SR, you should have ready, or be prepared, to provide (at a minimum) the following ALOM commands output, in order to quickly have this matter resolved:

  sc>  showhost
  sc>  showenvironment
  sc>  showlogs -v
  sc>  showfaults -v

*** Note: for greatest ease and fastest turn-around for this issue, when creating a Service Request(SR) via My Oracle Support Portal, please have a new/fresh Explorer file available for upload during SR creation. Please be sure to select "PSU, Fan, Battery failure" and then proceed to upload the ready Explorer file. ***

Upon receipt of the new SP Battery, please see the following document for detailed instructions on replacing the SP Battery:

How to Remove and Replace a Sun Fire T2000 Service Processor Battery:ATR:1180:1 [VIDEO] (Doc ID 1308278.1)


Attachments
This solution has no attachment
  Copyright © 2012 Sun Microsystems, Inc.  All rights reserved.
 Feedback