Sun Microsystems, Inc.  Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1012768.1
Update Date:2009-09-14
Keywords:

Solution Type  Technical Instruction Sure

Solution  1012768.1 :   How to tell whether over temperature in Sun StorEdge[TM] 99x0 is genuine or false alarm.  


Related Items
  • Sun Storage 9990 System
  •  
  • Sun Storage 9985 System
  •  
Related Categories
  • GCS>Sun Microsystems>Storage - Disk>Datacenter Disk
  •  

PreviouslyPublishedAs
217539


Description
This document attempt to provide simple logically way to identify and judge whether over temperature SIM (System Information Message) received is genune
or false alarm cause by related hardware components failing.  It is inspire
by an escalation from field which query why no 45C (BF12xx or BF13xx) over temperature was reported before but 60C (BF10xx or BF11xx).  Logically the
45C warning should come first before the 60C.
Hence, it brought to the mind how do we tell if the SIMs (system messages)
generated for over temperature is genuine or brought about by faulty components
like THF (Thermo sensor) or the DKCMN (Disk Controller Monitor board)


Steps to Follow
For over temperature protection, the Sun StorEdge[TM] 99x0 is monitor by 2 kind of thermal Sensor (THF).  Two sensors each to detect 45C and 60C over temperature.  These are strategically located with 2 in the front and 2 in the back of the cabinet for Cluster 1 and cluster 2 of the DKC.  For details on THF please reference the location and replacement maintenance manual (MM).  The precise component that resulted in this category of SIM can be identified by
SIM = BF1010 = 60C overtemp detected by sensor in the front Logic 1 (CL1)
SIM = BF1011 = 60C overtemp detected by sensor in the front Logic 2 (CL2)
SIM = BF1110 = 60C overtemp detected by sensor in the rear Logic 1 (CL1)
SIM = BF1111 = 60C overtemp detected by sensor in the rear Logic 2 (CL2)
SIM = BF1210 = 45C overtemp detected by sensor in the front Logic 1 (CL1)
SIM = BF1211 = 45C overtemp detected by sensor in the front Logic 2 (CL2)
SIM = BF1310 = 45C overtemp detected by sensor in the rear Logic 1 (CL1)
SIM = BF1311 = 45C overtemp detected by sensor in the rear Logic 2 (CL2)
( SIMs are documented in the Maintenance Manual RC section )
Therefore, if there is a genuine over temperature building up in the DKC
cabinet, logically one would expect the progressive series of SIMs from 45C
(BF12xx and or BF13xx) showing up before the 60C (BF10xx and or BF11xx).
We should also seem if not all the category of over temperature SIMs listed
above appearing in the logs.
However, if only one particular SIMs reference code (eg: BF1011) show up,
then it could be ascertain that likely a hardware component failure rather
then over temperature.  Possibly the temperature sensor is at fault and it
could also be the DKCMN that it is connected to be the one.
Note: Only when 45C and 60C sensors detect overtemp in both CL1 and CL2, the subsystem will power off.  The same conditions of subsystem power off due to
over temperature apply to 9900,9900V, USP and NSC.


Product
Sun StorageTek 9900 Entry Level Disk Adapter
Sun StorageTek 9990 System
Sun StorageTek 9985 System
Sun StorageTek 9900V Series Array

Internal Comments
Document built from referencing article,

HDS ID:hds4 - Specification of Overtemp Power-Off in 9960, 9910

Apollo Escalation: 1-16368059

MM Replace: Maintenance Manual SIM-RC for Sim code.
Maintenance Manual Replacement (Hardware T6) for THF location and
detail.

SE9900, Overtemperature, BF1010, BF1011, BF1110, BF1111, BF1210, BF1211, BF1310, BF1311
Previously Published As
85549

Change History
Date: 2006-05-11
User Name: 111868
Action: Approved
Comment: checked keywords
checked TM - corrected
checked expiry date
checked audience - corrected. was "free" is "contract" as per Fee vsFree policy (http://kmo.central/howto/FvF.html)
publishing
Version: 3
Date: 2006-05-11
User Name: 111868
Action: Accept
Comment:
Version: 0
Date: 2006-05-11
User Name: 90045
Action: Approved
Comment: No complain at all
Version: 0
Date: 2006-05-05
User Name: 90045
Action: Accept
Comment:
Version: 0
Date: 2006-05-04
User Name: 128938
Action: Approved
Comment: I believe this document is useful to gauge whether we have genuine over temperature situation or false alarm due to failing temperature monitoring component.
Version: 0
Date: 2006-05-04
User Name: 128938
Action: Created
Comment:
Version: 0
Product_uuid
6e8de98a-2fdc-11d7-9ffe-db2c2493c51c|Sun StorageTek 9900 Entry Level Disk Adapter
15644879-f179-11d8-ab63-080020a9ed93|Sun StorageTek 9990 System
73084edc-dc32-11d9-8a74-080020a9ed93|Sun StorageTek 9985 System
f55c65ae-16a0-11d7-82ad-99701a4c6e31|Sun StorageTek 9900V Series Array

Attachments
This solution has no attachment
  Copyright © 2011 Sun Microsystems, Inc.  All rights reserved.
 Feedback