Sun Microsystems, Inc.  Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition

Asset ID: 1-72-1332949.1
Update Date:2012-02-03
Keywords:

Solution Type  Problem Resolution Sure

Solution  1332949.1 :   Grid Control falsely indicates Exadata Power Supply sensor and Voltage sensor problems.  


Related Items
  • Enterprise Manager for Miscellaneous Plug-ins
  • Exadata Database Machine X2-2 Hardware
Related Categories
  • PLA-Support>Database Technology>Engineered Systems>Oracle Exadata>DB: Exadata_EST

In this Document
  Symptoms
  Changes
  Cause
  Solution
  References


Created from <SR 3-3832791971>

Applies to:

Enterprise Manager for Miscellaneous Plug-ins - Version: 11.1.0.1 and later   [Release: 11.1 and later ]
Exadata Database Machine X2-2 Hardware - Version: Not Applicable and later    [Release: N/A and later]
Linux x86-64

Symptoms

After Enterprise Manager Grid Control monitoring is implemented on an Exadata rack, the following alerts are raised for both the Compute Nodes and the Storage Cells:

CRITICAL - Power Supply sensor #0x74 indicates that it is at state: Asserted;Power Supply sensor #0x75 indicates that it is at state: Asserted

CRITICAL - Voltage sensor #0x6d indicates that it is at state: Lower Non-recoverable
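
When triaging many such alerts, it can help to break each message into its sensor type, sensor ID, and reported state. The following is a minimal Python sketch that assumes the alert text always follows the two samples shown above (severity, " - ", then one or more semicolon-separated sensor readings). The function name parse_alert and the tuple layout are illustrative only and are not part of Grid Control or ILOM.

```python
import re

# Pattern for one reading, modelled on the sample alerts in this note.
# Assumption: every reading has the form
#   "<kind> sensor #0x<hex> indicates that it is at state: <state>"
SENSOR_RE = re.compile(
    r"(?P<kind>.+?) sensor #(?P<sensor_id>0x[0-9a-fA-F]+) "
    r"indicates that it is at state: (?P<state>.+)"
)


def parse_alert(message):
    """Split an alert into (severity, [(kind, sensor_id, state), ...])."""
    # The severity is everything before the first " - " separator.
    severity, _, body = message.partition(" - ")
    readings = []
    # Several readings may be joined with ";" in a single alert.
    for part in body.split(";"):
        match = SENSOR_RE.fullmatch(part.strip())
        if match:
            readings.append(
                (match["kind"], int(match["sensor_id"], 16), match["state"])
            )
    return severity.strip(), readings


if __name__ == "__main__":
    sample = (
        "CRITICAL - Voltage sensor #0x6d indicates that it is at state: "
        "Lower Non-recoverable"
    )
    print(parse_alert(sample))
```

Readings that do not match the assumed pattern are skipped rather than raising an error, so an unexpected message format yields an empty list instead of a failure.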

Changes

Recently implemented Enterprise Manager Grid Control monitoring.

Cause

If the ILOM version on an affected system is 3.0.14.11.b r62978 or newer:
You are encountering <Bug:12544896> "FALSE ALERT - CRITICAL - VOLTAGE SENSOR".

If the ILOM version on an affected system is older than 3.0.14.11.b r62978:
You are encountering the issue described in MOS <Document 1310539.1> "Exadata ILOM memory leak in pre-3.0.14 ILOM firmware".
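
The version check above can be sketched in Python as follows. This is only an illustration: the ordering rule (compare the dotted numbers first, then the letter suffix, then the r-number) is an assumption about how ILOM version strings sort, and the helper names ilom_key and likely_cause are hypothetical. The older version string in the usage example is a made-up sample, not a confirmed release.

```python
import re

# Threshold version taken from this note.
THRESHOLD = "3.0.14.11.b r62978"

BUG = "Bug 12544896 (false voltage sensor alert)"
LEAK = "Document 1310539.1 (ILOM memory leak in pre-3.0.14 firmware)"


def ilom_key(version):
    """Turn e.g. '3.0.14.11.b r62978' into a sortable tuple.

    Assumed layout: dotted numbers, an optional single-letter suffix,
    whitespace, then 'r' followed by a revision number.
    """
    match = re.fullmatch(
        r"\s*(\d+(?:\.\d+)*)(?:\.([a-z]))?\s+r(\d+)\s*", version
    )
    if match is None:
        raise ValueError(f"unrecognised ILOM version string: {version!r}")
    numbers = tuple(int(n) for n in match.group(1).split("."))
    letter = match.group(2) or ""
    revision = int(match.group(3))
    return numbers, letter, revision


def likely_cause(version):
    """Map an ILOM version to the cause this note associates with it."""
    if ilom_key(version) >= ilom_key(THRESHOLD):
        return BUG
    return LEAK


if __name__ == "__main__":
    # Second entry is a hypothetical older version used for illustration.
    for v in ("3.0.14.11.b r62978", "3.0.9.19.a r55943"):
        print(v, "->", likely_cause(v))
```

The version string itself still has to be read from the ILOM of each affected Compute Node or Storage Cell; this sketch only covers the comparison.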

Solution

<Bug:12544896>, "FALSE ALERT - CRITICAL - VOLTAGE SENSOR", is currently being worked on by Oracle Development. No permanent fix is available yet.

For now, please ignore the false indications.

Workaround for now:

1.)  Review the ILOM alert log and ensure that all alerts are closed out.


To ensure that all alerts are closed out, the simplest approach is to correct the issue(s) listed in the ILOM event log. Alternatively, you can manually de-assert the alert using ipmitool, as described in <NOTE 1398378.1>, "ILOM Targets Raise Critical Power Supply Sensor Alerts In EM That Never Clear".


2.)  Then wait for EM to re-evaluate the ILOM alert metric (5 or 15 minutes, depending on the interval set by the user). The next time the Grid Control agent runs the sensor metric, it will clear the alert.

References

<BUG:12544896> - FALSE ALERT - CRITICAL - VOLTAGE SENSOR
<NOTE:1327022.1> - Update Exadata ILOM firmware manually from Storage Cell software patchset
<NOTE:1310539.1> - Exadata ILOM memory leak in pre-3.0.14 ILOM firmware
<NOTE:1398378.1> - ILOM Targets Raise Critical Power Supply Sensor Alerts In EM That Never Clear

Attachments
This solution has no attachment
  Copyright © 2012 Sun Microsystems, Inc.  All rights reserved.