![]() | Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition | ||
|
|
![]() |
||||||||||||||||||||
Solution Type Sun Alert Sure Solution 1458754.1 : All M-Series Systems XSCFUs May Fail and/or Halt Due to Berkeley DataBase Corruption Without System Firmware Upgrade Version 1112 on SPARC Enterprise M8000/M9000-32/M9000-64 Servers, or a Minimum Version of 1113 on M3000/M4000/M5000 Servers
___________________________________ Date of Resolved Release: 16-May-2012 ___________________________________ In this Document
Applies to:Sun SPARC Enterprise M3000 ServerSun SPARC Enterprise M4000 Server Sun SPARC Enterprise M5000 Server Sun SPARC Enterprise M9000-32 Server Sun SPARC Enterprise M8000 Server Information in this document applies to any platform. ___________________________________ SUNBUG:7162656 SUNBUG:7180251 Date of Workaround Release: 16-May-2012 Date of Resolved Release: 23-Jul-2012 ___________________________________ DescriptionMandatory firmware upgrade to1112 on Sun SPARC Enterprise M8000/M9000-32/M9000-64 Servers, and minimum firmware upgrade to 1113 on Sun SPARC Enterprise M3000/M4000/M5000 Servers is required for all M-series systems. Failure to do so may result in the XSCFUs used in the M-series platforms to fail and/or halt due to internal Berkeley DataBase (BDB) corruption. OccurrenceThis issue can occur on the following platforms:
To determine the XCP firmware version on one of these systems, the following command can be used: XSCF> version -c xcp XCP 1112 output will appear similar to the following:
SymptomsBehavior is dependent on the XCP firmware revision. Upon detecting a BDB corruption during the boot process, a XSCF will exhibit symptoms similar to the following: WorkaroundThere is no workaround for this issue.
XCP 1112 and 1113 firmware is available for download at: Note 1: It is advised that your XCP firmware (XCP<1092) be updated as soon as possible to avoid unexpected domain outages. XSCFU reboots should be avoided until the XCP firmware update has been completed. For XCP firmware > 1092: update the firmware at your next scheduled maintenance window to avoid potential loss of domain console access. Note 2: The initial release of this document stated that this issue was fixed for all affected servers by XCP 1112. However, it was later found that the fix in XCP 1112 was incomplete for M3000, M4000 and M5000 Servers. The completion of the fix for these servers was delivered in XCP 1113. History16-May-2012: Document created, Resolved release 05-Jun-2012: Updated the Description and Workaround sections. 26-Jun-2012: Updated document status, Occurrence, and Workaround sections. 23-Jul-2012: Updated Occurrence, and Workaround sections. Resolved. 25-Jul-2012: Updated Document title and description for clarification. 29-Jul-2012: Added note to the Workaround section.
NOTE: The XCP 1112 firmware does not fix this issue on the Sun SPARC Enterprise M3000, M4000, M5000 as initially thought. This issue will be addressed in the XCP 1113 firmware version for these systems. This Sun Alert will be updated when the firmware version that addresses this issue on the Sun SPARC Enterprise M3000, M4000, M5000 systems is available. This issue is reported, and all escalations documented under CR 7078506. CR 7162656 is the software workaround for this issue. Note: Services (Field and TSC) are not required to proactively create SRs for customer upgrades. There is no method to proactively check if BDB corruption has occurred.
Feb 6 08:28:04 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /OPNL Feb 6 08:28:05 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /PSU#0 Feb 6 08:28:08 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /PSU#1 Feb 6 08:28:09 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /PSU#2 Feb 6 08:28:11 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /PSU#3 Feb 6 08:28:13 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /FANBP_C Feb 6 08:28:14 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /MBU_B Feb 6 08:28:15 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /MBU_B/CPUM#0 Feb 6 08:28:15 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /MBU_B/CPUM#0 Feb 6 08:28:16 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /MBU_B/CPUM#1 Feb 6 08:28:17 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /MBU_B/CPUM#1 Feb 6 08:28:18 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /MBU_B/MEMB#0 Feb 6 08:28:18 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /MBU_B/MEMB#1 Feb 6 08:28:19 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /MBU_B/MEMB#2 Feb 6 08:28:19 m5000-xscf0 monitor_msg: SCF:Unit configuration change(add) /MBU_B/MEMB#3 In "showlogs power": Feb 06 08:28:02 GMT 2012 SCF Reset Power On -- Locked Feb 06 08:28:09 GMT 2012 System Power On Pow.Fail/Recov.-- Locked Feb 06 08:29:08 GMT 2012 Domain Power On Pow.Fail/Recov.00 Locked Feb 06 08:29:10 GMT 2012 Domain Power On Pow.Fail/Recov.01 Locked. “Unit configuration change (add)” messages are normal at first chassis power-on, upon XSCFU replacement, or when hardware is added. The distinguishing difference for BDB corruption triggered reconstruction is when the messages occur coincident with the outage of a running domain or XSCF reset. Attachments This solution has no attachment |
||||||||||||||||||||
|