Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition | |||
|
|
Solution Type Problem Resolution Sure Solution 1017844.1 : Sun Fire[TM] MidRange Server I/O Board (IB) power supply failures.
PreviouslyPublishedAs 229081
Applies to:Sun Fire 3800 Server - Version: Not ApplicableSun Fire 4800 Server - Version: Not Applicable and later [Release: N/A and later] Sun Fire 4810 Server - Version: Not Applicable and later [Release: N/A and later] Sun Fire 6800 Server - Version: Not Applicable and later [Release: N/A and later] Sun Fire E4900 Server - Version: Not Applicable and later [Release: N/A and later] All Platforms SymptomsSymptomsThis document pertains to I/O Board (IB) power failures in Sun Fire[TM] servers. These boards can be fail with scenarios similar to the following (note that the voltages reported will differ from case to case, and so to the IB location): showenvironment may report ERROR LOW for the particular device and should report how low the voltage is as well, for example look at IB9 below:
Errors seen in operation resulting in a domain outage may be like this:
or also like this:
or perhaps like this:
Lastly, an error of this type in POST may appear as follows:
NOTEs:
CauseThis error indicates a power failure of the I/O Board.
The board needs to be replaced.
SolutionIn order to replace the board, as a customer, you need to open a Service Request with Oracle Support Services and schedule to have the IB or IB_SSC replaced.
Recommended Action Plan for Sun Support Services Engineers:1) Validate the error messages are as described in this article. If there are multiple boards showing this error at the same time, escalate the issue instead of proceeding. 2) Verify the type of IB or IB_SSC involved (see showboards output which will identify the I/O Board as PCI, PCI+, or PCI-X). 3) Make sure the System Controller (SC) is at ScApp 5.20.3 or higher (prefer HIGHER) to avoid CR 6300392 if the configuration includes adjacent domains. See "Additional Information" section of this article for details. 4) Dispatch the IB or IB_SSC replacement per normal process. Additional Information"Adjacent Domain Issue"Testing has shown that an I/O Board failure can result in an outage of an adjacent domain if your version of ScApp is below 5.20.3.
We found that the timing of the adjacent domain reboot is inconsequential. The domain could be rebooted 10 minutes or 10 months following the IB failure and the result would be the same. Fortunately, we confirmed a workaround to avoid this situation (prior to resolving it via ScApp update):
So, in summary, if you encounter an IB Power failure, and you have adjacent domains, and you are on a version of ScApp LESS then 5.20.3 - reboot the Main SC proactively to avoid an adjacent domain issue. The better advice is to upgrade your version of ScApp to avoid the issue altogether via patch 114527.Internal Information References BugID 6401739 The part number (non-FRU) of the D108 power supply is 300-1345. D108 information: The main issue discussed within this document is known as the D108 Power Supply failure. The D108 is the DC-DC converter on the PCI boards for these servers. If there is any concern mentioned with regards to repeat I/O Board power failures, make sure to confirm that the replacement board is at least part number 540-4616-05 or at least part number 540-4591-04 depending on which part is needed. Escalate to the next level of technical support if you have questions with regards to this document. Previously Published As 83696 Attachments This solution has no attachment |
||||||||||||
|