Sun Microsystems, Inc.  Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition

Asset ID: 1-73-1404618.1
Update Date: 2012-02-14
Keywords:

Solution Type  FAB (standard) Sure

Solution  1404618.1 :   FCO A0320-1: Proactive: 3.3v DC-DC Converter failures on some Sun SPARC Enterprise M5000 systems.  


Related Items
  • Sun SPARC Enterprise M5000 Server
Related Categories
  • PLA-Support>Sun Systems>Sun_Other>Sun Collections>SN-OTH: Sun FAB




In this Document
  Symptoms
  Changes
  Cause
  Solution
  References


Oracle Confidential (PARTNER). Do not distribute to customers
Reason: FABs available to Internals and Partners only

Applies to:

Sun SPARC Enterprise M5000 Server - Version: Not Applicable to Not Applicable - Release: N/A to N/A
Information in this document applies to any platform.
__________

Affected Parts:

300-2363 - 3.3V DC-DC Converter, DDC_B

Symptoms

The 3.3V DC to DC Converters (part number 300-2363) used on I/O Units (part number 541-2240) and shipped in the Sun SPARC Enterprise M5000 may fail under high PCI load conditions, resulting in system availability issues.

Impact

This will cause the I/O Units (IOUs) to fail and the system to become unusable.

This issue impacts only a small, identified number of Sun SPARC Enterprise M5000 systems. Services spares are not impacted because the affected part never shipped as a spare.

Changes

Contributing Factors

Affected M5000 systems that shipped to customers were not ordered with full configurations. These systems will not experience this failure until additional PCI adapters are added and the system experiences high PCI loads. The number of installed PCI cards and the exact load that will lead to this failure are not known.

There is no evidence that the M4000 is impacted by this issue.

Cause

Newly qualified 3.3V DC-DC Converters (p/n 300-2363) may experience a signal integrity issue under heavy PCI load conditions. These new converters were not tested at full load in Manufacturing. This omission was recognized soon after the converters were introduced, and a StopShip/Purge was initiated on November 14, 2011, at which time M5000 systems resumed shipping with the previous converters (p/n 300-2010).

Root cause analysis to determine why some units fail under heavy load is currently underway with the supplier. Until it is complete, the 300-2363 has been disqualified for use in the M5000.

Solution

Target Completion Date: 31-Jul-2012

Workaround

No workaround available - see Resolution section below.

Resolution

Hot Swappable: No
Replacement Time: 30 minutes

Using the customer list referenced below, proactively replace all affected 3.3V DC-DC Converters (p/n 300-2363) with part number 300-2010 until the Target Completion Date above. Each impacted M5000 system contains 2 affected converters.

After the above Target Completion Date systems should only be remediated per standard break-fix processes.
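
The date rule above can be expressed as a small helper, for example when triaging a list of systems. This is only an illustrative sketch: the function name and the returned labels are hypothetical, and it assumes the Target Completion Date itself still counts as inside the proactive window, which this document does not state explicitly. Only the date and the part numbers come from this FAB.

```python
from datetime import date

# Values taken from this FAB.
TARGET_COMPLETION = date(2012, 7, 31)
AFFECTED_PN = "300-2363"
REPLACEMENT_PN = "300-2010"


def remediation_mode(installed_pn: str, today: date) -> str:
    """Return an illustrative label for how a converter should be handled.

    Assumption: the target date is treated as inclusive.
    """
    if installed_pn != AFFECTED_PN:
        # Only p/n 300-2363 is affected; 300-2010 is not.
        return "not affected"
    if today <= TARGET_COMPLETION:
        return "proactive replace with " + REPLACEMENT_PN
    # After the target date, standard break-fix processes apply.
    return "break-fix only"
```

Because there is no electronic means of identifying a suspect converter, `installed_pn` in such a helper would have to come from a visual inspection record.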

A Customer Ready document and Cover Sheet are available as attachments to this knowledge article.

The Service Manual includes DDCR replacement instructions in section 8.4, available at the URL below:

  http://docs.oracle.com/cd/E19580-01/819-2210-14/819-2210-14.pdf


Identification of Affected Parts (how to):

Only Fuji Electric DC-DC Converters with part number 300-2363 installed in M5000 servers are impacted. The previous DC-DC Converter (p/n 300-2010) is not impacted by this issue.   Visual identification is required as there is no means of identifying a suspect converter electronically.

The previously attached Customer List has been removed and is now available at the (internal only) URL below:

  Customer List

...to be used to proactively contact customers with affected systems. This customer list has two tabs. The first tab contains customer information taken from contract and SR data and should be used first to locate affected systems. The second tab contains customer information from shipment data and should be used if affected systems are not located using the first tab.
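
The lookup order described above (contract and SR data first, shipment data as a fallback) can be sketched as follows, assuming each tab has been exported to a list of rows. The function name, the `serial` field, and the source labels are hypothetical; the actual column layout of the customer list is not described in this document.

```python
from typing import Optional


def locate_customer(
    serial: str,
    contract_rows: list[dict],
    shipment_rows: list[dict],
) -> Optional[tuple[str, dict]]:
    """Find a system by serial number, preferring the contract/SR tab.

    Returns (source_label, row), or None if the system is on neither tab.
    The "serial" key is an assumed column name.
    """
    # Search the tabs in the order this FAB prescribes.
    for source, rows in (("contract/SR", contract_rows), ("shipment", shipment_rows)):
        for row in rows:
            if row.get("serial") == serial:
                return source, row
    return None
```

Returning the source label alongside the row keeps it visible whether the contact details came from contract and SR data or only from shipment data.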

To help identify the affected part a picture is attached to this knowledge article.

Hardware Remediation and Material Availability Details:

As Service Logistics spares were not impacted by this issue, and due to the small number of impacted systems, current RSL stock will be used for this proactive remediation and no additional material will be ordered.

Comments:

Replacement time of 30 minutes does not include the time to shut down and start up the domain. Total on-site time is estimated at 1 hour.

References:

WW Stop Ship: PA999#02872.A
Other: NCAT-170

For information about FAB documents, their release processes, implementation strategies and billing information, click here.

In addition to the above you may email:

[email protected]


Contacts:

Contributor: [email protected], [email protected]
Responsible Engineer: [email protected]
Responsible Manager: [email protected]
Business Unit Group: Systems Group-OPL

References

FAB > Hardware Remediation > Mandatory

Attachments
This solution has no attachment
  Copyright © 2012 Sun Microsystems, Inc.  All rights reserved.