Document Audience: | INTERNAL |
Document ID: | A0223-1 |
Title: | On Sun systems, a small number of 256MB DIMMs may experience Uncorrectable Memory Errors (UE). |
Copyright Notice: | Copyright © 2007 Sun Microsystems, Inc. All Rights Reserved |
Update Date: | Tue Mar 23 00:00:00 MST 2004 |
----------------------------------------------------------------------------
- Sun Proprietary/Confidential: Internal Use Only -
----------------------------------------------------------------------------
FIELD CHANGE ORDER
(For Authorized Distribution by Enterprise Services)
FCO #: A0223-1
Status: inactive
Synopsis: On Sun systems, a small number of 256MB DIMMs may experience Uncorrectable Memory Errors (UE).Date: Mar/23/2004
SunAlert: Yes
Top FIN/FCO Report: No
Products Reference: 256MB Samsung B-die DIMMs
Product Category: Server / System Component
Product Affected:
Systems Affected:
Mkt_ID Platform Model Description
------ -------- ----- -----------
- N28 - Netra 20
- A35 - Sun Fire 280R
- A28 - Sun Blade 1000
- A29 - Sun Blade 2000
- A37 - Sun Fire V480
- A30 - Sun Fire V880
- S8 - Sun Fire 3800
- S12 - Sun Fire 4800
- S12i - Sun Fire 4810
- S24 - Sun Fire 6800
- F12K - Sun Fire 12K
- F15K - Sun Fire 15K
X-Options Affected:
Mkt_ID Platform Model Description
------ -------- ----- -----------
x7053A - - 1GB Memory Expansion
Parts Affected:
Part Number Description
---------- -----------
501-5401-02 ASSY SDRAM DIMM 256MB
501-5401-03 ASSY SDRAM DIMM 256MB
Type Vendor Model SerialNumber(Min) SerialNumber(Max)
---- ------ ------- ----------------- -----------------
Memory Samsung B-die 501540178190000 501540178310000
References:
Sun Alert: 50765
DPCO: 363
WWStopShip: P001-20100
Issue Description:
Sun systems containing 256MB Samsung B-die DIMMs, having a module date code
between 0115 and 0127 (built between weeks 15 and 27 of 2001), may experience
Uncorrectable Memory Errors (UE). This can lead to System Panics.
The approximate overall DIMM serial number range is:
501540178190000 to 501540178310000
A physical check can be performed on the Samsung part number (also given on a
white label on the DIMMs) to verify it is a B-die. The below example part
number would be for a Samsung B-die DIMM:
M323S1742BT2-C1LS0
^
|
Denotes B-die
C-die DIMMs would have a 'C' in place of 'B' above.
Customer systems would see multiple uncorrectable memory failures in a
single machine.
The following are examples of the types of messages that may be seen as
a result of a system panic:
Server paniced with messages Oct 14 05:10:15 r8xpc25 unix: [ID 908439
kern.notice] [AFT0] Multiple Softerrors: ...
156 Intermittent, 8 Persistent, and 92 Sticky Softerrors
...
WARNING: [AFT1] Uncorrectable system bus (UE) Event on CPU0
Privileged Data Ac cess at TL=0, ... AFSR 0x00100004.00000027
AFAR 0x00000040.02de0040 Oct 14 ...
Fault_PC 0x10023f64 Esynd 0x0027 Slot B: J7900 J7901 J8001 J8000 ...
WARNING: [AFT1] EDU Event on CPU0 at TL=0, errID 0x00000a4d.89938ce0
AFSR 0x00000028.00000027 AFAR 0
ECCerrors SB0/P1 J14500 501-5401 78275566 + panic 25/09 ...
ECC errors SB0/P1 J14300 501-5401 6532879A + panic 28/10 ...
failed POST 20 SB1/P3 J16300 501-5401 78275564 12/11 36719569
failed POST SB1/P0 J13600 501-5401 TBC 29/11 36733623 choeaci0
ECC errors SB0/P3 J16500 501-5401 78274862 + panic 02/12 36734792 ...
ECC errors SB0/P0 J13600 501-5401 78275549 ...
...
WDU Event on CPU0 at TL=0, errID 0x00000a4d.89938ce0 ...
AFAR 0x00000040.02de0040 AMBIGUOUS ...
UE EDU WDU Error(s) ...
WARNING: [AFT1] Uncorrectable system bus (UE) Event on CPU1 ...
Privileged Data Access at TL=0,errID ...
Privileged Data Access at ...
Root cause was determined to be a resistive Word line via on a DRAM chip.
It was not possible to generate an accurate list of affected DIMM serial
numbers. The most complete way to bound this issue is to use the date code
range given above.
Corrective action was made available in Sun Manufacturing on December 1,
2001 via Worldwide Stopship/Purge P001-20100. Corrective action was made
available in Sun Services via DPCO# 363.
Implementation:
---
| | MANDATORY (Fully Pro-Active)
---
---
| | CONTROLLED PRO-ACTIVE (per Sun Geo Plan)
---
---
| X | UPON FAILURE
---
Replacement Time Estimate:
1.0 hours
Special Considerations:
In order to determine the serial numbers of DIMMs contained in the
system, visit the web site below and enter the SN of the system in question:
http://gscc/metrics/cgi/bdie/report.cgi
If your system is identified as having previously failed with a memory error
in the last 6 months, then for Serengeti platforms (SF 3800/48x0/6800), follow
the instructions at:
http://sdpsweb.central/FIN_FCO/FCO/FCO_A0223-1_Dir/SPE/Serengeti_README.txt
http://sdpsweb.central/FIN_FCO/FCO/FCO_A0223-1_Dir/SPE/fruid_serengeti.tar
For all other platforms, follow the instructions at:
http://sdpsweb.central/FIN_FCO/FCO/FCO_A0223-1_Dir/SPE/Generic_README.txt
http://sdpsweb.central/FIN_FCO/FCO/FCO_A0223-1_Dir/SPE/fruid_generic.tar
Corrective Action:
Based on the criteria below:
replace 501-5401-02 or -03 within affected serial number range
with 501-5401-03 (or above) outside of affected range
Using the above URLs, proactively replace Samsung B-die DIMMs within the
affected date code range shown above, ONLY if the customer has experienced at
least 2 Uncorrectable Errors in a single system within 6 months for Enterprise
Servers (Sun Fire 3800/4800/4810/6800, Sun Fire 12K/15K) or High End Entry
Servers (Sun Fire V480/V880).
Due to the low number of affected DIMMs likely to reside in the Low End Entry
Servers/Workstations (Netra 20, Sun Fire 280R, Sun Blade 1000/2000), DIMMs
are only to be proactively replaced at the account team's discretion.
It is understood and accepted that at the discretion of the responsible
account teams, there may be instances where it is necessary to proactively
replace affected Samsung B-die DIMMs.
However, please file an escalation and get assistance from engineering before
making large proactive replacements at customer sites to determine if the
reported errors are due to this issue and within affected range.
All affected Samsung B-die DIMMs should NOT be scrapped locally and should
be returned in accordance with DPCO# 363 to the repair vendors as normal
procedures.
Comments:
Samsung 256MB DIMMs outside of the datecode range of 0115 to 0127 are NOT
affected by this issue.
Micron and Infineon are not affected.
All other 256MB DIMM vendors, and other part numbers not listed in this FCO,
are not affected.
--------------------------------------------------------------------------
Billing Type:
Warranty: Sun will provide parts at no charge under Warranty
Service. On-Site Labor Rates are based on how the
system was initially installed.
Contract: Sun will provide parts at no charge. On-Site Labor Rates
are based on the type of service contract.
Non Contract: Sun will provide parts at no charge. Installation by
Sun is available based on the On-Site Labor Rates
defined in the Price List.
--------------------------------------------------------------------------
Implementation Footnote:
________________________
i) In case of Mandatory FCOs, Sun Services will attempt to contact
all known customers to recommend the part upgrade.
ii) For controlled proactive swap FCOs, Sun Services mission critical
support teams will initiate proactive swap efforts for their respective
accounts, as required.
iii) For Replace upon Failure FCOs, Sun Services partners will implement
the necessary corrective actions as and when they are required.
--------------------------------------------------------------------------
All released FINs and FCOs can be accessed using your favorite network
browser as follows:
SunWeb Access:
______________
* Access the top level URL of http://sdpsweb.Central/FIN_FCO/index.html
* From there, select the appropriate link to query or browse the FIN and
FCO Homepage collections.
SunSolve Online Access:
_______________________
* Access the SunSolve Online URL at http://sunsolve.Central/
* From there, select the appropriate link to browse the FIN or FCO index.
Internet Access:
_______________
* Access the top level URL of https://spe.sun.com
--------------------------------------------------------------------------
General:
________
Send questions or comments to [email protected]
---------------------------------------------------------------------------