Sun Microsystems, Inc.  Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1359338.1
Update Date:2012-10-11
Keywords:

Solution Type  Problem Resolution Sure

Solution  1359338.1 :   Sun Fire X4170/X4270/X4275 May Panic with "pcieb-4: PCI(-X) Express Fatal Error. (0x101)"  


Related Items
  • Sun Fire X4270 Server
  •  
  • Sun Fire X4270 M2 Server
  •  
  • Sun Fire X4170 Server
  •  
  • Sun Fire X4170 M2 Server
  •  
  • Sun Fire X4275 Server
  •  
  • Sun Netra X4270 Server
  •  
Related Categories
  • PLA-Support>Sun Systems>x64>Server>SN-x64: SERVER 64bit
  •  




Applies to:

Sun Fire X4170 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun Fire X4170 M2 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun Fire X4270 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun Fire X4270 M2 Server - Version Not Applicable to Not Applicable [Release N/A]
Sun Fire X4275 Server - Version Not Applicable to Not Applicable [Release N/A]
x86_64

Symptoms

The following describes the fingerprints to confirm this issue occurred.

The system will panic with either of the following panic strings:

  • pcieb-4: PCI(-X) Express Fatal Error. (0x101)
  • pcieb-5: PCI(-X) Express Fatal Error. (0x101)
  • pcieb-4: PCI(-X) Express Fatal Error. (0x103)
  • pcieb-5: PCI(-X) Express Fatal Error. (0x103)
  • pcie_pci-5: PCI(-X) Express Fatal Error
  • pcie_pci-4: PCI(-X) Express Fatal Error

"SUNW-MSG-ID: SUNOS-8000-0G" will be seen in /var/adm/messages and 'fmadm faulty', eg:

genunix: [ID 843051 kern.info] NOTICE: SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major
unix: [ID 836849 kern.notice]
^Mpanic[cpu9]/thread=fffffe8000a65c60:
genunix: [ID 647700 kern.notice] pcieb-4: PCI(-X) Express Fatal Error. (0x101)
unix: [ID 100000 kern.notice]
genunix: [ID 655072 kern.notice] fffffe8000a65bf0 pcieb:pcieb_intr_handler+1ea ()
genunix: [ID 655072 kern.notice] fffffe8000a65c40 unix:av_dispatch_autovect+78 ()
genunix: [ID 655072 kern.notice] fffffe8000a65c50 unix:intr_thread+5f ()
unix: [ID 100000 kern.notice]


Running 'fmdump -e' will show the following events similar to the following:

ereport.io.pci.nr            <-- No response
ereport.io.pci.nr            <-- No response
ereport.io.pci.nr            <-- No response
ereport.io.pci.fabric        <-- PCIe/PCI Fabric Error
ereport.io.pciex.rc.ce-msg   <-- PCI Express root complex received a correctable error message
ereport.io.pci.fabric        <-- PCIe/PCI Fabric Error
ereport.io.pciex.rc.mce-msg  <-- PCI Express root complex received multiple correctable errors
ereport.io.pci.fabric        <-- PCIe/PCI Fabric Error
ereport.io.pci.fabric        <-- PCIe/PCI Fabric Error
ereport.io.pci.fabric        <-- PCIe/PCI Fabric Error
ereport.io.pciex.pl.re       <-- PCI Express physical layer Receiver Error
ereport.io.pciex.rc.ce-msg   <-- PCI Express root complex received a correctable error message

 

This issue can be seen with or without PCI cards installed on the PCI Riser on the panicing pcie device path (pcieb-4 or pcieb-5)

Cause

<SunBug: 7043991> identified an incorrect settings within the Lynx IDT PCIe switch which can, in rare circumstances, lead to the switch reporting receiver underflow or overflow events. This causes the switch link to drop leading to a surprise link down event and a reset of the switch.

When this event happens the primary PCI Riser is unresponsive which is why the ereport.io.pci.nr and/or ereport.io.pciex.pl.re events are detected by FMA.

Solution

The issue is resolved in the following BIOS releases

Sun Fire X4170

  • <Patch: 10432029> - X4170 SW 2.4 - ILOM and BIOS and later


Sun Fire X4170 M2

  • <Patch: 13357887> - Sun Fire X4170 M2/X4270 M2 ILOM Software release 1.5


Sun Fire X4270

  • <Patch: 10432029> - X4270 SW 2.4 - ILOM and BIOS and later


Sun Fire X4270 M2

  • <Patch: 13357887> - Sun Fire X4170 M2/X4270 M2 ILOM Software release 1.5

Sun Netra X4270

  • <Patch: 14141821> - SUN NETRA X4270 SERVER SOFTWARE 1.2 - ILOM AND BIOS


Sun Fire X4275

  • <Patch: 10432029> - X4275 SW 2.4 - ILOM and BIOS and later

References


Attachments
This solution has no attachment
  Copyright © 2012 Sun Microsystems, Inc.  All rights reserved.
 Feedback