Asset ID: |
1-72-1473622.1 |
Update Date: | 2012-07-24 |
Keywords: | |
Solution Type
Problem Resolution Sure
Solution
1473622.1
:
Exachk Reports InfiniBand Errors
Related Items |
- Exadata Database Machine X2-2 Full Rack
|
Related Categories |
- PLA-Support>Sun Systems>x64>Engineered Systems HW>SN-x64: EXADATA
|
Exacheck reports infiniband errors in a setup with 1 Exadata X2-2 FullRack connected to 1 Exalogic HalfRack.
Created from <SR 3-5863579341>
Applies to:
Exadata Database Machine X2-2 Full Rack - Version Not Applicable to Not Applicable [Release N/A]
Information in this document applies to any platform.
Symptoms
Exacheck reports infiniband errors in a setup with 1 Exadata X2-2 FullRack connected to 1 Exalogic HalfRack.
Errors for 0x2128df102ac000 "SUN IB QDR GW switch camgw01 10.0.55.193 Bridge 0"
GUID 0x2128df102ac000 port 1: [VL15Dropped == 5]
Link info: 52 1[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 0x002128df102ac0a0 74 4[ ] "SUN IB QDR GW switch camgw01 10.0.55.193 Bridge 0" ( )
GUID 0x2128df102ac000 port 2: [VL15Dropped == 13]
Link info: 54 2[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 0x002128df102ac0a0 74 3[ ] "SUN IB QDR GW switch camgw01 10.0.55.193 Bridge 0" ( )
Errors for 0x2128df102ac040 "SUN IB QDR GW switch camgw01 10.0.55.193 Bridge 1"
GUID 0x2128df102ac040 port 1: [VL15Dropped == 3]
Link info: 63 1[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 0x002128df102ac0a0 74 2[ ] "SUN IB QDR GW switch camgw01 10.0.55.193 Bridge 1" ( )
GUID 0x2128df102ac040 port 2: [VL15Dropped == 14]
Link info: 65 2[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 0x002128df102ac0a0 74 1[ ] "SUN IB QDR GW switch camgw01 10.0.55.193 Bridge 1" ( )
Errors for 0x2128deac2ac000 "SUN IB QDR GW switch camgw02 10.0.55.194 Bridge 0"
GUID 0x2128deac2ac000 port 1: [VL15Dropped == 6]
Link info: 53 1[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 0x002128deac2ac0a0 75 4[ ] "SUN IB QDR GW switch camgw02 10.0.55.194 Bridge 0" ( )
GUID 0x2128deac2ac000 port 2: [VL15Dropped == 13]
Link info: 55 2[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 0x002128deac2ac0a0 75 3[ ] "SUN IB QDR GW switch camgw02 10.0.55.194 Bridge 0" ( )
Errors for 0x2128deac2ac040 "SUN IB QDR GW switch camgw02 10.0.55.194 Bridge 1"
GUID 0x2128deac2ac040 port 1: [VL15Dropped == 5]
Link info: 64 1[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 0x002128deac2ac0a0 75 2[ ] "SUN IB QDR GW switch camgw02 10.0.55.194 Bridge 1" ( )
GUID 0x2128deac2ac040 port 2: [VL15Dropped == 26]
Link info: 66 2[ ] ==( 4X 10.0 Gbps Active/ LinkUp)==> 0x002128deac2ac0a0 75 1[ ] "SUN IB QDR GW switch camgw02 10.0.55.194 Bridge 1" ( )
VL15 stands for "Virtual Lane" (VL) 15
Changes
Cause
VL15 is used for SM traffic, but also by tools like ibdiagnet, and get the highest priority, but there is no flowcontrol, i.e. if there is no available buffer capacity in the receiver side, the packet will be dropped (and the VL15dropped counter will increment).
Solution
You can ignore these messages.
You can also run this command to clear the cumulative errors:
ibqueryerrors.pl -rR -s LinkDowned,RcvSwRelayErrors,XmtDiscards,XmtWait
Attachments
This solution has no attachment