Sun Microsystems, Inc.  Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition

Asset ID: 1-72-1397685.1
Update Date:2012-06-25
Keywords:

Solution Type  Problem Resolution Sure

Solution  1397685.1 :   Sun Storage 7000 Unified Storage System: nge network interfaces may disappear from the appliance under certain conditions  


Related Items
  • Sun Storage 7310 Unified Storage System
  • Sun Storage 7410 Unified Storage System
  • Sun Storage 7110 Unified Storage System
  • Sun Storage 7210 Unified Storage System
Related Categories
  • PLA-Support>Sun Systems>DISK>NAS>SN-DK: 7xxx NAS
  • .Old GCS Categories>Sun Microsystems>Storage - Disk>Unified Storage


This document applies to 7x10 series unified storage appliances that use nge network interfaces.

In this Document
Symptoms
Cause
Solution
References


Applies to:

Sun Storage 7210 Unified Storage System - Version Not Applicable to Not Applicable [Release N/A]
Sun Storage 7310 Unified Storage System - Version Not Applicable to Not Applicable [Release N/A]
Sun Storage 7410 Unified Storage System - Version Not Applicable to Not Applicable [Release N/A]
Sun Storage 7110 Unified Storage System - Version Not Applicable to Not Applicable [Release N/A]
7000 Appliance OS (Fishworks)
NAS head revision : [before 2010.Q3.3.1]
BIOS revision : [not dependent]
ILOM revision : [not dependent]
JBODs Model : [not dependent]
CLUSTER related : [not dependent]


Symptoms

Under certain conditions, an interface on an nge device can fail to attach to the device's datalink, which leaves the interface faulted.

The interface may appear to have failed, or to be unconfigured, after a reboot.
The system log for the appliance will show errors similar to those below:

Jan 27 01:45:41 appliance nge: [ID 801725 kern.warning] WARNING: nge2: nge_attach: nge_smu_sema() failed
Jan 27 01:47:17 appliance last message repeated 19 times


The debug.sys log may show errors similar to the following:

Jan 27 01:47:18 appliance svc.startd[45]: [ID 652011 daemon.warning] svc:/appliance/kit/network/datalink:nge2: Method "exec /usr/lib/ak/svc/method/akdatalink start" failed with exit status 95.
Jan 27 01:47:18 appliance svc.startd[45]: [ID 748625 daemon.error] appliance/kit/network/datalink:nge2 failed fatally: transitioned to maintenance (see 'svcs -xv' for details)
Jan 27 01:47:25 appliance nge: [ID 801725 kern.warning] WARNING: nge2: nge_attach: nge_smu_sema() failed
Jan 27 01:47:47 appliance last message repeated 4 times
Jan 27 01:47:47 appliance ip: [ID 171694 kern.error] nge2: DL_ATTACH_REQ failed: DL_SYSERR (errno 22)
Jan 27 01:47:47 appliance ip: [ID 960278 kern.error] nge2: DL_BIND_REQ failed: DL_OUTSTATE
Jan 27 01:47:47 appliance ip: [ID 320251 kern.error] nge2: DL_PHYS_ADDR_REQ failed: DL_OUTSTATE
Jan 27 01:47:47 appliance ip: [ID 316886 kern.error] nge2: DL_UNBIND_REQ failed: DL_OUTSTATE
Jan 27 01:47:54 appliance svc.startd[45]: [ID 652011 daemon.warning] svc:/appliance/kit/network/interface:ipmp1: Method "exec /usr/lib/ak/svc/method/akinterface start" failed with exit status 95.
Jan 27 01:47:54 appliance svc.startd[45]: [ID 748625 daemon.error] appliance/kit/network/interface:ipmp1 failed fatally: transitioned to maintenance (see 'svcs -xv' for details)
Jan 27 01:48:32 appliance nge: [ID 801725 kern.warning] WARNING: nge2: nge_attach: nge_smu_sema() failed

Also look in the interface SMF service logs for messages like:

ifconfig: SIOCSLIFNAME for ip: nge3: Invalid argument
ifconfig: setifflags: SIOCGLIFFLAGS: nge3: no such interface

These symptoms most likely indicate <CR 6860426> nge_smu_sema() is failing under certain conditions.
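
The log signature described above can be checked mechanically against a saved copy of the system log. The following is a minimal sketch, not a supported tool: the function name `count_smu_sema_failures` is hypothetical, the patterns are taken only from the example messages in this document, and it assumes the log has been exported to plain text lines.

```python
import re
from collections import Counter

# Patterns derived from the example log lines in this document.
SMU_SEMA_RE = re.compile(r"WARNING: (nge\d+): nge_attach: nge_smu_sema\(\) failed")
REPEAT_RE = re.compile(r"last message repeated (\d+) times")


def count_smu_sema_failures(lines):
    """Count nge_smu_sema() attach failures per nge instance.

    A "last message repeated N times" line is credited to an instance only
    when it immediately follows an nge_smu_sema() failure line.
    """
    counts = Counter()
    last_dev = None  # instance named on the previous line, if it was a failure
    for line in lines:
        failure = SMU_SEMA_RE.search(line)
        if failure:
            last_dev = failure.group(1)
            counts[last_dev] += 1
            continue
        repeat = REPEAT_RE.search(line)
        if repeat and last_dev is not None:
            counts[last_dev] += int(repeat.group(1))
            continue
        last_dev = None  # any other message breaks the repeat chain
    return counts
```

A non-empty result for an instance such as nge2 matches the symptom shown in the log excerpts; an empty result suggests a different cause for the faulted interface.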


Cause

The problem is caused by the SMU firmware of the onboard network devices. The firmware enters an unknown state in which it no longer releases the semaphore, so nge_smu_sema() fails during nge_attach.

For further details, see <CR 6860426> nge_smu_sema() is failing under certain conditions.

Solution

The nge driver was modified to resolve this problem. To obtain the fix, upgrade the appliance kit to version 2010.08.17.3.1 (2010.Q3.3.1) or later.
A temporary workaround is to reboot the NAS head.
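
Whether a given appliance kit version already contains the driver fix can be decided by comparing it against 2010.08.17.3.1. The following is a minimal sketch under stated assumptions: the helper names `parse_ak_version` and `has_nge_fix` are hypothetical, the version is assumed to be a dotted numeric string in the form written above, and anything after a comma is ignored as a build suffix.

```python
# First appliance kit version with the nge driver fix, as stated in this document.
FIXED_IN = (2010, 8, 17, 3, 1)  # 2010.08.17.3.1 (2010.Q3.3.1)


def parse_ak_version(text):
    """Turn a dotted version string into a tuple of integers.

    Assumption: only the dotted numeric part before any comma matters.
    """
    numeric = text.strip().split(",", 1)[0]
    return tuple(int(part) for part in numeric.split("."))


def has_nge_fix(version_text):
    """Return True if the version is 2010.08.17.3.1 or later."""
    return parse_ak_version(version_text) >= FIXED_IN
```

Comparing integer tuples avoids the pitfalls of comparing version strings character by character, for example "08" against "8" or "10" against "9".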

See the FISHworks wiki Software Update page for details of patch numbers of the appliance kit versions that can be downloaded through the My Oracle Support Portal "Patches and Updates" tab.

 

Back to <Document 1392086.1> Sun Storage 7000 Unified Storage System: How to Troubleshoot Network Problems.

References

<BUG:6860426> - NGE_SMU_SEMA() IS FAILING UNDER CERTAIN CONDITIONS
<NOTE:1392086.1> - Sun Storage 7000 Unified Storage System: How to Troubleshoot Network Problems

Attachments
This solution has no attachment
  Copyright © 2012 Sun Microsystems, Inc.  All rights reserved.