Sun Microsystems, Inc.  Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition

Asset ID: 1-77-1019498.1
Update Date: 2011-02-17

Solution Type  Sun Alert Sure

Solution  1019498.1 :   Sun StorageTek 25x0 and 6140 Arrays may send wrong Network packets, causing the Sun StorageTek Common Array Manager (CAM) Host to Lose the Network Connection to the Array  


Related Items
  • Sun Storage 2510 Array
  • Sun Storage 2530 Array
  • Sun Storage 2540 Array
  • Sun Storage 6140 Array
Related Categories
  • GCS>Sun Microsystems>Sun Alert>Criteria Category>Availability
  • GCS>Sun Microsystems>Sun Alert>Release Phase>Resolved

PreviouslyPublishedAs
240105


Bug Id
<SUNBUG: 6721662>

Product
Sun StorageTek 2540 Array
Sun StorageTek 2530 Array
Sun StorageTek 2510 Array
Sun StorageTek 6140 Array

Date of Workaround Release
01-Aug-2008

Date of Resolved Release
21-Oct-2008

Sun StorageTek 25x0 and 6140 Arrays may send wrong Network packets, causing the Sun StorageTek Common Array Manager (CAM) Host to Lose the Network Connection to the Array

1. Impact

Sun StorageTek 25x0 and 6140 arrays with certain firmware may send incorrect packets to the network. This can cause the Sun StorageTek Common Array Manager (CAM) host to lose its network connection to the array, which in turn may prevent out-of-band array management (via the network) with CAM.

2. Contributing Factors

This issue can occur on the following platforms:
  • Sun StorageTek 2510, 2530, 2540 Arrays without firmware 06.70.54.11
  • Sun StorageTek 6140 Array without firmware 06.60.11.11
  • Sun StorageTek 6140 Array without firmware 07.10.26.12
Note: Access to data is not affected even if the Management Host is also a Data Host.

When a host with Sun StorageTek Common Array Manager (CAM) installed accesses an array through CAM, the array may send the packets it received from the host back onto the network unchanged (that is, it forwards the same packets again). If both the CAM host and the array controllers are attached to a network switch, the switch then records the CAM host's MAC address against the switch port connected to the array, which causes the CAM host to lose its network connection to the array.
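
The effect on the switch can be illustrated with a minimal sketch of a MAC-learning table. This is a simplified model for illustration only, not the behavior of any particular switch. The class name, MAC addresses, and port numbers are hypothetical.

```python
class LearningSwitch:
    """Toy model of a layer-2 switch that learns source MAC addresses per port."""

    def __init__(self):
        # MAC address -> port on which that address was last seen as a source
        self.mac_table = {}

    def receive(self, port, src_mac, dst_mac):
        """Learn src_mac on the ingress port; return the egress port (None means flood)."""
        self.mac_table[src_mac] = port
        return self.mac_table.get(dst_mac)


# Hypothetical addresses and ports, chosen only for this example.
CAM_MAC, ARRAY_MAC = "00:00:00:00:00:01", "00:00:00:00:00:02"
CAM_PORT, ARRAY_PORT = 1, 2

switch = LearningSwitch()
switch.receive(ARRAY_PORT, ARRAY_MAC, CAM_MAC)   # array is learned on its own port
switch.receive(CAM_PORT, CAM_MAC, ARRAY_MAC)     # CAM host sends a normal request
port_before = switch.mac_table[CAM_MAC]          # CAM host is mapped to CAM_PORT

# The array re-sends the host's frame unchanged, so the source MAC is still the CAM host's.
switch.receive(ARRAY_PORT, CAM_MAC, ARRAY_MAC)
port_after = switch.mac_table[CAM_MAC]           # CAM host is now mapped to ARRAY_PORT

# A genuine reply from the array to the CAM host is sent back toward the array port.
reply_egress = switch.receive(ARRAY_PORT, ARRAY_MAC, CAM_MAC)

print("CAM host learned on port", port_before, "before and port", port_after, "after")
print("Reply to CAM host leaves on port", reply_egress)
```

In this model the reply never reaches the CAM host's port until the switch relearns the correct mapping from new traffic sent by the CAM host.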

3. Symptoms

Once the connection is lost, the "sscs list event" command may return messages similar to the following:

("test2540" is the array name of ST2540 in this example):
$ sscs list event
:
2008-07-08 07:17:56 257 test2540 (StorageTek 2540) OutOfBand    Communication Lost       
2008-07-08 07:26:04 260 test2540 (StorageTek 2540) OutOfBand    Communication Established
2008-07-08 07:46:03 265 test2540 (StorageTek 2540) OutOfBand    Communication Established
2008-07-08 08:01:49 269 test2540 (StorageTek 2540) OutOfBand    Communication Established
2008-07-08 08:11:04 272 test2540 (StorageTek 2540) OutOfBand    Communication Established
2008-07-08 08:33:38 277 test2540 (StorageTek 2540) OutOfBand    Communication Lost
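
A listing like the one above can be summarized with a short script. The following is a sketch only: the column layout is inferred from the sample lines in this document rather than from a documented output format, and the field names (`event_id`, `channel`, and so on) are hypothetical.

```python
import re
from collections import Counter

# Pattern inferred from the sample output; other CAM versions may format events differently.
EVENT_LINE = re.compile(
    r"^(?P<date>\d{4}-\d{2}-\d{2}) (?P<time>\d{2}:\d{2}:\d{2}) "
    r"(?P<event_id>\d+) (?P<array>\S+) \((?P<model>[^)]*)\)\s+"
    r"(?P<channel>\S+)\s+(?P<message>.+?)\s*$"
)


def parse_events(text):
    """Return one dict per line that matches the assumed event layout."""
    events = []
    for line in text.splitlines():
        match = EVENT_LINE.match(line.strip())
        if match:
            events.append(match.groupdict())
    return events


SAMPLE = """\
:
2008-07-08 07:17:56 257 test2540 (StorageTek 2540) OutOfBand    Communication Lost
2008-07-08 07:26:04 260 test2540 (StorageTek 2540) OutOfBand    Communication Established
2008-07-08 07:46:03 265 test2540 (StorageTek 2540) OutOfBand    Communication Established
2008-07-08 08:01:49 269 test2540 (StorageTek 2540) OutOfBand    Communication Established
2008-07-08 08:11:04 272 test2540 (StorageTek 2540) OutOfBand    Communication Established
2008-07-08 08:33:38 277 test2540 (StorageTek 2540) OutOfBand    Communication Lost
"""

events = parse_events(SAMPLE)
counts = Counter(event["message"] for event in events)
for message, count in sorted(counts.items()):
    print(f"{message}: {count}")
```

Repeated "Communication Lost" entries for the same array on the OutOfBand channel are the pattern to look for.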
Or, the sscs command may return a message similar to the following:
$ sscs list array test2540
test2540: The object was not found.
Or the following error message:
$ sscs list array test2540
An unexpected internal system error occurred.
Retry the operation and then contact your Sun service representative if the error persists.
The Sun StorageTek Common Array Manager (CAM) web interface may return the following error message when you click the array name link of the Sun StorageTek 25x0 array:
(!) An error occurred during the operation to retrieve data.
The selected item was not found. It may have been deleted or modified by another session or operation.
Or:
(!) The array test2540 could not be contacted.
The operation to obtain data from the array failed.
Please look for "Lost Communication" alarm in Alarms page to get instruction.

4. Workaround

By default, the CAM host communicates with the arrays out-of-band over Ethernet. To avoid this issue temporarily, configure in-band management (with CAM version 6.0 or later) so that management traffic travels over the data path (Fibre Channel, etc.) between the data host and the array instead.

To configure In-Band Management with CAM, please refer to "Sun StorageTek Common Array Manager Software Installation Guide" (Version 6.0 or later).

Sun StorageTek Common Array Manager Software 6.0 Documentation can be found at:


5. Resolution

This issue is addressed in the following releases:

The 25x0 and 6140 RAID controller firmware fix is available for CAM version 6.1.0 with the following patches:

SPARC Platform
  • Sun StorageTek 2510/2530/2540 Arrays with firmware 06.70.54.11 (patch 137934-01 for Solaris CAM 6.1.0) or later
  • Sun StorageTek 6140 Arrays with firmware 06.60.11.11 (patch 137934-02 for Solaris CAM 6.1.0) or later
  • Sun StorageTek 6140 Arrays with firmware 07.10.26.12 (patch 137934-02 for Solaris CAM 6.1.0) or later
Windows
  • Sun StorageTek 2510/2530/2540 Arrays with firmware 06.70.54.11 (patch 137935-01 for Windows CAM 6.1.0) or later
  • Sun StorageTek 6140 Arrays with firmware 06.60.11.11 (patch 137935-02 for Windows CAM 6.1.0) or later
  • Sun StorageTek 6140 Arrays with firmware 07.10.26.12 (patch 137935-02 for Windows CAM 6.1.0) or later
Linux
  • Sun StorageTek 2510/2530/2540 Arrays with firmware 06.70.54.11 (patch 137936-01 for Linux RHEL SuSE CAM 6.1.0) or later
  • Sun StorageTek 6140 Arrays with firmware 06.60.11.11 (patch 137936-02 for Linux RHEL SuSE CAM 6.1.0) or later
  • Sun StorageTek 6140 Arrays with firmware 07.10.26.12 (patch 137936-02 for Linux RHEL SuSE CAM 6.1.0) or later

The 25x0 and 6140 RAID controller firmware fix is available for CAM version 6.1.1 with the following patches:

SPARC Platform
  • Sun StorageTek 2510/2530/2540 Arrays with firmware 06.70.54.11 (patch 139034-01 for Solaris CAM 6.1.1) or later
  • Sun StorageTek 6140 Arrays with firmware 06.60.11.11 (patch 139034-01 for Solaris CAM 6.1.1) or later
  • Sun StorageTek 6140 Arrays with firmware 07.10.26.12 (patch 139034-01 for Solaris CAM 6.1.1) or later
Windows
  • Sun StorageTek 2510/2530/2540 Arrays with firmware 06.70.54.11 (patch 139035-01 for Windows CAM 6.1.1) or later
  • Sun StorageTek 6140 Arrays with firmware 06.60.11.11 (patch 139035-01 for Windows CAM 6.1.1) or later
  • Sun StorageTek 6140 Arrays with firmware 07.10.26.12 (patch 139035-01 for Windows CAM 6.1.1) or later
Linux
  • Sun StorageTek 2510/2530/2540 Arrays with firmware 06.70.54.11 (patch 139036-01 for Linux RHEL SuSE CAM 6.1.1) or later
  • Sun StorageTek 6140 Arrays with firmware 06.60.11.11 (patch 139036-01 for Linux RHEL SuSE CAM 6.1.1) or later
  • Sun StorageTek 6140 Arrays with firmware 07.10.26.12 (patch 139036-01 for Linux RHEL SuSE CAM 6.1.1) or later

The 25x0 and 6140 RAID controller firmware fix is available for CAM version 6.1.2 with the following patches:

SPARC Platform
  • Sun StorageTek 2510/2530/2540 Arrays with firmware 06.70.54.11 (patch 139339-01 for Solaris CAM 6.1.2) or later
  • Sun StorageTek 6140 Arrays with firmware 06.60.11.11 (patch 139339-01 for Solaris CAM 6.1.2) or later
  • Sun StorageTek 6140 Arrays with firmware 07.10.26.12 (patch 139339-01 for Solaris CAM 6.1.2) or later
Windows
  • Sun StorageTek 2510/2530/2540 Arrays with firmware 06.70.54.11 (patch 139340-01 for Windows CAM 6.1.2) or later
  • Sun StorageTek 6140 Arrays with firmware 06.60.11.11 (patch 139340-01 for Windows CAM 6.1.2) or later
  • Sun StorageTek 6140 Arrays with firmware 07.10.26.12 (patch 139340-01 for Windows CAM 6.1.2) or later
Linux
  • Sun StorageTek 2510/2530/2540 Arrays with firmware 06.70.54.11 (patch 139341-01 for Linux RHEL SuSE CAM 6.1.2) or later
  • Sun StorageTek 6140 Arrays with firmware 06.60.11.11 (patch 139341-01 for Linux RHEL SuSE CAM 6.1.2) or later
  • Sun StorageTek 6140 Arrays with firmware 07.10.26.12 (patch 139341-01 for Linux RHEL SuSE CAM 6.1.2) or later

To resolve this issue, install one of the patches above on the host where CAM 6.1.0, 6.1.1, or 6.1.2 is installed. Then upgrade the RAID controller firmware.
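
The patch lists above can be condensed into a lookup table. This sketch only restates the patch IDs given in this document. The dictionary layout, the platform keys, and the function name `patch_for` are hypothetical conveniences, not part of CAM.

```python
# (CAM version, management host platform) -> {array family: patch ID}
# "25x0" covers the 2510/2530/2540 arrays (firmware 06.70.54.11 or later);
# "6140" covers the 6140 array (firmware 06.60.11.11 or 07.10.26.12, or later).
PATCHES = {
    ("6.1.0", "solaris"): {"25x0": "137934-01", "6140": "137934-02"},
    ("6.1.0", "windows"): {"25x0": "137935-01", "6140": "137935-02"},
    ("6.1.0", "linux"):   {"25x0": "137936-01", "6140": "137936-02"},
    ("6.1.1", "solaris"): {"25x0": "139034-01", "6140": "139034-01"},
    ("6.1.1", "windows"): {"25x0": "139035-01", "6140": "139035-01"},
    ("6.1.1", "linux"):   {"25x0": "139036-01", "6140": "139036-01"},
    ("6.1.2", "solaris"): {"25x0": "139339-01", "6140": "139339-01"},
    ("6.1.2", "windows"): {"25x0": "139340-01", "6140": "139340-01"},
    ("6.1.2", "linux"):   {"25x0": "139341-01", "6140": "139341-01"},
}


def patch_for(cam_version, platform, array_family):
    """Return the patch ID listed for this combination, or None if it is not listed."""
    return PATCHES.get((cam_version, platform), {}).get(array_family)


print(patch_for("6.1.0", "solaris", "6140"))
print(patch_for("6.1.2", "linux", "25x0"))
```

Note that only the CAM 6.1.0 patches differ by array family; for CAM 6.1.1 and 6.1.2 a single patch per platform covers both the 25x0 and 6140 arrays.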

However, when the RAID controller firmware is upgraded over the network, the original network communication issue may still occur. To temporarily avoid it while upgrading the RAID controller firmware, follow the procedure below.

1) Install one of the patches listed above on the management host (CAM host).
2) Check the IP address of the ST25x0 controller registered on the CAM host with the following commands. Run these commands on the CAM host:
$ sscs login -h localhost -u root
$ sscs list devices
Example:
$ sscs list devices

Monitored On  Device    Type  IP Address    WWN               Active  ASR
------------  --------  ----  ------------  ----------------  ------  ---
camhost       test2540  2540  192.168.1.10  200a00a0b835eb27  Y       N
3) Run the following ping command with the IP address obtained in the previous step, so that the RAID controller is pinged once per second continuously. (Run this command on the CAM host.)
Solaris:
$ ping -s <RAID_Controller_IP_address>
Windows:
> ping -t <RAID_Controller_IP_address>
Linux: Contact your Linux OS support provider for the method of pinging the RAID controller once per second continuously.
4) While the ping command is running, perform the firmware upgrade using CAM's web interface.
5) Once the firmware upgrade has finished, stop the ping command started in step 3 by typing ^C (Ctrl-C) in the shell window where it is running.
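
Steps 3 to 5 can also be wrapped in a small helper that starts the ping before the upgrade and stops it afterwards. This is a sketch under assumptions, not a Sun-provided tool. The function names are hypothetical, the Solaris and Windows commands are the ones given in step 3, and the Linux command assumes that plain `ping` repeats once per second until interrupted, which is typical of common distributions but is not stated in this document.

```python
import contextlib
import subprocess


def build_ping_command(os_name, ip_address):
    """Return the continuous ping command for the given CAM host operating system."""
    if os_name == "solaris":
        return ["ping", "-s", ip_address]   # from step 3
    if os_name == "windows":
        return ["ping", "-t", ip_address]   # from step 3
    if os_name == "linux":
        # Assumption: plain ping repeats every second until stopped.
        return ["ping", ip_address]
    raise ValueError(f"unsupported operating system: {os_name}")


@contextlib.contextmanager
def keepalive_ping(os_name, ip_address):
    """Run the ping in the background for the duration of the with-block."""
    process = subprocess.Popen(
        build_ping_command(os_name, ip_address),
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    try:
        yield process
    finally:
        process.terminate()   # equivalent of pressing Ctrl-C in step 5
        process.wait()


# Example use on the CAM host (IP address taken from the sample output in step 2):
#
#     with keepalive_ping("solaris", "192.168.1.10"):
#         input("Perform the firmware upgrade in CAM, then press Enter here...")
```

The context manager guarantees that the ping is stopped even if the script is interrupted during the upgrade.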




Modification History
26-Aug-2008: Updated Symptoms and Resolution sections
09-Sep-2008: Updated Product, Impact, Contributing Factors and Resolution sections
12-Sep-2008: Updated Resolution section for clarification
21-Oct-2008: Updated Contributing Factors and Resolution sections; Resolved


References

<SUNPATCH: 137934-01>
<SUNPATCH: 137934-02>
<SUNPATCH: 137935-01>
<SUNPATCH: 137935-02>
<SUNPATCH: 137936-01>
<SUNPATCH: 137936-02>
<SUNPATCH: 139034-01>
<SUNPATCH: 139035-01>
<SUNPATCH: 139036-01>
<SUNPATCH: 139339-01>
<SUNPATCH: 139340-01>
<SUNPATCH: 139341-01>

Internal Comments
Internal Eng Business Unit Group
NWS (Network Storage)
Internal Escalation ID
11250405, 38080371, 11230084, 11205155
Internal Resolution Patches
137934-01, 137934-02, 137935-01, 137935-02, 137936-01, 137936-02, 139034-01, 139035-01, 139036-01, 139339-01, 139340-01, 139341-01

Attachments
This solution has no attachment
  Copyright © 2011 Sun Microsystems, Inc.  All rights reserved.