Asset ID: |
1-75-1395461.1 |
Update Date: | 2012-06-25 |
Keywords: | |
Solution Type
Troubleshooting Sure
Solution
1395461.1
:
Sun Storage 7000 Unified Storage System: Best Practice Recommendations for Network Configuration
Related Items |
- Sun Storage 7310 Unified Storage System
- Sun Storage 7410 Unified Storage System
- Sun ZFS Storage 7120
- Sun ZFS Storage 7320
- Sun ZFS Storage 7420
- Sun Storage 7110 Unified Storage System
- Sun Storage 7210 Unified Storage System
|
Related Categories |
- PLA-Support>Sun Systems>DISK>NAS>SN-DK: 7xxx NAS
- .Old GCS Categories>Sun Microsystems>Storage - Disk>Unified Storage
|
In this Document
Applies to:
Sun Storage 7310 Unified Storage System - Version Not Applicable to Not Applicable [Release N/A]
Sun ZFS Storage 7120 - Version Not Applicable to Not Applicable [Release N/A]
Sun ZFS Storage 7320 - Version Not Applicable to Not Applicable [Release N/A]
Sun ZFS Storage 7420 - Version Not Applicable to Not Applicable [Release N/A]
Sun Storage 7110 Unified Storage System - Version Not Applicable to Not Applicable [Release N/A]
7000 Appliance OS (Fishworks)
NAS head revision : [not dependent]
BIOS revision : [not dependent]
ILOM revision : [not dependent]
JBODs Model : [not dependent]
CLUSTER related : [not dependent]
Purpose
This document will explain the best practice recommendations for setting up various network related configurations on the Sun Storage 7000 Unified Storage System.
Troubleshooting Steps
The appliance uses a 4 layer model for network configuration.
- Devices - these are the physical instances of a network interface, for example the onboard quad gigabit ethernet card in a 7110 gives 4 devices out of the box - nge0, nge1, nge2 and nge3.
- Datalinks - the datalinks are the means by which packets are sent and received, they can be associated with one or more devices. Link aggregation (LACP) and VLANs are configured at the datalink layer.
- Interfaces - these are the means by which addressing is configured. An interface is either associated with a single datalink, or is associated with a group of other interfaces in an IPMP group.
- Routing - Governs how the IP packets will be directed. Routes can be added automatically by the system or manually by an administrator. Automatically added routes will show as "System", "DHCP" or "Dynamic" depending on how it was added. A manually added route will show as "Static", or "Inactive" if associated with an offline or inactive interface.
Datalink layer
Link Aggregation or LACP is used primarily as a means of increasing performance. It works by associating two or more devices with a single datalink to increase the throughput available to that datalink. It can be seen then that for Link Aggregation to work correctly the devices that are to be used in the aggregation must be cabled to the same switch before attempting to configure the datalink.
To configure a link aggregation via the BUI simply navigate to Configuration > Network and click on the "+" icon next to the Datalinks section. Next, simply name the new datalink, check the LACP checkbox and then select which devices should be used in the datalink from the list of available devices.
Please note that some switches do not use the LACP protocol, please see the Configuration:Network:#Datalinks section of the appropriate Administration manual for your system, or the same section available through the HELP link in the BUI for recommendations on the Properties to use for different scenarios. Please also note that some configuration on the switch may also be necessary. See your switch manufacturers documentation for details.
An example of configuring a switch to support Link Aggregation can be seen here:
<Document 1400154.1> "Sun Storage 7000 Unified Storage System: An example of how to configure Link Aggregation on a switch".
Attempting to configure an LACP datalink to a switch that is not configured or able to support it can cause the Management Interface on the appliance to become unusable.
See <Document 1396100.1> "Sun Storage 7000 Unified Storage System: Causes and Solutions for Well Known General Networking Problems" for details on this.
VLANs can be used to increase network security and isolation, and also to increase the number of available datalinks if there are a small number of available network devices. Again, you must make sure that your network switch is able to support VLANs.
If you do use VLANs then this can often mean that you are configuring many datalinks with many interfaces associated with those datalinks. Please note that if you wish to use SNMP to monitor your system that there is currently (as of 2011.1.1.0) a limit of 20 interfaces that SNMP can monitor.
Interface Layer
IPMP is used primarily as a way of increasing redundancy so that network connectivity is unaffected by the failure of a single component be it a physical network port, a cable or a switch. To provide this redundancy, an IPMP group is created where the IPMP interface sits above two or more interfaces that are associated with datalinks. For the maximum redundancy the lower level interfaces associated with datalinks must have those datalinks associated with devices connected physically to different switches so that if one switch fails, other datalinks are still active.
To configure an IPMP interface via the BUI navigate to Configuration > Network and click on the "+" icon ext to Interfaces. Now check the "IP MultiPathing Group" checkbox and select the interfaces to include in the IPMP group from the list of available interfaces.
To see more details on configuring interfaces and IPMP see the Configuration:Network#Interfaces section of the appropriate Administration manual for your system, or check the same section available through the online HELP in the BUI.
There are two methods that are used by the appliance to determine if an interface is failed.
- Probe-based failure detection - this will issue an ICMP probe from each of the datalink-associated interfaces (the test interfaces) in the IPMP group in turn, to either a default gateway, or the first 5 systems on the same subnet that respond to a multicast ICMP probe. If 5 consecutive pings are unanswered the interface is considered failed. Please note that this does not necessarily mean that the device itself has physically failed. Indeed this is probably the least likely cause of the problem. See <Document 1396100.1> "Sun Storage 7000 Unified Storage System: Causes and Solutions for Well Known General Networking Problems" for further details. If then 10 further consecutive pings are answered the interface is then considered repaired.
- Link-based failure detection - uses properties of the network device driver to check on whether the link to the network is active.
The best practice recommendation is to use link-based failure detection on the appliance. This removes the dependence on other networking components external to the appliance to provide a stable network interface. To enable link-based failure detection you need to make sure that the test interfaces in an IPMP group do not have a traditional IP addresses configured. Instead they should be configured with the address and netmask of 0.0.0.0/8. Only the IPMP interface itself should be configured with a valid IP address and netmask for the appropriate subnet.
It is possible to have both link aggregation for performance reasons, and IPMP for redundancy reasons. The best best practice is to create two or more aggregated datalinks on the appliance, such that within each aggregated datalink are two or more devices connected from the appliance to the same network switch. An interface is then created for each aggregated datalink with the 0.0.0.0/8 address so it will do link-based failure detection. Finally an IPMP interface will be created and configured with an appropriate IP address on the correct subnet, this IPMP interface will be linked to all the 0.0.0.0/8 test interfaces on the aggregated datalinks. Each of those test interfaces can be chosen to be ACTIVE or STANDBY as required.
Routing
If routing is administered manually and RIP and RIPng routing protocols are not allowed to automatically configure dynamic routes, then follow these best practices:
- Have a single default route configured to use the main admin network interface on the appliance
- Configure individual static routes for each subnet that the data share clients use
This will ensure that requests made by clients on the data networks are not routed back through the admin interface. The interfaces and datalinks that connect to the client data networks should use the higher throughput devices if any are installed. e.g. the 10 Gbps Ethernet devices rather than the 1 Gbps onboard devices. These onboard devices can be used for the admin interface as this will not require a high throughput.
If it is necessary to have a default route set for the data networks as well then it will be necessary to change the "multihoming" policy from the default of "loose" to prevent replies being sent back to client requests by any available route.
Please see the Configuration:Network#Routing section of the appropriate Administration manual for the appropriate system, or the same section of the online guide available via the BUI "HELP" icon, for further details on routing and the "multihoming" policy.
Services
DNS - The appliance works best when the DNS service is correctly configured and able to resolve all hostnames successfully. Although it is possible to specify the loopback IP address of 127.0.0.1 during initial configuration for DNS servers this is not recommended in a production environment, and is only suitable for testing purposes. The appliance will not be able to resolve hostnames of other servers in this situation and critical services may not work. This is especially true if Active Directory is used as a directory service. In this case at least one of the DNS servers must be able to resolve hostname and server records in the Active Directory portion of the domain namespace. The DNS server(s) should contain both forward and reverse lookup entries for the appliance.
NTP - It is recommended that NTP be used to synchronize the time on the appliance and on any other severs that may be required to provide client access to shares. For example if Active Directory is used to authenticate users of an SMB share then the time on the Active Directory Server and the appliance must agree. NTP is the best way to achieve this. In order to have NTP synchronize the times there must be less than 5 minutes difference between the time on the appliance and the time provided by the NTP server when NTP is configured.
Clustering Considerations
When configuring a cluster, it is recommended that each cluster head have it's own dedicated admin interface that is private and not part of the cluster resources that will be subject to takeover and failback operations. The reason for this is so that each head will be able to still be accessed via the BUI once a takeover has happened. This will enable faster troubleshooting to find the root cause of unexpected takeovers or reboots. It will also allow a support bundle to be collected from a cluster head in a "STRIPPED" state.
Unfortunately doing this does require that the device that is dedicated to the admin interface on one head cannot be used for anything on the other head. For example if nge0 is used for the private dedicated admin interface on head A in a 7410 cluster then nge0 cannot be used for anything else on head B. Head B must use a different device for it's dedicated private admin interface, say nge1. Again nge1 therefore cannot be used for any purpose on head A.
Back to <Document 1392086.1> Sun Storage 7000 Unified Storage System: How to Troubleshoot Network Problems.
@ This document contains normalized content and is managed by the Domain Lead(s) of the respective domains. To notify content owners of a knowledge gap contained in this document, and/or prior to updating this document, please contact the domain engineers that are managing this document via the "Document Feedback" alias(es)
References
<NOTE:1396100.1> - Sun Storage 7000 Unified Storage System: Causes and Solutions for Well Known General Networking Problems
<NOTE:1400154.1> - Sun Storage 7000 Unified Storage System: An example of how to configure Link Aggregation on a switch
<NOTE:1392086.1> - Sun Storage 7000 Unified Storage System: How to Troubleshoot Network Problems
Attachments
This solution has no attachment