ARL Headswap: Part 3/4 - Node C and Retiring Node B

Performing a non-disruptive ARL (Aggregate Relocate) Headswap for NetApp Clustered Data ONTAP - Step-by-Step Walkthrough Series:


Caveat Lector: Unofficial information!

10) Replace NODE-A with NODE-C

10.1) Cable NODE-C
Moving connections from NODE-A to NODE-C

10.2) Boot NODE-C to the Boot Menu
Power on NODE-C (if not already) and press Ctrl-C to access the boot loader environment.
At the LOADER> prompt type:

boot_ontap prompt

Interrupt the boot sequence by pressing Ctrl-C to get to the Boot Menu

10.3) Enter Maintenance Mode
At the Boot Menu, select (5) Maintenance mode boot

10.4) Verify MPHA *>

storage show disk -p


10.5) Acquire the System ID of NODE-C *>

sysconfig
disk show -a


10.6) Assign NODE-A’s Disks to NODE-C
Note: Only root aggr and spare disks get assigned to NODE-C. Check with ‘disk show -a’.
(8.2.x) Reassign disks from NODE-A to NODE-C *>

disk reassign -s NODE-A_SYSID -d NODE-C_SYSID

(8.3.x) Reassign disks from NODE-A to NODE-C *>

disk reassign -s NODE-A_SYSID -d NODE-C_SYSID -p NODE-B_SYSID

Note: The -p option is only required in maintenance mode when shared disks are present.
Enter n to ‘Abort reassignment (y/n)?’
Enter y to ‘After the node becomes operation ... Do you want to continue (y/n)?’
Enter y to ‘Disk ownership will be updated ... Do you want to continue (y/n)?’

10.7) Verify ha setting shows “ha” *>

ha-config show

Note: To modify use*>

ha-config modify controller ha
ha-config modify chassis ha


10.8) Destroy mailbox disks *>

mailbox destroy local
mailbox destroy partner


10.9) Verify personality of FC/UTA ports *>

ucadmin show
ucadmin modify


10.10) Exit maintenance mode *>

halt

This returns us to the LOADER> prompt.

10.11) Check Date
On NODE-B check the date in Clustershell::>

date

On NODE-C check the date and time at LOADER>

show date
show time

And if necessary>

set date mm/dd/yyyy
set time hh:mm:ss


10.12) Boot>

boot_ontap prompt


10.13) UPDATE FLASH FROM BACKUP CONFIG
IMPORTANT - DO NOT LET THE NODE BOOT WITHOUT DOING OPTION 6 FIRST!
Interrupt the boot by pressing Ctrl-C.
At the boot menu, select option (6) Update flash from backup config.

This will replace all flash-based configurations with the last backup to disks. Are you sure you want to continue? y

Enter y

The boot proceeds normally and the system then asks you to confirm the system ID mismatch. Confirm the mismatch.
Example:

WARNING: System id mismatch. This usually occurs when replacing CF or NVRAM cards! Override system id (y|n) ? [n] y

The node might go through one round of reboot before booting normally.

10.14) Verify NODE-C
Log in to NODE-C and from Clustershell::>

cluster show
event log show -messagename scsiblade.*
storage aggregate show -owner-name NODE-C


11) Return Data LIFs to NODE-C

11.1) Broadcast-Domains, Failover-Groups, VLANs, IFGRPs
Handle as required.

11.2) Return data LIFs to NODE-C
Typical commands::>

net int show -curr-node NODE-C
net int modify -vserver VSERVER -lif LIF_NAME -home-port PORT -home-node NODE-C
net int revert -vserver VSERVER -lif LIF_NAME


11.3) Restore SAN LIFs
Typical commands::>

net int show -curr-node NODE-C
net int modify -vserver VSERVER -lif LIF_NAME -status-admin up


12) Handle NODE-C’s Other LIFs

12.1) Cluster LIFs (Node Local)
Re-home as required (one LIF at a time).

12.2) Node-Mgmt LIF (Node Local)
Re-home as required.

12.3) Intercluster LIF (Node Local)
Re-home as required.

13) Use ARL to Relocate Aggregates from NODE-B to NODE-C

13.1) Perform Aggregate Relocation
Note 1: “-override-destination-checks true” is required since NODE-B and NODE-C are not a HA-pair.
Note 2: “-override-vetoes true” may be required
Relocate all the data aggregates from NODE-B to NODE-C::>

set adv
aggr show -owner-name NODE-B -root false

aggr relocation start -node NODE-B -destination NODE-C -aggregate-list AGGR_NAME -ndo-controller-upgrade true -override-destination-checks true
## REPEAT UNTIL ALL DATA AGGREGATES HAVE BEEN RELOCATED ##

storage aggregate relocation show -node NODE-B

Proceed once all data aggregates owned by NODE-B have been successfully relocated.

13.2) Verify Aggregates are Online and Check for Offline Volumes::>

storage aggregate show -nodes NODE-C -root false
volume show -node NODE-C -state offline


13.3) Verify “Node owns partner aggregates as part of the non-disruptive head upgrade procedure”::>

storage failover show -node NODE-C


13.4) Verify that NODE-B does not own any non-root aggregates that are online::>

storage aggregate show -owner-name NODE-B -ha-policy sfo -state online


14) Record NODE-B info

14.1) Networking Data Collection::>

network interface show -curr-node NODE-B
network port show -node NODE-B
network fcp adapter show -node NODE-B
ifgrp show -node NODE-B


14.2) Record/Verify SYSID (used later)::>

system node show -instance -node NODE-B


15) Migrate Data LIFs off NODE-B

15.1) Migrate data LIFs off NODE-B
Typical commands::>

net int show -curr-node NODE-B
net int modify -vserver VSERVER -lif LIF_NAME -home-port PORT -home-node NODENAME
net int revert -vserver VSERVER -lif LIF_NAME


15.2) Down SAN LIFs
Typical commands::>

net int show -curr-node NODE-B
net int modify -vserver VSERVER -lif LIF_NAME -status-admin down


16) Handle NODE-B’s Other LIFs and Networking

16.1) Cluster LIFs (Node Local)
It is important that at least one cluster port maps from the old platform to the new platform. Cluster LIFs would be temporarily homed on this port.

16.2) Cluster-Mgmt LIF (Cluster Wide)
If Cluster-Mgmt LIF(s) are on NODE-B then rehome to another node.

16.3) Node-Mgmt LIF (Node Local)
Ideally rehome to a port in-common to the old and new platform.

16.4) Intercluster LIF (Node Local)
Ideally rehome to a port in-common to the old and new platform.

16.5) Broadcast-Domains, Failover-Groups, VLANs, IFGRPs
Handle as required.

17) Retire NODE-B

17.1) Verify cluster status::>

set -c off; set adv; cluster show


17.2) (2-Node Cluster) Make sure epsilon is on NODE-C (which is staying up)::>

cluster modify -node NODE-B -epsilon false
cluster modify -node NODE-C -epsilon true


17.3) Verify no Data LIFs on NODE-B::>

net int show -role data -curr-node NODE-B


17.4) Halt NODE-B::>

system node halt -node NODE-B


17.5) Decommission NODE-B
- Wait for NODE-B to display the LOADER> prompt.
- (Dual-Chassis Configuration Only!) Power off
- Uncable NODE-B
- (Shared Chassis Configuration) Remove NODE-B from the chassis
- (Dual-Chassis Configuration) Remove NODE-B from the cabinet

17.6) (As required) Remove cards from NODE-B and install into NODE-D

Comments