Skip to main content

NetApp

ONTAP Storage Failover and Giveback Runbook

Scope

This article covers planned storage failover testing or maintenance where one node in an HA pair is taken over and later given back. NetApp documents the key commands as storage failover takeover and storage failover giveback: storage failover takeover and storage failover giveback.

Prechecks

system health status show
storage failover show
storage failover show-giveback
cluster show
network interface show -role data
volume show -state !online

Confirm:

CheckRequirement
SFO stateEnabled and possible
Partner healthPartner node healthy enough to serve storage
Data LIFsFailover targets valid
WorkloadsOwners aware of possible pause
SnapMirrorNo critical unexpected lag or unhealthy state

CLI Takeover Process

Use normal takeover for planned maintenance unless support directs otherwise:

storage failover takeover -ofnode node01

Monitor:

storage failover show
storage aggregate show
volume show -node node02

Complete maintenance. Then check giveback readiness:

storage failover show-giveback

Give back:

storage failover giveback -ofnode node01

Monitor until complete:

storage failover show
storage failover show-giveback
system health status show

REST API Validation Process

Use REST to collect pre and post evidence:

curl -k -u admin:'<password>' \
  "https://cluster.example.com/api/cluster/nodes?fields=name,state,ha,uptime"

curl -k -u admin:'<password>' \
  "https://cluster.example.com/api/storage/aggregates?fields=name,node,state,space"

curl -k -u admin:'<password>' \
  "https://cluster.example.com/api/network/ip/interfaces?fields=name,location,state,enabled"

For takeover and giveback execution, use the documented CLI or System Manager workflow unless your automation standard has validated the matching REST operations for your ONTAP release and platform.

Best Practices

Backout

If giveback fails, do not repeatedly force it. Capture:

storage failover show-giveback
event log show -severity ERROR
system health alert show

Resolve the veto or health issue, then retry giveback. Use override options only when the operational risk is understood and approved.

Back to top