Wednesday 28 December 2016

NetApp controller CPU high and disk util low

You can use sysstat -x 1 command to list the system performance.
If there is high CPU utilization but low disk utilization, then it is an indicator that internal processes are consuming resources.

First place to look at is WAFL scan. "priv set diag;wafl scan status".

If there are multiple scans running then "wafl scan speed" and the following two diag flags can be modified to reduce the CPU utilization.
setflag wafl_blk_reclaim_secs_max xxx
setflag wafl_blk_reclaim_secs_min yyy

 

Tuesday 27 December 2016

NetApp snapshot deleted but aggr space is not freed up ?

After a snapshot is deleted on a volume, the space is not freed up immediately (especially if the volume is part of a snapmirror relationship).

The snapmirror volume goes through a de-swizzle process. Read more here :
https://kb.netapp.com/support/s/article/what-is-deswizzler-or-deswizzling?language=en_US

You will have to wait for the volume de-swizzle operation and container block reclamation to complete.
You can check the status of the scan using the command priv set advanced;wafl scan status

Wednesday 21 December 2016

Oh ! When did the LIF failover : NetApp VIF

Did you notice that the your network interface is no longer in the home node. The network interface has failed over but there is no alert for it and now you dont know what caused the issue ?

That is right, there is no ERROR or Alert generated for the VIF failover. One way to find out is to look at the logs on the command line. Here is the command that helped me.

event log show -event vifmgr*
 
look for events with severity of information and event type of
vifmgr.lifsuccessfullymoved:

Sunday 18 December 2016

how to find WWN information on a windows 2012 server

It is as simple as running the powershell command "get-intiatorport"

You are interested in the port address column.


NetApp disk failed initialization due to error 5.

Hmmm... just reseat the disk.

Here is the list of things to check:

1) disk show <disk ID>  : Do you see no owner information. In this case, there is no ownership information.

 disk show 0a.10.2
  DISK       OWNER                      POOL   SERIAL NUMBER         HOME 
------------ -------------              -----  -------------         ------------- 

 disk show 0a.10.9
  DISK       OWNER                      POOL   SERIAL NUMBER         HOME 
------------ -------------              -----  -------------         -------------


2) scsi capacity <disk id> : Do you see capacity information
 *> scsi capacity 3a.10.9
      1172123568 blocks x 512 bytes/blk = 600127266816 bytes

3) sysconfig shows the disk has failed.


These are indicators that the system could not take ownership of the disk. You can try a re-seat of the disk.

A reseat of the disk did the trick in my case.

Wednesday 14 December 2016

What command was it ? When did I run that command on Netapp ?

Were you in the middle of an outage yesterday and today you don't remember the command you ran on the NetApp filers or what time the command was run?

This post is here to help.

Step 1: https://<controller IP>/spi/<node name>/etc/log/mlog

Step 2: Look for a file called command-history.log

You should be able to find all the commands that were executed on the controller.

Thursday 8 December 2016

upgrading NetApp firmware (need webserver)

When you are updating firmware/ONTAP on the NetApp infrastructure, you need a HTTP link to download the firmware to the controllers.(There are other ways around it though).

One of the easiest way to configure  a web server is to use the HFS application. Here are three simple steps to creating your webpage.

1) Download the tool from http://www.rejetto.com/hfs/?f=dl
2) Configure the tool.
  • Launch the application
  •  Point to the directory that you want to share.

3) Browse and ensure the right port is chosen.

Commvault : DR backup to cloud fails to run

 The Commvault DR backup to cloud (an option within Control Panel of Commvault console) was reporting failures.  The CVCloudService.log repo...