Scale-out file system metrics - FSx for ONTAP

Scale-out file system metrics

The following metrics are provided for FSx for ONTAP file systems with two or more high-availability (HA) pairs. For the metrics, a datapoint is emitted for each HA pair and for each aggregate (for storage utilization metrics).

Note

If you have a file system with multiple HA pairs, you can also use the single-HA pair file system metrics and the volume metrics.

Network I/O metrics

All of these metrics take two dimensions, FileSystemId and FileServer.

  • FileSystemId – Your file system's AWS resource ID.

  • FileServer – The name of a file server (or node) in ONTAP (for example, FsxId01234567890abcdef-01). Odd-numbered file servers are preferred file servers (that is, they service traffic unless the file system has failed over to the secondary file server), while even-numbered file servers are secondary file servers (that is, they serve traffic only when their partner is unavailable). Because of this, secondary file servers typically show less utilization than preferred file servers.

Metric Description
NetworkThroughputUtilization

Network throughput utilization as a percentage of available network throughput for your file system. This metric is equivalent to the maximum of NetworkSentBytes and NetworkReceivedBytes as a percentage of the network throughput capacity of one HA pair for your file system. All traffic is considered in this metric, including background tasks (such as SnapMirror, tiering, and backups). There is one metric emitted each minute for each of your file system's file servers.

The Average statistic is the average network throughput utilization for the given file server over the specified period.

The Minimum statistic is the lowest network throughput utilization for the given file server over one minute, for the specified period.

The Maximum statistic is the highest network throughput utilization for the given file server over one minute, for the specified period.

Units: Percent

Valid statistics: Average, Minimum, and Maximum

NetworkSentBytes

The number of bytes (network IO) sent by your file system. All traffic is considered in this metric, including background tasks (such as SnapMirror, tiering, and backups). There is one metric emitted each minute for each of your file system's file servers.

The Sum statistic is the total number of bytes sent over the network by the given file server over the specified period.

The Average statistic is the average number of bytes sent over the network by the given file server over the specified period.

The Minimum statistic is the lowest number of bytes sent over the network by the given file server over the specified period.

The Maximum statistic is the highest number of bytes sent over the network by the given file server over the specified period.

To calculate sent throughput (bytes per second) for any statistic, divide the statistic by the seconds in the specified period.

Units: Bytes

Valid statistics: Sum, Average, Minimum, and Maximum

NetworkReceivedBytes

The number of bytes (network IO) received by your file system. All traffic is considered in this metric, including background tasks (such as SnapMirror, tiering, and backups). There is one metric emitted each minute for each of your file system's file servers.

The Sum statistic is the total number of bytes received over the network by the given file server over the specified period.

The Average statistic is the average number of bytes received over the network by the given file server each minute over the specified period.

The Minimum statistic is the lowest number of bytes received over the network by the given file server each minute over the specified period.

The Maximum statistic is the highest number of bytes received over the network by the given file server each minute over the specified period.

To calculate received throughput (bytes per second) for any statistic, divide the statistic by the seconds in the period.

Units: Bytes

Valid statistics: Sum, Average, Minimum, and Maximum

File server metrics

All of these metrics take two dimensions, FileSystemId and FileServer.

Metric Description
CPUUtilization

The percent utilization of the file system's CPU resources. There is one metric emitted each minute for each of your file system's file servers.

The Average statistic is the average CPU utilization of the file system over a specified period.

The Minimum statistic is the lowest CPU utilization for the given file server over the specified period.

The Maximum statistic is the highest CPU utilization for the given file server over the specified period.

Units: Percent

Valid statistics: Average, Minimum, and Maximum

FileServerDiskThroughputUtilization

The disk throughput between your file server and aggregate, as a percentage of the provisioned limit determined by throughput capacity. All traffic is considered in this metric, including background tasks (such as SnapMirror, tiering, and backups). This metric is equivalent to the sum of DiskReadBytes and DiskWriteBytes as a percentage of the file server's disk throughput capacity of one HA pair for your file system. There is one metric emitted each minute for each of your file system's file servers.

The Average statistic is the average file server disk throughput utilization for the given file server over the specified period.

The Minimum statistic is the lowest file server disk throughput utilization for the given file server over the specified period.

The Maximum statistic is the highest file server disk throughput utilization for the given file server over the specified period.

Units: Percent

Valid statistics: Average, Minimum, and Maximum

FileServerDiskIopsUtilization

The IOPS utilization of available disk IOPS capacity for your file server, as a percentage of its disk IOPS limit. This differs from DiskIopsUtilization in that the utilization of disk IOPS out of the maximum that your file server can handle, as opposed to your provisioned disk IOPS. All traffic is considered in this metric, including background tasks (such as SnapMirror, tiering, and backups). There is one metric emitted each minute for each of your file system's file servers.

The Average statistic is the average disk IOPS utilization for the given file server over the specified period.

The Minimum statistic is the lowest disk IOPS utilization for the given file server over the specified period.

The Maximum statistic is the highest disk IOPS utilization for the given file server over the specified period.

Units: Percent

Valid statistics: Average, Minimum, and Maximum

FileServerCacheHitRatio

The percentage of all read requests which are served by data that resides in your file system's RAM or NVMe caches for each of your HA pairs (for example, the active file server in an HA pair). A higher percentage indicates a higher ratio of cached reads to total reads. All I/O is considered, including background tasks (such as SnapMirror, tiering, and backups). There is one metric emitted each minute for each of your file system's file servers.

Units: Percent

The Average statistic is the average cache hit ratio for one of your file system's HA pairs over the specified period.

The Minimum statistic is the lowest cache hit ratio for one of your file system's HA pairs over the specified period.

The Maximum statistic is the highest cache hit ratio for one of your file system's HA pairs over the specified period.

Valid statistics: Average, Minimum, and Maximum

Disk I/O metrics

All of these metrics take two dimensions, FileSystemId and Aggregate.

  • FileSystemId – Your file system's AWS resource ID.

  • Aggregate – Your file system's performance tier consists of multiple storage pools called aggregates. There is one aggregate for each HA pair. For example, aggregate aggr1 maps to file server FsxId01234567890abcdef-01 (the active file server) and file server FsxId01234567890abcdef-02 (the secondary file server) in an HA pair.

Metric Description
DiskReadBytes

The number of bytes (disk IO) from ay disk reads from this aggregate. All traffic is considered in this metric, including background tasks (such as SnapMirror, tiering, and backups). There is one metric emitted each minute for each of your file system's aggregates.

The Sum statistic is the total number of bytes read each minute from the given aggregate over the specified period.

The Average statistic is the average number of bytes read each minute from the given aggregate over the specified period.

The Minimum statistic is the lowest number of bytes read each minute from the given aggregate over the specified period.

The Maximum statistic is the highest number of bytes read each minute from the given aggregate over the specified period.

To calculate read disk throughput (bytes per second) for any statistic, divide the statistic by the seconds in the period.

Units: Bytes

Valid statistics: Sum, Average, Minimum, and Maximum

DiskWriteBytes

The number of bytes (disk IO) from any disk writes to this aggregate. All traffic is considered in this metric, including background tasks (such as SnapMirror, tiering, and backups). There is one metric emitted each minute for each of your file system's aggregates.

The Sum statistic is the total number of bytes written to the given aggregate over the specified period.

The Average statistic is the average number of bytes written to the given aggregate each minute over the specified period.

The Minimum statistic is the lowest number of bytes written to the given aggregate each minute over the specified period.

The Maximum statistic is the highest number of bytes written to the given aggregate each minute over the specified period.

To calculate write disk throughput (bytes per second) for any statistic, divide the statistic by the seconds in the specified period.

Units: Bytes

Valid statistics: Sum, Average, Minimum, and Maximum

DiskIopsUtilization

The disk IOPS utilization of one aggregate, as a percentage of the aggregate's disk IOPS limit (that is, the file system's total IOPS divided by the number of HA pairs for your file system). This differs from FileServerDiskIopsUtilization in that it is the utilization of provisioned disk IOPS against your provisioned IOPS limit, as opposed to the maximum disk IOPS supported by the file server (that is, dictated by your configured throughput capacity per HA pair). All traffic is considered in this metric, including background tasks (such as SnapMirror, tiering, and backups). There is one metric emitted each minute for each of your file system's aggregates.

The Average statistic is the average disk IOPS utilization for the given aggregate over the specified period.

The Minimum statistic is the lowest disk IOPS utilization for the given aggregate over the specified period.

The Maximum statistic ii the highest disk IOPS utilization for the given aggregate over the specified period.

Units: Percent

Valid statistics: Average, Minimum, and Maximum

DiskReadOperations

The number of read operations (disk IO) to this aggregate. All traffic is considered in this metric, including background tasks (such as SnapMirror, tiering, and backups). There is one metric emitted each minute for each of your file system's aggregates.

The Sum statistic is the total number of read operations performed by the given aggregate over the specified period.

The Average statistic is the average number of read operations performed each minute by the given aggregate over the specified period.

The Minimum statistic is the lowest number of read operations performed each minute by the given aggregate over the specified period.

The Maximum statistic is the highest number of read operations performed each minute by the given aggregate over the specified period.

To calculate average disk IOPS over the period, use the Average statistic and divide the result by 60 (seconds).

Units: Count

Valid statistics: Sum, Average, Minimum, and Maximum

DiskWriteOperations

The number of write operations (disk IO) to this aggregate. All traffic is considered in this metric, including background tasks (such as SnapMirror, tiering, and backups). There is one metric emitted each minute for each of your file system's aggregates.

The Sum statistic is the total number of write operations performed by the given aggregate over the specified period.

The Average statistic is the average number of write operations performed each minute by the given aggregate over the specified period.

To calculate average disk IOPS over the period, use the Average statistic and divide the result by 60 (seconds).

Units: Count

Valid statistics: Sum and Average

Detailed file system metrics

Detailed file system metrics are detailed storage-utilization metrics for each of your storage tiers. Detailed file system metrics have either the FileSystemId, StorageTier, and DataType dimensions, or the FileSystemId, StorageTier, DataType, and Aggregate dimensions.

  • When the Aggregate dimension is not supplied, the metrics are for your entire file system. The StorageUsed and StorageCapacity metrics have a single data point each minute corresponding to the file system's total consumed storage (per storage tier) and total storage capacity (for the SSD tier). Meanwhile, the StorageCapacityUtilization metric emits one metric each minute for each aggregate.

  • When the Aggregate dimension is supplied, the metrics are for each aggregate.

The meaning of the dimensions are as follows:

  • FileSystemId – Your file system's AWS resource ID.

  • Aggregate – Your file system's performance tier consists of multiple storage pools called aggregates. There is one aggregate for each HA pair. For example, aggregate aggr1 maps to file server FsxId01234567890abcdef-01 (the active file server) and file server FsxId01234567890abcdef-02 (the secondary file server) in an HA pair.

  • StorageTier – Indicates the storage tier that the metric measures, with possible values of SSD and StandardCapacityPool.

  • DataType – Indicates the type of data that the metric measures, with the possible value All.

There is a row for each unique combination of a given metric and dimensional key-value pairs, with a description of what that combination measures.

Metric Description
StorageCapacityUtilization

The storage capacity utilization for a given file system aggregate. There is one metric emitted each minute for each of your file system's aggregates.

The Average statistic is the average amount of storage capacity utilization for a given aggregate over the specified period.

The Minimum statistic is the minimum amount of storage capacity utilization for a given aggregate over the specified period.

The Maximum statistic is the maximum amount of storage capacity utilization for a given aggregate over the specified period.

Units: Bytes

Valid statistics: Average, Minimum, and Maximum

StorageCapacity

The storage capacity for a given file system aggregate. There is one metric emitted each minute for each of your file system's aggregates.

The Average statistic is the average amount of storage capacity for a given aggregate over the specified period.

The Minimum statistic is the minimum amount of storage capacity for a given aggregate over the specified period.

The Maximum statistic is the maximum amount of storage capacity for a given aggregate over the specified period.

Units: Bytes

Valid statistics: Average, Minimum, and Maximum

StorageUsed

The used physical storage capacity in bytes, specific to the storage tier. This value includes savings from storage-efficiency features, such as data compression and deduplication. Valid dimension values for StorageTier are SSD and StandardCapacityPool, corresponding to the storage tier that this metric measures. There is one metric emitted each minute for each of your file system's aggregates.

The Average statistic is the average amount of physical storage capacity consumed on the given storage tier by the given aggregate over the specified period.

The Minimum statistic is the minimum amount of physical storage capacity consumed on the given storage tier by the given aggregate over the specified period.

The Maximum statistic is the maximum amount of physical storage capacity consumed on the given storage tier by the given aggregate over the specified period.

Units: Bytes

Valid statistics: Average, Minimum, and Maximum