Menu
Amazon CloudWatch
User Guide

Amazon Redshift Metrics and Dimensions

Amazon Redshift sends metrics to CloudWatch for each active cluster every minute. Detailed monitoring is enabled by default. For more information, see Monitoring Amazon Redshift Cluster Performance in the Amazon Redshift Cluster Management Guide.

Amazon Redshift Metrics

The AWS/Redshift namespace includes the following metrics.

Metric Description
CPUUtilization

The percentage of CPU utilization. For clusters, this metric represents an aggregation of all nodes (leader and compute) CPU utilization values.

Units: Percent

Dimensions: NodeID, ClusterIdentifier

DatabaseConnections

The number of database connections to a cluster.

Units: Count

Dimensions: ClusterIdentifier

HealthStatus

Indicates the health of the cluster. Every minute the cluster connects to its database and performs a simple query. If it is able to perform this operation successfully, the cluster is considered healthy. Otherwise, the cluster is unhealthy. An unhealthy status can occur when the cluster database is under extremely heavy load or if there is a configuration problem with a database on the cluster.

Note

In Amazon CloudWatch this metric is reported as 1 or 0 whereas in the Amazon CloudWatch console, this metric is displayed with the words HEALTHY or UNHEALTHY for convenience. When this metric is displayed in the Amazon CloudWatch console, sampling averages are ignored and only HEALTHY or UNHEALTHY are displayed. In Amazon CloudWatch, values different than 1 and 0 may occur because of sampling issue. Any value below 1 for HealthStatus is reported as 0 (UNHEALTHY).

Units: 1/0 (HEALTHY/UNHEALTHY in the Amazon CloudWatch console)

Dimensions: ClusterIdentifier

MaintenanceMode

Indicates whether the cluster is in maintenance mode.

Note

In Amazon CloudWatch this metric is reported as 1 or 0 whereas in the Amazon CloudWatch console, this metric is displayed with the words ON or OFF for convenience. When this metric is displayed in the Amazon CloudWatch console, sampling averages are ignored and only ON or OFF are displayed. In Amazon CloudWatch, values different than 1 and 0 may occur because of sampling issues. Any value greater than 0 for MaintenanceMode is reported as 1 (ON).

Units: 1/0 (ON/OFF in the Amazon CloudWatch console).

Dimensions: ClusterIdentifier

NetworkReceiveThroughput

The rate at which the node or cluster receives data.

Units: Bytes/seconds (MB/s in the Amazon CloudWatch console)

Dimensions: NodeID, ClusterIdentifier

NetworkTransmitThroughput

The rate at which the node or cluster writes data.

Units: Bytes/second (MB/s in the Amazon CloudWatch console)

Dimensions: NodeID, ClusterIdentifier

PercentageDiskSpaceUsed

The percent of disk space used.

Units: Percent

Dimensions: NodeID, ClusterIdentifier

QueriesCompletedPerSecond

This metric is used to determine Query Throughput. The metric is the average number of queries completed per second, reported in five-minute intervals.

Units: Count/second

Dimensions: latency

QueryDuration

The average amount of time to complete a query. Reported in five-minute intervals.

Units: Microseconds

Dimensions: latency

QueryRuntimeBreakdown

The amount of time all active queries have spent in various stages of execution during the previous five minutes.

Units: Milliseconds

Dimensions: Stage

ReadIOPS

The average number of disk read operations per second.

Units: Count/second

Dimensions: NodeID

ReadLatency

The average amount of time taken for disk read I/O operations.

Units: Seconds

Dimensions: NodeID

ReadThroughput

The average number of bytes read from disk per second.

Units: Bytes (GB/s in the Amazon CloudWatch console)

Dimensions: NodeID

WLMQueriesCompletedPerSecond

This metric is used to determine Query Throughput for a Workload Management queue. The metric is the average number of queries completed per second for a Workload Management (WLM) queue, reported in five-minute intervals.

Units: Count/second

Dimensions: wlmid

WLMQueryDuration

The average length of time to complete a query for a Workload Management (WLM) queue. Reported in five-minute intervals.

Units: Microseconds

Dimensions: wlmid

WriteIOPS

The average number of disk write operations per second.

Units: Count/seconds

Dimensions: NodeID

WriteLatency

The average amount of time taken for disk write I/O operations.

Units: Seconds

Dimensions: NodeID

WriteThroughput

The average number of bytes written to disk per second.

Units: Bytes (GB/s in the Amazon CloudWatch console)

Dimensions: NodeID

Dimensions for Amazon Redshift Metrics

Amazon Redshift data can be filtered along any of the following dimensions in the table below.

Amazon Redshift data can be filtered along any of the dimensions in the table following.

Dimension Description
latency

Values are short, medium, and long.

Short is less than 10 seconds, medium is between 10 seconds and 10 minutes, and long is over 10 minutes.

NodeID

Filters requested data that is specific to the nodes of a cluster. NodeID will be either "Leader", "Shared", or "Compute-N" where N is 0, 1, ... for the number of nodes in the cluster. "Shared" means that the cluster has only one node, i.e. the leader node and compute node are combined.

Metrics are reported for the leader node and compute nodes only for CPUUtilization, NetworkTransmitThroughput, and ReadIOPS. Other metrics that use the NodeId dimension are reported only for compute nodes.

ClusterIdentifier

Filters requested data that is specific to the cluster. Metrics that are specific to clusters include HealthStatus, MaintenanceMode, and DatabaseConnections. In general metrics in for this dimension (e.g. ReadIOPS) that are also metrics of nodes represent an aggregate of the node metric data. You should take care in interpreting these metrics because they aggregate behavior of leader and compute nodes.

Stage

The execution stages for a query. The possible values are:

  • QueryPlanning: Time spent parsing and optimizing SQL statements .

  • QueryWaiting: Time spent waiting in the wlm queue.

  • QueryExecutingRead: Time spent executing read queries.

  • QueryExecutingInsert: Time spent executing insert queries.

  • QueryExecutingDelete: Time spent executing delete queries.

  • QueryExecutingUpdate: Time spent executing update queries.

  • QueryExecutingCtas: Time spent executing create table as ... queries.

  • QueryExecutingUnload: Time spent executing unload queries.

  • QueryExecutingCopy: Time spent executing copy queries.

  • QueryCommit: Time spent committing.

wmlid

The identifier for a Workload Management Queue.