Metric Description
AsynchronousSearchSubmissionRateThe number of asynchronous searches submitted in the last minute.
AsynchronousSearchInitializedRateThe number of asynchronous searches initialized in the last minute.
AsynchronousSearchRunningCurrentThe number of asynchronous searches currently running.
AsynchronousSearchCompletionRateThe number of asynchronous searches successfully completed in the last minute.
AsynchronousSearchFailureRateThe number of asynchronous searches that completed and failed in the last minute.
AsynchronousSearchPersistRateThe number of asynchronous searches that persisted in the last minute.
AsynchronousSearchPersistFailedRateThe number of asynchronous searches that failed to persist in the last minute.
AsynchronousSearchRejectedThe total number of asynchronous searches rejected since the node up time.
AsynchronousSearchCancelledThe total number of asynchronous searches cancelled since the node up time.
AsynchronousSearchMaxRunningTimeThe duration of longest running asynchronous search on a node in the last minute.
Asynchronous search cluster statistics
Metric Description
AsynchronousSearchStoreHealthThe health of the store in the persisted index (RED/non-RED) in the last minute.
AsynchronousSearchStoreSizeThe size of the system index across all shards in the last minute.
AsynchronousSearchStoredResponseCountThe numbers of stored responses in the system index in the last minute.
SQL metrics
Amazon OpenSearch Service provides the following metrics for SQL support (p. 220).
Metric Description
SQLFailedRequestCountByCusErrThe number of requests to the _sql API that failed due to a client issue.
For example, a request might return HTTP status code 400 due to an IndexNotFoundException.
Relevant statistics: Sum
SQLFailedRequestCountBySysErrThe number of requests to the _sql API that failed due to a server problem or feature limitation. For example, a request might return HTTP status code 503 due to a VerificationException.
Relevant statistics: Sum
k-NN metrics
Metric Description
SQLRequestCount The number of requests to the _sql API.
Relevant statistics: Sum
SQLDefaultCursorRequestCountSimilar to SQLRequestCount but only counts pagination requests.
Relevant statistics: Sum
SQLUnhealthy A value of 1 indicates that, in response to certain requests, the SQL plugin is returning 5xx response codes or passing invalid query DSL to OpenSearch. Other requests should continue to succeed. A value of 0 indicates no recent failures. If you see a sustained value of 1, troubleshoot the requests your clients are making to the plugin.
Relevant statistics: Maximum
k-NN metrics
Amazon OpenSearch Service includes the following metrics for the k-nearest neighbor (k-NN (p. 223)) plugin.
Metric Description
KNNCacheCapacityReached Per-node metric for whether cache capacity has been reached. This metric is only relevant to approximate k-NN search.
Relevant statistics: Maximum
KNNCircuitBreakerTriggered Per-cluster metric for whether the circuit breaker is triggered. If any nodes return a value of 1 for KNNCacheCapacityReached, this value will also return 1. This metric is only relevant to approximate k-NN search.
Relevant statistics: Maximum
KNNEvictionCount Per-node metric for the number of graphs that have been evicted from the cache due to memory constraints or idle time. Explicit evictions that occur because of index deletion are not counted. This metric is only relevant to approximate k-NN search.
Relevant statistics: Sum
KNNGraphIndexErrors Per-node metric for the number of requests to add the knn_vector field of a document to a graph that produced an error.
Relevant statistics: Sum
KNNGraphIndexRequests Per-node metric for the number of requests to add the knn_vector field of a document to a graph.
Relevant statistics: Sum
KNNGraphMemoryUsage Per-node metric for the current cache size (total size of all graphs in memory) in kilobytes. This metric is only relevant to approximate k-NN search.
k-NN metrics
Metric Description
Relevant statistics: Average
KNNGraphQueryErrors Per-node metric for the number of graph queries that produced an error.
Relevant statistics: Sum
KNNGraphQueryRequests Per-node metric for the number of graph queries.
Relevant statistics: Sum
KNNHitCount Per-node metric for the number of cache hits. A cache hit occurs when a user queries a graph that is already loaded into memory.
This metric is only relevant to approximate k-NN search.
Relevant statistics: Sum
KNNLoadExceptionCount Per-node metric for the number of times an exception occurred while trying to load a graph into the cache. This metric is only relevant to approximate k-NN search.
Relevant statistics: Sum
KNNLoadSuccessCount Per-node metric for the number of times the plugin successfully loaded a graph into the cache. This metric is only relevant to approximate k-NN search.
Relevant statistics: Sum
KNNMissCount Per-node metric for the number of cache misses. A cache miss occurs when a user queries a graph that is not yet loaded into memory. This metric is only relevant to approximate k-NN search.
Relevant statistics: Sum
KNNQueryRequests Per-node metric for the number of query requests the k-NN plugin received.
Relevant statistics: Sum
KNNScriptCompilationErrors Per-node metric for the number of errors during script compilation.
This statistic is only relevant to k-NN score script search.
Relevant statistics: Sum
KNNScriptCompilations Per-node metric for the number of times the k-NN script has been compiled. This value should usually be 1 or 0, but if the cache containing the compiled scripts is filled, the k-NN script might be recompiled. This statistic is only relevant to k-NN score script search.
Relevant statistics: Sum
KNNScriptQueryErrors Per-node metric for the number of errors during script queries. This statistic is only relevant to k-NN score script search.
Relevant statistics: Sum