Asymmetric Shapley Values

The SageMaker Clarify time series forecasting model explanation solution is a feature attribution method rooted in cooperative game theory, similar in spirit to SHAP. Specifically, Clarify uses random order group values, also known as asymmetric Shapley values in machine learning and explainability.

Background

The goal is to compute attributions for input features to a given forecasting model f. The forecasting model takes the following inputs:

- Past time series (target TS). For example, this could be the past daily number of train passengers on the Paris-Berlin route, denoted by x_t.
- (Optional) A covariate time series. For example, this could be festivities and weather data, denoted by z_t ∈ R^S. When used, the covariate TS could be available only for the past time steps or also for the future ones (for example, festivities that are already listed in a calendar).
- (Optional) Static covariates, such as quality of service (like 1st or 2nd class), denoted by u ∈ R^E.

Static covariates, dynamic covariates, or both can be omitted, depending on the specific application scenario. Given a prediction horizon K ≥ 1 (for example, K = 30 days), the model prediction can be characterized by the formula f(x_{[1:T]}, z_{[1:T+K]}, u) = x_{[T+1:T+K]}.
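To make the shapes of the three inputs concrete, the following is a minimal sketch of a forecaster with this signature. The function `toy_forecast` and its arithmetic are hypothetical and chosen only for illustration; they are not part of Clarify or of any real model.

```python
import numpy as np

def toy_forecast(x, z, u, K):
    """Hypothetical forecaster with the signature f(x, z, u) -> K future values.

    x: target time series, shape (T,)
    z: covariate time series for past and future steps, shape (T + K, S)
    u: static covariates, shape (E,)
    """
    T = x.shape[0]
    assert z.shape[0] == T + K, "covariates must cover past and future steps"
    level = x[-3:].mean()             # recent level of the target series
    future_cov = z[T:].sum(axis=1)    # one value per forecasted step, shape (K,)
    return level + 0.5 * future_cov + 0.1 * u.sum()

T, K, S, E = 10, 3, 2, 1
x = np.arange(T, dtype=float)   # past target values
z = np.ones((T + K, S))         # covariates known for past and future steps
u = np.array([2.0])             # a single static covariate
y = toy_forecast(x, z, u, K)    # one forecast per future step, shape (K,)
```

The point of the sketch is only the bookkeeping: the target covers T past steps, the covariates cover T + K steps when future values are available, and the output holds one value per forecasted step.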

The following diagram shows a dependency structure for a typical forecasting model. The prediction at time t+1 depends on the three types of inputs previously mentioned.

Method

Explanations are computed by querying the time series model f on a series of points derived from the original input. Following game-theoretic constructions, Clarify averages the differences in predictions caused by iteratively obfuscating (that is, setting to a baseline value) parts of the input. The temporal structure can be navigated in chronological order, anti-chronological order, or both. Chronological explanations are built by iteratively adding information starting from the first time step, whereas anti-chronological explanations start from the last time step. The latter mode may be more appropriate in the presence of recency bias, such as when forecasting stock prices. One important property of the computed explanations is that they sum to the original model output if the model provides deterministic outputs.
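The idea of revealing time steps one at a time can be sketched as follows. This is an illustration under stated assumptions, not Clarify's implementation: `toy_model` and `timewise_attributions` are hypothetical names, the baseline is assumed to be all zeros, static covariates are left out, and the sum property is read as "attributions plus the fully obfuscated prediction equal the model output".

```python
import numpy as np

def toy_model(x, z):
    """Hypothetical deterministic forecaster: x is (T,), z is (T + K, S); returns (K,)."""
    T = x.shape[0]
    return x.mean() + x[-1] * z[T:, 0]

def timewise_attributions(f, x, z, K, order="chronological"):
    """Reveal one time step at a time and record the change in the forecast."""
    T = x.shape[0]
    # Start from a fully obfuscated input (assumption: zero baseline).
    x_cur, z_cur = np.zeros_like(x), np.zeros_like(z)
    offset = f(x_cur, z_cur)        # prediction with all data obfuscated
    prev = offset
    A = np.zeros((T + K, K))        # row t: attribution of time step t
    if order == "chronological":
        steps = range(T + K)
    else:                           # anti-chronological: start from the last step
        steps = range(T + K - 1, -1, -1)
    for t in steps:
        if t < T:
            x_cur[t] = x[t]         # target values exist only for past steps
        z_cur[t] = z[t]             # covariates may also cover future steps
        cur = f(x_cur, z_cur)
        A[t] = cur - prev           # change in prediction from revealing step t
        prev = cur
    return A, offset

x = np.array([1.0, 2.0, 4.0])                       # T = 3
z = np.array([[0.0], [1.0], [0.0], [2.0], [3.0]])   # T + K = 5 rows, S = 1
K = 2
A_chrono, B = timewise_attributions(toy_model, x, z, K, "chronological")
A_anti, _ = timewise_attributions(toy_model, x, z, K, "anti_chronological")
forecast = toy_model(x, z)
```

Because the differences telescope, `A_chrono.sum(axis=0) + B` and `A_anti.sum(axis=0) + B` both recover `forecast`, while the individual rows differ between the two orders whenever the model has interactions between time steps.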

Resulting attributions

Resulting attributions are scores that mark individual contributions of specific time steps or input features toward the final forecast at each forecasted time step. Clarify offers the following two granularities for explanations:

- Timewise explanations are inexpensive and provide information about specific time steps only, such as how much the information of the 19th day in the past contributed to the forecast of the 1st day in the future. These attributions do not explain static covariates individually, and they aggregate the explanations of the target and covariate time series. The attributions are a matrix A where each A_tk is the attribution of time step t toward the forecast of time step T+k. Note that if the model accepts future covariates, t can be greater than T.
- Fine-grained explanations are more computationally intensive and provide a full breakdown of all attributions of the input variables.

Note
Fine-grained explanations only support chronological order.

For fine-grained explanations, the resulting attributions are a triplet composed of the following:
- Matrix A^x ∈ R^{T×K} related to the input time series, where A^x_tk is the attribution of x_t toward forecasting step T+k
- Tensor A^z ∈ R^{(T+K)×S×K} related to the covariate time series, where A^z_tsk is the attribution of z_ts (that is, the s-th covariate TS at time step t) toward forecasting step T+k
- Matrix A^u ∈ R^{E×K} related to the static covariates, where A^u_ek is the attribution of u_e (the e-th static covariate) toward forecasting step T+k

Regardless of the granularity, the explanation also contains an offset vector B ∈ R^K that represents the “basic behavior” of the model when all data is obfuscated.
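As a rough illustration of how the fine-grained outputs fit together, the sketch below builds placeholder arrays with the stated shapes and adds them up per forecasted step. The values are random stand-ins, not Clarify output, and the variable names are hypothetical. It also assumes that the offset B is counted together with the attributions when they are summed to recover the forecast.

```python
import numpy as np

T, K, S, E = 4, 2, 3, 1
rng = np.random.default_rng(0)

# Placeholder attributions with the documented shapes (random values, not real output).
A_x = rng.normal(size=(T, K))          # target time series attributions
A_z = rng.normal(size=(T + K, S, K))   # covariate time series attributions
A_u = rng.normal(size=(E, K))          # static covariate attributions
B = rng.normal(size=K)                 # offset: model behavior with all data obfuscated

# Sum every contribution for each forecasted step T+k.
reconstructed = B + A_x.sum(axis=0) + A_z.sum(axis=(0, 1)) + A_u.sum(axis=0)

# Total contribution of each covariate series, summed over time steps.
per_covariate = A_z.sum(axis=0)        # shape (S, K)
```

With real attributions from a deterministic model, `reconstructed` would be expected to match the model's forecast for each of the K steps.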
