Decomposing uncertainty
Bayesian neural networks (BNNs) yield a predictive distribution $p(y \mid x)$, which provides a set of different predictions from which you can estimate the variance $\operatorname{Var}[y \mid x]$; that is, the total predictive uncertainty. The total predictive uncertainty can be split into its two components, epistemic and aleatoric, by using the law of total variance:

$$\operatorname{Var}[y \mid x] \;=\; \underbrace{\operatorname{Var}_{\theta}\!\big[\mathbb{E}[y \mid x, \theta]\big]}_{\text{epistemic}} \;+\; \underbrace{\mathbb{E}_{\theta}\!\big[\operatorname{Var}[y \mid x, \theta]\big]}_{\text{aleatoric}}$$
The expected value of the target variable $y$, given the input $x$ and the random parameters $\theta$ that specify a BNN, $\mathbb{E}[y \mid x, \theta]$, is estimated by the BNN with a single forward propagation and is denoted $\mu_{\theta}(x)$. The variance of the target, given the input and random parameters, $\operatorname{Var}[y \mid x, \theta]$, is also output by the BNN and is denoted $\sigma^{2}_{\theta}(x)$. Thus, the total predictive uncertainty is the sum of the following two terms:
- The variance of the BNN's predicted means, $\operatorname{Var}_{\theta}\big[\mu_{\theta}(x)\big]$: the epistemic uncertainty
- The average of the BNN's predicted variances, $\mathbb{E}_{\theta}\big[\sigma^{2}_{\theta}(x)\big]$: the aleatoric uncertainty
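To make the arithmetic concrete, here is a minimal NumPy sketch of the decomposition. The arrays `mus` and `sigma2s` stand in for a BNN's per-sample predicted means and variances at a single input; their values are made up purely for illustration.

```python
import numpy as np

# Hypothetical per-sample BNN outputs for one input x:
# mus[t] is the predicted mean and sigma2s[t] the predicted variance
# produced by the t-th sampled parameter configuration.
mus = np.array([2.1, 1.9, 2.3, 2.0, 2.2])
sigma2s = np.array([0.30, 0.28, 0.35, 0.31, 0.29])

epistemic = mus.var()       # variance of the predicted means
aleatoric = sigma2s.mean()  # average of the predicted variances
total = epistemic + aleatoric

print(f"epistemic={epistemic:.4f}  aleatoric={aleatoric:.4f}  total={total:.4f}")
```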
The following formula demonstrates how to calculate the total uncertainty in accordance with Kendall and Gal (2017). A BNN takes an input $x$, generates a random parameter configuration $\theta_t \sim p(\theta)$, and makes a single forward propagation through the neural network to output a mean $\mu_t = \mu_{\theta_t}(x)$ and a variance $\sigma^{2}_t = \sigma^{2}_{\theta_t}(x)$. We denote a random generation, or simulation, by $\sim$. With $x$ fixed, you can repeat this process $T$ times to yield a set:

$$\big\{(\mu_t, \sigma^{2}_t)\big\}_{t=1}^{T}, \qquad \theta_t \sim p(\theta)$$
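The sketch below shows one way to generate such a set. The tiny linear model and its Gaussian "posterior" over parameters are purely illustrative stand-ins for a real BNN and its approximate posterior; only the sampling pattern matters here.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical Gaussian approximate posterior over the parameters of a toy
# model that maps a scalar x to a predicted mean and log-variance.
post_mean = np.array([1.2, 0.3, -1.0])   # means of (w_mu, b_mu, b_logvar)
post_std = np.array([0.10, 0.05, 0.20])  # standard deviations of the same

def forward(x, theta):
    """Single forward pass with one fixed parameter draw theta."""
    w_mu, b_mu, b_logvar = theta
    mu = w_mu * x + b_mu       # predicted mean of y given x and theta
    sigma2 = np.exp(b_logvar)  # predicted variance of y given x and theta
    return mu, sigma2

x = 2.0
T = 1000
draws = [forward(x, rng.normal(post_mean, post_std)) for _ in range(T)]  # theta_t ~ p(theta)
mus, sigma2s = map(np.array, zip(*draws))  # the set {(mu_t, sigma2_t)} for t = 1..T
```

The resulting arrays `mus` and `sigma2s` can be plugged directly into the decomposition shown earlier.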
These $T$ samples provide the statistics needed to estimate the uncertainties. You do this by estimating the epistemic uncertainty and the aleatoric uncertainty separately and then taking their sum, as shown in the first equation of this section.
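Written out, one standard form of the resulting Monte Carlo estimator, consistent with Kendall and Gal (2017), replaces the variance of the means and the mean of the variances with their sample versions:

$$\operatorname{Var}[y \mid x] \;\approx\; \underbrace{\frac{1}{T}\sum_{t=1}^{T} \mu_t^{2} \;-\; \Big(\frac{1}{T}\sum_{t=1}^{T} \mu_t\Big)^{2}}_{\text{epistemic}} \;+\; \underbrace{\frac{1}{T}\sum_{t=1}^{T} \sigma^{2}_t}_{\text{aleatoric}}$$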