Skip to content

Scores

Forecast scores are metrics that quantify how well probabilistic or point forecasts match the observed data. All scores are calculated weekly, and the value shown in the dashboard refers to the average over the analyzed period.

For the calculation of CRPS (Continuous Ranked Probability Score) and the Log Score (logarithmic score), forecasts must be parameterized as probability distributions. For the dashboard, we chose to parameterize them as log-normal distributions. Details about the parameterization are available here.

MAE - Mean Absolute Error

Measures the average absolute magnitude of the forecast errors. Lower values indicate that the forecasts are, on average, closer to the actual data. It is calculated by the following expression:

where is the total number of weeks analyzed (or forecasts made), is the actual (observed) value in week , and is the forecasted (estimated) value for week .

MSE - Mean Squared Error

Measures the mean of the squared differences between the forecast and the observation. MSE penalizes larger errors more heavily. Lower values indicate that the forecasts are, on average, closer to the actual data. It is calculated by the following expression:

where is the total number of weeks analyzed (or forecasts made), is the actual (observed) value in week , and is the forecasted (estimated) value for week .

CRPS - Continuous Ranked Probability Score

Measures how well the predicted cumulative distribution function (CDF) matches the observed data. Lower values indicate that the predicted distribution better matches the observed outcome. It is calculated by the following expression:

where

in which is the cumulative distribution function of the standard normal distribution and , where is the observed value in week , and and are the parameters of the log-normal distribution in week . Details on how the distribution parameters are obtained are available here.

Log Score

Measures the logarithm of the probability assigned to the observed value by the predicted distribution. Higher values indicate that the predicted distribution better matches the observed outcome. It is calculated by the following expression:

where

in which is the observed value in week , and and are the parameters of the log-normal distribution in week . Details on how the distribution parameters are obtained are available here.

Interval Score

In the evaluation of forecast performance, it assesses the prediction intervals. In the dashboard, 90% prediction intervals are used. Lower values indicate that the forecasts are, on average, closer to the actual data. It is calculated by the following expression:

where

in which is the indicator function (equals 1 if the condition is true, and 0 otherwise), is the significance level of the interval, is the upper bound of the interval in week , and is the lower bound.