Right-Censored Log Loss Survival Measure

Calculates the right-censored log-likelihood (RCLL) or logarithmic loss, introduced by Avati et al. (2020).

Details

The observation-wise RCLL is defined by:

$$L_{RCLL}(S_i, t_i, \delta_i) = -\log[\delta_i f_i(t_i) + (1 - \delta_i) S_i(t_i)]$$

where $\delta_i$ is the censoring indicator, $f_i$ the predicted probability density function and $S_i$ the predicted survival function for observation $i$. RCLL is proper provided that the censoring and survival distributions are independent; see Rindt et al. (2022). Simulation studies by Sonabend et al. (2024) provide strong empirical evidence supporting the properness of this score. See section Interpolation for details on how $S_i$ and $f_i$ are evaluated.

To get a single score across all $N$ observations of the test set, we return the average of the observation-wise scores:

$$\sum_{i=1}^N L_{RCLL}(S_i, t_i, \delta_i) / N$$
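The two formulas above can be sketched in a few lines of base R. This is only an illustration, not the package's implementation: the function name rcll_obs and the toy input values are hypothetical, and the density and survival values are assumed to have already been evaluated at each observed time.

```r
# Hypothetical helper: observation-wise RCLL from pre-computed values.
# dens[i]   = f_i(t_i), predicted density at the observed time
# surv[i]   = S_i(t_i), predicted survival probability at the observed time
# status[i] = delta_i, 1 if the event was observed, 0 if censored
rcll_obs = function(dens, surv, status) {
  -log(status * dens + (1 - status) * surv)
}

# Toy values (made up for illustration): one event, one censored observation
dens = c(0.20, 0.05)
surv = c(0.60, 0.80)
status = c(1, 0)

losses = rcll_obs(dens, surv, status)

# Single score: average of the observation-wise losses
rcll = mean(losses)
```

The event contributes through its density and the censored observation through its survival probability, so only one of the two terms inside the logarithm is non-zero for any given observation.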

Dictionary

This Measure can be instantiated via the dictionary mlr_measures or with the associated sugar function msr():

MeasureSurvRCLL$new()
mlr_measures$get("surv.rcll")
msr("surv.rcll")
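A minimal usage sketch follows, assuming mlr3 and mlr3proba are installed; the "rats" task and the Kaplan-Meier learner are chosen purely for illustration, and scoring on the training data is done only to keep the example short. The guard lets the snippet run without error when the packages are missing.

```r
score = NULL

if (requireNamespace("mlr3proba", quietly = TRUE)) {
  library(mlr3)
  library(mlr3proba)

  # Illustrative choices: any survival task and any learner that
  # returns a distr prediction could be used instead
  task = tsk("rats")
  learner = lrn("surv.kaplan")

  # Train and predict on the same data (for brevity only)
  prediction = learner$train(task)$predict(task)

  # Score the prediction with the RCLL measure
  score = prediction$score(msr("surv.rcll"))
  print(score)
}
```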

Parameters

Id	Type	Default	Levels	Range
eps	numeric	1e-06		$[0, 1]$
ERV	logical	FALSE	TRUE, FALSE	-

Meta Information

Type: "surv"
Range: $[0, \infty)$
Minimize: TRUE
Required prediction: distr

Parameter details

eps (numeric(1))
Very small number to substitute near-zero values in order to prevent errors in e.g. log(0) and/or division-by-zero calculations. Default value is 1e-06.

ERV (logical(1))
If TRUE then the Explained Residual Variation method is applied, which means the score is standardized against a Kaplan-Meier baseline. Default is FALSE.
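The effect of the two parameters can be sketched as follows. Both helpers are hypothetical and rest on assumptions not spelled out above: that eps replaces any value smaller than eps before the logarithm is taken, and that the ERV standardization is one minus the ratio of the model score to the Kaplan-Meier baseline score.

```r
# Assumption: values below eps are raised to eps so that log() stays finite
clamp_eps = function(x, eps = 1e-6) {
  pmax(x, eps)
}

# Assumption: ERV is the relative improvement over the baseline score,
# so 0 means "no better than Kaplan-Meier" and larger is better
erv = function(score_model, score_baseline) {
  1 - score_model / score_baseline
}

# A likelihood contribution of exactly 0 would give -log(0) = Inf
loss_raw = -log(0)
loss_clamped = -log(clamp_eps(0))

# Made-up scores for illustration only
erv_value = erv(score_model = 0.8, score_baseline = 1.0)
```

Under these assumptions, loss_clamped is finite while loss_raw is not, and erv_value indicates a 20% reduction in loss relative to the baseline.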

Interpolation

To evaluate scores involving subject-specific survival functions $S_i(t)$, we perform linear interpolation on the discrete survival values provided in the prediction. Duplicate survival values are removed prior to interpolation to ensure strict monotonicity and non-negative density values. Therefore we are left with the distinct survival time points $t_0 < \cdots < t_n$ and the corresponding survival values $S(t_j)$.

Interpolation is performed using base R’s approx() with method = "linear" and rule = 2, ensuring:

Left extrapolation (for $t < t_0$) assumes $S(0) = 1$ and uses the slope from $(0, 1)$ to $(t_0, S(t_0))$.
Right extrapolation (for $t > t_n$) uses the slope from the last interval $(t_{n-1}, S(t_{n-1}))$ to $(t_n, S(t_n))$, with results truncated at 0 to preserve non-negativity.

This ensures a continuous, piecewise-linear survival function $S(t)$ that satisfies $S(0) = 1$ and remains non-increasing and non-negative across the entire domain.
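The interpolation scheme described above can be sketched in base R as shown here. The function interp_surv is a hypothetical helper written for this illustration, not the package's internal code; it assumes strictly positive, increasing time points and handles right extrapolation manually after calling approx().

```r
# Hypothetical helper: piecewise-linear survival curve with S(0) = 1.
# times: strictly positive, increasing grid; surv: survival values on that grid
interp_surv = function(times, surv, eval_times) {
  # Drop repeated survival values so the remaining curve is strictly decreasing
  keep = !duplicated(surv)
  x = c(0, times[keep])   # anchor the curve at time 0 ...
  y = c(1, surv[keep])    # ... with survival probability 1
  n = length(x)

  # Linear interpolation inside the grid (this also covers 0 <= t < t_0)
  out = stats::approx(x, y, xout = eval_times, method = "linear", rule = 2)$y

  # Right extrapolation: continue with the slope of the last interval,
  # truncated at 0 so the curve never becomes negative
  slope = (y[n] - y[n - 1]) / (x[n] - x[n - 1])
  beyond = eval_times > x[n]
  out[beyond] = pmax(0, y[n] + slope * (eval_times[beyond] - x[n]))
  out
}

# Toy curve with a repeated survival value at times 2 and 3
times = c(1, 2, 3, 4)
surv = c(0.9, 0.7, 0.7, 0.4)
s_hat = interp_surv(times, surv, eval_times = c(0.5, 3, 5, 10))
```

In this toy example the duplicate at time 3 is dropped, so the value at 3 is interpolated between the points at times 2 and 4, and the value at time 10 is truncated to 0.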

The density of observation $i$ at a time point $t_k$ lying between two consecutive grid points, $t_j \le t_k < t_{j+1}$, is estimated as follows:

$$ f_i(t_k) = -\frac{S_i(t_{j+1}) - S_i(t_j)}{t_{j+1} - t_j} $$

This corresponds to the (negative) slope of $S_i(t)$ over the grid interval $[t_j, t_{j+1})$ that contains $t_k$, so the estimated density is constant within each interval.
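The slope-based density estimate can be sketched as below, writing $j$ for the index of the grid interval that contains the evaluation time. The helper dens_from_surv is hypothetical, and clamping the index so that times outside the grid reuse the nearest interval is an assumption made for this illustration.

```r
# Hypothetical helper: density as the negative slope of a piecewise-linear
# survival curve. times must be strictly increasing and surv strictly decreasing.
dens_from_surv = function(times, surv, t) {
  # Index j of the left grid point, i.e. times[j] <= t < times[j + 1]
  j = findInterval(t, times)

  # Assumption: outside the grid, fall back to the first or last interval
  j = pmin(pmax(j, 1), length(times) - 1)

  # Negative slope of the survival curve on [times[j], times[j + 1])
  -(surv[j + 1] - surv[j]) / (times[j + 1] - times[j])
}

# Toy grid including the anchor point (0, 1)
times = c(0, 1, 2, 4)
surv = c(1, 0.9, 0.7, 0.4)
f_hat = dens_from_surv(times, surv, t = c(0.5, 1, 3, 5))
```

Because the survival values are strictly decreasing after duplicates have been removed, every slope is negative and the resulting density values are strictly positive.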

References

Avati, Anand, Duan, Tony, Zhou, Sharon, Jung, Kenneth, Shah, Nigam H, Ng, Andrew Y (2020). “Countdown Regression: Sharp and Calibrated Survival Predictions.” Proceedings of The 35th Uncertainty in Artificial Intelligence Conference, 115(4), 145–155. https://proceedings.mlr.press/v115/avati20a.html.

Rindt, David, Hu, Robert, Steinsaltz, David, Sejdinovic, Dino (2022). “Survival regression with proper scoring rules and monotonic neural networks.” Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, 151(4), 1190–1205. https://proceedings.mlr.press/v151/rindt22a.html.

Sonabend, Raphael, Zobolas, John, Kopper, Philipp, Burk, Lukas, Bender, Andreas (2024). “Examining properness in the external validation of survival models with squared and logarithmic losses.” https://arxiv.org/abs/2212.05260v3.

See also

Other Probabilistic survival measures: mlr_measures_surv.graf, mlr_measures_surv.intlogloss, mlr_measures_surv.logloss, mlr_measures_surv.schmid

Other distr survival measures: mlr_measures_surv.calib_alpha, mlr_measures_surv.calib_index, mlr_measures_surv.dcalib, mlr_measures_surv.graf, mlr_measures_surv.intlogloss, mlr_measures_surv.logloss, mlr_measures_surv.schmid

Super classes

mlr3::Measure -> mlr3proba::MeasureSurv -> MeasureSurvRCLL

Methods

Method `new()`

Creates a new instance of this R6 class.

Usage

MeasureSurvRCLL$new(ERV = FALSE)

Arguments

ERV: (logical(1))
Standardize measure against a Kaplan-Meier baseline (Explained Residual Variation)

Method `clone()`

The objects of this class are cloneable with this method.

Usage

MeasureSurvRCLL$clone(deep = FALSE)

Arguments

deep: Whether to make a deep clone.
