Feature request: Predict survival time #37

hfrick · 2024-01-08T12:13:10Z

This issue showed someone trying to predict survival time with aorsf via tidymodels. We currently only have predictions of the survival probability implemented in censored. Looking around aorsf I didn't see any prediction type that we could wrap for "survival time". Is that correct? Would you consider implementing that? 🙌

bcjaeger · 2024-01-08T16:25:52Z

Thank you! I'd be happy to implement this. The biggest obstacle on my end is deciding how to do it. There are a few ways that could work:

Compute median time-to-event in each predicted leaf and then aggregate (similar to bag_tree-rpart.R file in censored)
Compute probability of censored weights (PCW), then fit a regression forest with those weights (similar to how you compute C-stat/Brier score using inverse PCW, building on ideas in this paper)
Compute predicted mortality with aorsf and then use one of the existing survival time prediction methods to convert the predicted mortality to predicted time to event.

My thoughts on these:

I'd estimate that option 1. would take the most time to develop, followed by option 2, and then option 3.
I think 1. would have to be implemented in aorsf, 2. could be implemented in either aorsf or censored, and so could 3.
I have no idea which method would actually work best! That's not ideal because I'm tempted to develop all three and then compare them, but I realize you may not want to wait that long =/

@hfrick, do you have thoughts or preferences on how I should proceed? My initial impression is that I like option 1 because it would be the most efficient computationally. However, it would also take me a little while to get it working and then run it through proper tests to make sure it's right.

hfrick · 2024-01-10T10:52:26Z

Option 1 of median time-to-event is, I think, the most common option and sounds the most straightforward in terms of definition. It'd be great to see that feature live in aorsf given that I think it'd be attractive for users both of aorsf directly and via a framework. Re time: no particular rush. We are currently actively working on survival analysis in tidymodels and want to release a whole lot of new features across the framework in Q1 but we can integrate survival time prediction via aorsf in censored at any time.

bcjaeger · 2024-01-10T15:01:34Z

Thank you! I appreciate your thoughts on this very much. I will move ahead using median time-to-event and keep you updated.

hfrick · 2024-01-10T15:18:14Z

Thanks so much for your willingness to implement this! 🙏

bcjaeger · 2024-01-23T03:54:37Z

Hello @hfrick! I'm happy to share an update. With aorsf version 0.1.3 and higher, models can predict survival time (reprex below). I have done some preliminary assessment of the predicted survival times and they seem to be a little less effective at discriminating high versus low risk cases than the mortality (pred_type = 'mort') option. This makes sense to me. I think mortality predictions do a better job of quantifying observed events.

Do you think it would be feasible for me to propose making predicted mortality the default for aorsf in yardstick::concordance_survival(), instead of predicted time? If so, I'd be happy to work on a PR implementing that change. If not, I'm happy to at least resolve the compatibility issue noted in tidymodels/yardstick#475

library(aorsf)

fit_time <- orsf(pbc_orsf, time + status ~ . - id, 
                 oobag_pred_type = 'time')

predict(fit_time, new_data = pbc_orsf[1:3, ], pred_type = 'time')
#>          [,1]
#> [1,]  360.580
#> [2,] 2555.766
#> [3,] 1195.855

fit_time$eval_oobag$stat_values
#>           [,1]
#> [1,] 0.8360331

fit_mort <- orsf_update(fit_time, oobag_pred_type = 'mort')

fit_mort$eval_oobag$stat_values
#>           [,1]
#> [1,] 0.8435335

^{Created on 2024-01-22 with reprex v2.1.0}

hfrick · 2024-01-23T13:51:24Z

That's awesome, thank you! 🎉 I've opened tidymodels/censored#301 to enable that in censored. Given that there is such a high focus on consistency across tidymodels, I don't think we are likely to change what the default is for any one engine. At that abstraction level, the goal is typically to not have to remember details about an engine. Mortality predictions are also currently not part of tidymodels but that is something that might change in future. If that happens, that would be the opportunity to enable that for aorsf and others and possibly revist defaults.

bcjaeger · 2024-01-23T16:52:52Z

I totally understand prioritizing consistency! This is a good incentive for me to investigate more thoughtful ways for aorsf to predict survival time. I will check out tidymodels/censored#301 and prepare a PR. If there is a deadline for that feature being in censored, just let me know and I'll be happy to coordinate.

Thanks for your help improving aorsf! It is great working with you.

bblodfon · 2024-03-01T12:57:55Z

Hi @bcjaeger! Sorry for intruding in this issue :)

Could we maybe have the survival time in mlr3extralearners as well (this would be a response prediction type, see mlr3proba::.surv_return())? https://github.com/mlr-org/mlr3extralearners/blob/main/R/learner_aorsf_surv_aorsf.R#L178
I was just reading a paper where they the authors calculate survival time from a distribution S(t). In the end, a time-interval weighted approach might be applicable to aorsf and easy to implement as you get the survival matrix S(t) (observations x times) and can implement easily the equation (6) from that paper (I think it should have a denominator of (t_max - t_min) in there as well...). Of course, these type of calculations might not be ideal in cases where the distribution is improper, as was shown in the C-hacking paper, fig 2

bcjaeger · 2024-03-01T13:29:47Z

Very nice! I will try this out

hfrick mentioned this issue Jan 23, 2024

Enable predictions with type = "time" for aorsf tidymodels/censored#301

Closed

bcjaeger closed this as completed Jan 23, 2024

bblodfon mentioned this issue Mar 1, 2024

try mean instead of median survival time prediction #41

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: Predict survival time #37

Feature request: Predict survival time #37

hfrick commented Jan 8, 2024

bcjaeger commented Jan 8, 2024

hfrick commented Jan 10, 2024

bcjaeger commented Jan 10, 2024

hfrick commented Jan 10, 2024

bcjaeger commented Jan 23, 2024

hfrick commented Jan 23, 2024

bcjaeger commented Jan 23, 2024

bblodfon commented Mar 1, 2024 •

edited

Loading

bcjaeger commented Mar 1, 2024

Feature request: Predict survival time #37

Feature request: Predict survival time #37

Comments

hfrick commented Jan 8, 2024

bcjaeger commented Jan 8, 2024

hfrick commented Jan 10, 2024

bcjaeger commented Jan 10, 2024

hfrick commented Jan 10, 2024

bcjaeger commented Jan 23, 2024

hfrick commented Jan 23, 2024

bcjaeger commented Jan 23, 2024

bblodfon commented Mar 1, 2024 • edited Loading

bcjaeger commented Mar 1, 2024

bblodfon commented Mar 1, 2024 •

edited

Loading