California Whipsnake |
In the end, while our models were statistically significantly better, I don't believe that the difference was of much practical significance. More worthwhile is knowing which signals actually improved our predictive power--these may be worthy of further investigation to better explain the correlations between the feature and cancer survival.
I'm not going to regurgitate the paper which is freely available via the link above, but I do want to highlight a few points that I believe are important and of general interest:
- Random Survival Forest is the best out-of-the-box survival model we found. After testing this on some other data sets, I believe that Random Survival Forest may in fact be your best bet if you are doing survival modeling.
- A leaderboard or real-time model evaluation system is an enormous motivator in one's research. The fact that you can quickly tweak a model and have it evaluated and placed next to your others for comparison takes much of the grind out of research.
- Competitions are not an inexpensive option to hiring personnel or doing your own research. The computational overhead, tech support, and manpower to procure the data, produce the evaluation system, advertise the challenge, police the participants, and evaluate their submissions requires a substantial amount of manpower and expense.
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.