Penalization and shrinkage methods produced unreliable clinical prediction models especially when sample size was small

Richard D Riley; Kym I E Snell; Glen P Martin; Rebecca Whittle; Lucinda Archer; Matthew Sperrin; Gary S Collins

doi:10.1016/j.jclinepi.2020.12.005

Richard D Riley^*, Kym I E Snell, Glen P Martin, Rebecca Whittle, Lucinda Archer, Matthew Sperrin, Gary S Collins

^*Corresponding author for this work

Applied Health Research

Research output: Contribution to journal › Article › peer-review

Abstract

OBJECTIVES: When developing a clinical prediction model, penalization techniques are recommended to address overfitting, as they shrink predictor effect estimates toward the null and reduce mean-square prediction error in new individuals. However, shrinkage and penalty terms ('tuning parameters') are estimated with uncertainty from the development data set. We examined the magnitude of this uncertainty and the subsequent impact on prediction model performance.

STUDY DESIGN AND SETTING: This study comprises applied examples and a simulation study of the following methods: uniform shrinkage (estimated via a closed-form solution or bootstrapping), ridge regression, the lasso, and elastic net.

RESULTS: In a particular model development data set, penalization methods can be unreliable because tuning parameters are estimated with large uncertainty. This is of most concern when development data sets have a small effective sample size and the model's Cox-Snell R² is low. The problem can lead to considerable miscalibration of model predictions in new individuals.

CONCLUSION: Penalization methods are not a 'carte blanche'; they do not guarantee a reliable prediction model is developed. They are more unreliable when needed most (i.e., when overfitting may be large). We recommend they are best applied with large effective sample sizes, as identified from recent sample size calculations that aim to minimize the potential for model overfitting and precisely estimate key parameters.
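As a rough illustration of why a shrinkage factor is harder to pin down in small samples, the sketch below computes a heuristic closed-form uniform shrinkage factor, S = (LR - p) / LR, where LR is the model's likelihood-ratio statistic and p the number of predictor parameters. It uses the identity R²_CS = 1 - exp(-LR / n) to obtain LR from an apparent Cox-Snell R². This is not the authors' code: the function names and the example values (R² = 0.1, p = 10) are hypothetical, and treating this heuristic formula as the "closed-form solution" named in the abstract is an assumption.

```python
import math


def lr_from_cox_snell(n: int, r2_cs: float) -> float:
    """Likelihood-ratio statistic implied by an apparent Cox-Snell R^2.

    Uses the identity R2_CS = 1 - exp(-LR / n), i.e. LR = -n * ln(1 - R2_CS).
    """
    if not 0.0 < r2_cs < 1.0:
        raise ValueError("r2_cs must lie strictly between 0 and 1")
    return -n * math.log(1.0 - r2_cs)


def heuristic_shrinkage(lr_chi2: float, n_params: int) -> float:
    """Heuristic uniform shrinkage factor S = (LR - p) / LR.

    Values near 1 suggest little overfitting; values well below 1
    suggest predictor effects need substantial shrinkage.
    """
    if lr_chi2 <= 0.0:
        raise ValueError("lr_chi2 must be positive")
    return (lr_chi2 - n_params) / lr_chi2


if __name__ == "__main__":
    # Hypothetical scenario: same apparent R^2 and model size, growing n.
    for n in (100, 250, 1000):
        lr = lr_from_cox_snell(n, r2_cs=0.1)
        s = heuristic_shrinkage(lr, n_params=10)
        print(f"n={n:5d}  LR={lr:7.2f}  S={s:.3f}")
```

With R² and p held fixed, the estimated factor moves toward 1 as n grows. The sketch gives only the point estimate; the uncertainty around it, which is the abstract's concern, would need something like a bootstrap over the development data.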

Original language	English
Pages (from-to)	88-96
Number of pages	9
Journal	Journal of Clinical Epidemiology
Volume	132
Early online date	8 Dec 2020
DOIs	https://doi.org/10.1016/j.jclinepi.2020.12.005
Publication status	Published - Apr 2021

Keywords

Risk prediction models
Penalization
Shrinkage
Overfitting
Sample size

Access to Document

10.1016/j.jclinepi.2020.12.005. Licence: Creative Commons: Attribution (CC BY)

RileyR2020Penalization: Final published version, 1.18 MB. Licence: Creative Commons: Attribution (CC BY)

Cite this

@article{65f0a233935c4213b4d0683cb2d249ac,

title = "Penalization and shrinkage methods produced unreliable clinical prediction models especially when sample size was small",

abstract = "OBJECTIVES: When developing a clinical prediction model, penalization techniques are recommended to address overfitting, as they shrink predictor effect estimates toward the null and reduce mean-square prediction error in new individuals. However, shrinkage and penalty terms ('tuning parameters') are estimated with uncertainty from the development data set. We examined the magnitude of this uncertainty and the subsequent impact on prediction model performance. STUDY DESIGN AND SETTING: This study comprises applied examples and a simulation study of the following methods: uniform shrinkage (estimated via a closed-form solution or bootstrapping), ridge regression, the lasso, and elastic net. RESULTS: In a particular model development data set, penalization methods can be unreliable because tuning parameters are estimated with large uncertainty. This is of most concern when development data sets have a small effective sample size and the model's Cox-Snell $R^2$ is low. The problem can lead to considerable miscalibration of model predictions in new individuals. CONCLUSION: Penalization methods are not a 'carte blanche'; they do not guarantee a reliable prediction model is developed. They are more unreliable when needed most (i.e., when overfitting may be large). We recommend they are best applied with large effective sample sizes, as identified from recent sample size calculations that aim to minimize the potential for model overfitting and precisely estimate key parameters.",

keywords = "Risk prediction models, Penalization, Shrinkage, Overfitting, Sample size",

author = "Riley, {Richard D} and Snell, {Kym I E} and Martin, {Glen P} and Rebecca Whittle and Lucinda Archer and Matthew Sperrin and Collins, {Gary S}",

year = "2021",

month = apr,

doi = "10.1016/j.jclinepi.2020.12.005",

language = "English",

volume = "132",

pages = "88--96",

journal = "Journal of Clinical Epidemiology",

issn = "0895-4356",

publisher = "Elsevier",

}

TY - JOUR

T1 - Penalization and shrinkage methods produced unreliable clinical prediction models especially when sample size was small

AU - Riley, Richard D

AU - Snell, Kym I E

AU - Martin, Glen P

AU - Whittle, Rebecca

AU - Archer, Lucinda

AU - Sperrin, Matthew

AU - Collins, Gary S

PY - 2021/4

Y1 - 2021/4

N2 - OBJECTIVES: When developing a clinical prediction model, penalization techniques are recommended to address overfitting, as they shrink predictor effect estimates toward the null and reduce mean-square prediction error in new individuals. However, shrinkage and penalty terms ('tuning parameters') are estimated with uncertainty from the development data set. We examined the magnitude of this uncertainty and the subsequent impact on prediction model performance. STUDY DESIGN AND SETTING: This study comprises applied examples and a simulation study of the following methods: uniform shrinkage (estimated via a closed-form solution or bootstrapping), ridge regression, the lasso, and elastic net. RESULTS: In a particular model development data set, penalization methods can be unreliable because tuning parameters are estimated with large uncertainty. This is of most concern when development data sets have a small effective sample size and the model's Cox-Snell R² is low. The problem can lead to considerable miscalibration of model predictions in new individuals. CONCLUSION: Penalization methods are not a 'carte blanche'; they do not guarantee a reliable prediction model is developed. They are more unreliable when needed most (i.e., when overfitting may be large). We recommend they are best applied with large effective sample sizes, as identified from recent sample size calculations that aim to minimize the potential for model overfitting and precisely estimate key parameters.

AB - OBJECTIVES: When developing a clinical prediction model, penalization techniques are recommended to address overfitting, as they shrink predictor effect estimates toward the null and reduce mean-square prediction error in new individuals. However, shrinkage and penalty terms ('tuning parameters') are estimated with uncertainty from the development data set. We examined the magnitude of this uncertainty and the subsequent impact on prediction model performance. STUDY DESIGN AND SETTING: This study comprises applied examples and a simulation study of the following methods: uniform shrinkage (estimated via a closed-form solution or bootstrapping), ridge regression, the lasso, and elastic net. RESULTS: In a particular model development data set, penalization methods can be unreliable because tuning parameters are estimated with large uncertainty. This is of most concern when development data sets have a small effective sample size and the model's Cox-Snell R² is low. The problem can lead to considerable miscalibration of model predictions in new individuals. CONCLUSION: Penalization methods are not a 'carte blanche'; they do not guarantee a reliable prediction model is developed. They are more unreliable when needed most (i.e., when overfitting may be large). We recommend they are best applied with large effective sample sizes, as identified from recent sample size calculations that aim to minimize the potential for model overfitting and precisely estimate key parameters.

KW - Risk prediction models

KW - Penalization

KW - Shrinkage

KW - Overfitting

KW - Sample size

U2 - 10.1016/j.jclinepi.2020.12.005

DO - 10.1016/j.jclinepi.2020.12.005

M3 - Article

C2 - 33307188

SN - 0895-4356

VL - 132

SP - 88

EP - 96

JO - Journal of Clinical Epidemiology

JF - Journal of Clinical Epidemiology

ER -