Sample size requirements are not being considered in studies developing prediction models for binary outcomes: a systematic review

Paula Dhiman; Jie Ma; Cathy Qi; Garrett Bullock; Jamie C Sergeant; Richard D Riley; Gary S Collins

doi:10.1186/s12874-023-02008-1

Paula Dhiman^*, Jie Ma, Cathy Qi, Garrett Bullock, Jamie C Sergeant, Richard D Riley, Gary S Collins

^*Corresponding author for this work

Applied Health Research

Research output: Contribution to journal › Article › peer-review

Abstract

Background: Having an appropriate sample size is important when developing a clinical prediction model. We aimed to review how sample size is considered in studies developing a prediction model for a binary outcome.

Methods: We searched PubMed for studies published between 01/07/2020 and 30/07/2020 and reviewed the sample size calculations used to develop the prediction models. Using the available information, we calculated the minimum sample size that would be needed to estimate overall risk and minimise overfitting in each study and summarised the difference between the calculated and used sample size.

Results: A total of 119 studies were included, of which nine studies provided sample size justification (8%). The recommended minimum sample size could be calculated for 94 studies: 73% (95% CI: 63–82%) used sample sizes lower than required to estimate overall risk and minimise overfitting, including 26% of studies that used sample sizes lower than required to estimate overall risk only. A similar number of studies did not meet the ≥ 10 EPV criteria (75%, 95% CI: 66–84%). The median deficit of the number of events used to develop a model was 75 [IQR: 234 lower to 7 higher], which reduced to 63 if the total available data (before any data splitting) was used [IQR: 225 lower to 7 higher]. Studies that met the minimum required sample size had a median c-statistic of 0.84 (IQR: 0.80 to 0.9) and studies where the minimum sample size was not met had a median c-statistic of 0.83 (IQR: 0.75 to 0.9). Studies that met the ≥ 10 EPP criteria had a median c-statistic of 0.80 (IQR: 0.73 to 0.84).

Conclusions: Prediction models are often developed with no sample size calculation; as a consequence, many are too small to precisely estimate the overall risk. We encourage researchers to justify, perform and report sample size calculations when developing a prediction model.
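As a rough illustration of the kind of calculation the Methods describe, the sketch below computes a minimum sample size for estimating overall risk, a minimum sample size for limiting overfitting, and the events-per-parameter ratio behind the ≥ 10 rule of thumb. It uses widely cited closed-form criteria for binary outcomes. Whether the review applied exactly these formulas, and with which inputs, is an assumption: the function names, the 0.05 margin of error, the 0.9 target shrinkage factor and the example inputs are all illustrative and are not taken from the article.

```python
import math


def n_for_overall_risk(prevalence, margin=0.05, z=1.96):
    """Smallest n giving a 95% CI half-width <= margin for the overall risk.

    Assumed criterion: n = (z / margin)^2 * p * (1 - p).
    """
    return math.ceil((z / margin) ** 2 * prevalence * (1 - prevalence))


def n_for_shrinkage(n_params, r2_cs, shrinkage=0.9):
    """Smallest n targeting an expected uniform shrinkage factor.

    Assumed criterion: n = P / ((S - 1) * ln(1 - R2_cs / S)), where P is the
    number of candidate predictor parameters and R2_cs is the anticipated
    Cox-Snell R-squared of the model.
    """
    return math.ceil(n_params / ((shrinkage - 1) * math.log(1 - r2_cs / shrinkage)))


def events_per_parameter(n, prevalence, n_params):
    """Expected number of outcome events per candidate predictor parameter."""
    return n * prevalence / n_params


if __name__ == "__main__":
    # Hypothetical study: 10% outcome prevalence, 10 candidate parameters,
    # anticipated Cox-Snell R-squared of 0.1.
    needed = max(n_for_overall_risk(0.10), n_for_shrinkage(10, 0.10))
    print("minimum n under both criteria:", needed)
    print("events per parameter at that n:", events_per_parameter(needed, 0.10, 10))
```

Taking the larger of the two values reflects the idea that a development dataset should satisfy every criterion at once. A study's actual sample size can then be compared against that minimum to express a deficit or surplus.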

Original language	English
Article number	188
Number of pages	11
Journal	BMC Medical Research Methodology
Volume	23
Issue number	1
DOIs	https://doi.org/10.1186/s12874-023-02008-1
Publication status	Published - 19 Aug 2023

Keywords

Prediction model
Sample size
Methodology

Access to Document

10.1186/s12874-023-02008-1
Licence: Creative Commons: Attribution (CC BY)

12874_2023_Article_2008.pdf
Final published version, 1.4 MB
Licence: Creative Commons: Attribution (CC BY)

Cite this

@article{96bf67e641594ffabe926c54c336e87d,

title = "Sample size requirements are not being considered in studies developing prediction models for binary outcomes: a systematic review",

abstract = "Background: Having an appropriate sample size is important when developing a clinical prediction model. We aimed to review how sample size is considered in studies developing a prediction model for a binary outcome. Methods: We searched PubMed for studies published between 01/07/2020 and 30/07/2020 and reviewed the sample size calculations used to develop the prediction models. Using the available information, we calculated the minimum sample size that would be needed to estimate overall risk and minimise overfitting in each study and summarised the difference between the calculated and used sample size. Results: A total of 119 studies were included, of which nine studies provided sample size justification (8%). The recommended minimum sample size could be calculated for 94 studies: 73% (95% CI: 63–82%) used sample sizes lower than required to estimate overall risk and minimise overfitting, including 26% of studies that used sample sizes lower than required to estimate overall risk only. A similar number of studies did not meet the ≥ 10 EPV criteria (75%, 95% CI: 66–84%). The median deficit of the number of events used to develop a model was 75 [IQR: 234 lower to 7 higher], which reduced to 63 if the total available data (before any data splitting) was used [IQR: 225 lower to 7 higher]. Studies that met the minimum required sample size had a median c-statistic of 0.84 (IQR: 0.80 to 0.9) and studies where the minimum sample size was not met had a median c-statistic of 0.83 (IQR: 0.75 to 0.9). Studies that met the ≥ 10 EPP criteria had a median c-statistic of 0.80 (IQR: 0.73 to 0.84). Conclusions: Prediction models are often developed with no sample size calculation; as a consequence, many are too small to precisely estimate the overall risk. We encourage researchers to justify, perform and report sample size calculations when developing a prediction model.",

keywords = "Prediction model, Sample size, Methodology",

author = "Paula Dhiman and Jie Ma and Cathy Qi and Garrett Bullock and Sergeant, {Jamie C} and Riley, {Richard D} and Collins, {Gary S}",

year = "2023",

month = aug,

day = "19",

doi = "10.1186/s12874-023-02008-1",

language = "English",

volume = "23",

journal = "BMC Medical Research Methodology",

issn = "1471-2288",

publisher = "Springer",

number = "1",

}

TY - JOUR

T1 - Sample size requirements are not being considered in studies developing prediction models for binary outcomes

T2 - a systematic review

AU - Dhiman, Paula

AU - Ma, Jie

AU - Qi, Cathy

AU - Bullock, Garrett

AU - Sergeant, Jamie C

AU - Riley, Richard D

AU - Collins, Gary S

PY - 2023/8/19

Y1 - 2023/8/19

N2 - Background: Having an appropriate sample size is important when developing a clinical prediction model. We aimed to review how sample size is considered in studies developing a prediction model for a binary outcome. Methods: We searched PubMed for studies published between 01/07/2020 and 30/07/2020 and reviewed the sample size calculations used to develop the prediction models. Using the available information, we calculated the minimum sample size that would be needed to estimate overall risk and minimise overfitting in each study and summarised the difference between the calculated and used sample size. Results: A total of 119 studies were included, of which nine studies provided sample size justification (8%). The recommended minimum sample size could be calculated for 94 studies: 73% (95% CI: 63–82%) used sample sizes lower than required to estimate overall risk and minimise overfitting, including 26% of studies that used sample sizes lower than required to estimate overall risk only. A similar number of studies did not meet the ≥ 10 EPV criteria (75%, 95% CI: 66–84%). The median deficit of the number of events used to develop a model was 75 [IQR: 234 lower to 7 higher], which reduced to 63 if the total available data (before any data splitting) was used [IQR: 225 lower to 7 higher]. Studies that met the minimum required sample size had a median c-statistic of 0.84 (IQR: 0.80 to 0.9) and studies where the minimum sample size was not met had a median c-statistic of 0.83 (IQR: 0.75 to 0.9). Studies that met the ≥ 10 EPP criteria had a median c-statistic of 0.80 (IQR: 0.73 to 0.84). Conclusions: Prediction models are often developed with no sample size calculation; as a consequence, many are too small to precisely estimate the overall risk. We encourage researchers to justify, perform and report sample size calculations when developing a prediction model.

AB - Background: Having an appropriate sample size is important when developing a clinical prediction model. We aimed to review how sample size is considered in studies developing a prediction model for a binary outcome. Methods: We searched PubMed for studies published between 01/07/2020 and 30/07/2020 and reviewed the sample size calculations used to develop the prediction models. Using the available information, we calculated the minimum sample size that would be needed to estimate overall risk and minimise overfitting in each study and summarised the difference between the calculated and used sample size. Results: A total of 119 studies were included, of which nine studies provided sample size justification (8%). The recommended minimum sample size could be calculated for 94 studies: 73% (95% CI: 63–82%) used sample sizes lower than required to estimate overall risk and minimise overfitting, including 26% of studies that used sample sizes lower than required to estimate overall risk only. A similar number of studies did not meet the ≥ 10 EPV criteria (75%, 95% CI: 66–84%). The median deficit of the number of events used to develop a model was 75 [IQR: 234 lower to 7 higher], which reduced to 63 if the total available data (before any data splitting) was used [IQR: 225 lower to 7 higher]. Studies that met the minimum required sample size had a median c-statistic of 0.84 (IQR: 0.80 to 0.9) and studies where the minimum sample size was not met had a median c-statistic of 0.83 (IQR: 0.75 to 0.9). Studies that met the ≥ 10 EPP criteria had a median c-statistic of 0.80 (IQR: 0.73 to 0.84). Conclusions: Prediction models are often developed with no sample size calculation; as a consequence, many are too small to precisely estimate the overall risk. We encourage researchers to justify, perform and report sample size calculations when developing a prediction model.

KW - Prediction model

KW - Sample size

KW - Methodology

U2 - 10.1186/s12874-023-02008-1

DO - 10.1186/s12874-023-02008-1

M3 - Article

SN - 1471-2288

VL - 23

JO - BMC Medical Research Methodology

JF - BMC Medical Research Methodology

IS - 1

M1 - 188

ER -
