Meta-analysis of individual patient data versus aggregate data from longitudinal clinical trials

AP Jones; Richard Riley; PR Williamson; A Whitehead

doi:10.1177/1740774508100984

Research output: Contribution to journal › Article

86 Citations (Scopus)

Abstract

Background: In clinical trials following individuals over a period of time, the same assessment may be made at a number of time points during the course of the trial. Our review of current practice for handling longitudinal data in Cochrane systematic reviews shows that the most frequently used approach is to ignore the correlation between repeated observations and to conduct separate meta-analyses at each of a number of time points.

Purpose: The purpose of this paper is to show the link between repeated measurement models used with aggregate data and those used when individual patient data (IPD) are available, and to provide guidance on the methods that practitioners might use for aggregate data meta-analyses, depending on the type of data available.

Methods: We discuss models for the meta-analysis of longitudinal continuous outcome data when IPD are available. In these models time is included either as a factor or as a continuous variable, and account is taken of the correlation between repeated observations. The meta-analysis of IPD can be conducted using either a one-step or a two-step approach: the latter involves analysing the IPD separately in each study and then combining the study estimates, taking into account their covariance structure. We discuss the link between models for use with aggregate data and the two-step IPD approach, and the problems which arise when only aggregate data are available. The methods are applied to IPD from 5 trials in Alzheimer's disease.

Results: Two major issues for the meta-analysis of aggregate data are the lack of information about correlation coefficients and the effect of missing data at the patient level. Application to the Alzheimer's disease data set shows that ignoring correlation can lead to different pooled estimates of the treatment difference and their standard errors. Furthermore, the amount of missing data at the patient level can affect these estimates.

Limitations: The models assume fixed treatment effects across studies, and that any missing data are missing at random, at both the patient level and the study level.

Conclusions: It is preferable to obtain IPD from all studies to correctly account for the correlation between repeated observations. When IPD are not available, the ideal aggregate data are model-based estimates of treatment difference and their variance and covariance estimates. If covariance estimates are not available, sensitivity analyses should be undertaken to investigate the robustness of the results to different amounts of correlation.

Clinical Trials 2009; 6: 16-27. http://ctj.sagepub.com
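The sensitivity analysis recommended in the conclusions can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes each study reports a treatment difference and standard error at every time point, imposes a common compound-symmetry correlation `rho` within each study (an assumption made here for simplicity), and pools the study estimates with a fixed-effect generalised least squares estimator. The function name `pool_fixed_effect` and all numbers are hypothetical and are not taken from the Alzheimer's disease trials.

```python
import numpy as np


def pool_fixed_effect(estimates, std_errors, rho):
    """Fixed-effect GLS pooling of per-study treatment differences.

    estimates, std_errors: array-likes of shape (n_studies, n_times).
    rho: assumed within-study correlation between time points
         (compound symmetry; an assumption of this sketch).
    Returns the pooled estimates and their standard errors, each of shape (n_times,).
    """
    estimates = np.asarray(estimates, dtype=float)
    std_errors = np.asarray(std_errors, dtype=float)
    n_times = estimates.shape[1]

    # Correlation matrix with 1 on the diagonal and rho everywhere else.
    corr = np.full((n_times, n_times), float(rho))
    np.fill_diagonal(corr, 1.0)

    weight_sum = np.zeros((n_times, n_times))
    weighted_est = np.zeros(n_times)
    for y, se in zip(estimates, std_errors):
        cov = np.outer(se, se) * corr   # within-study covariance matrix
        weight = np.linalg.inv(cov)     # inverse-covariance weight
        weight_sum += weight
        weighted_est += weight @ y

    pooled_cov = np.linalg.inv(weight_sum)
    pooled = pooled_cov @ weighted_est
    return pooled, np.sqrt(np.diag(pooled_cov))


# Hypothetical aggregate data: 3 studies, 3 time points.
example_estimates = [[-1.2, -2.0, -2.6], [-0.8, -1.5, -2.9], [-1.0, -2.4, -2.2]]
example_std_errors = [[0.5, 0.6, 0.7], [0.4, 0.5, 0.9], [0.6, 0.6, 0.8]]

# Repeat the pooling over a grid of assumed correlations.
for assumed_rho in (0.0, 0.3, 0.6, 0.9):
    pooled, pooled_se = pool_fixed_effect(example_estimates, example_std_errors, assumed_rho)
    print(assumed_rho, np.round(pooled, 3), np.round(pooled_se, 3))
```

With `rho = 0` the calculation reduces to a separate inverse-variance meta-analysis at each time point, which is the approach the review found most common. Comparing the output across the grid shows how far the pooled estimates and standard errors move as the assumed correlation changes.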

Original language	English
Pages (from-to)	16-27
Number of pages	12
Journal	Clinical Trials
Volume	6
Issue number	1
DOIs	https://doi.org/10.1177/1740774508100984
Publication status	Published - 1 Feb 2009

Cite this

@article{05d4f9ce99264ac8a4799cd6e5d63ad4,
title = "Meta-analysis of individual patient data versus aggregate data from longitudinal clinical trials",
author = "AP Jones and Richard Riley and PR Williamson and A Whitehead",
year = "2009",
month = feb,
day = "1",
doi = "10.1177/1740774508100984",
language = "English",
volume = "6",
pages = "16--27",
journal = "Clinical Trials",
issn = "1740-7753",
publisher = "SAGE Publications",
number = "1",
}

TY - JOUR
T1 - Meta-analysis of individual patient data versus aggregate data from longitudinal clinical trials
AU - Jones, AP
AU - Riley, Richard
AU - Williamson, PR
AU - Whitehead, A
PY - 2009/2/1
Y1 - 2009/2/1
U2 - 10.1177/1740774508100984
DO - 10.1177/1740774508100984
M3 - Article
C2 - 19254930
SN - 1740-7753
VL - 6
SP - 16
EP - 27
JO - Clinical Trials
JF - Clinical Trials
IS - 1
ER -
