Inter-rater reliability of case-note audit: a systematic review

Richard Lilford; A Edwards; Alan Girling; T Hofer; GLD Tanna; Jane Petty; J Nicholl

doi:10.1258/135581907781543012

Research output: Contribution to journal › Article

46 Citations (Scopus)

Abstract

OBJECTIVE: The quality of clinical care is often assessed by retrospective examination of case-notes (charts, medical records). Our objective was to determine the inter-rater reliability of case-note audit. METHODS: We conducted a systematic review of the inter-rater reliability of case-note audit. Analysis was restricted to 26 papers reporting comparisons of two or three raters making independent judgements about the quality of care. RESULTS: Sixty-six separate comparisons were possible, since some papers reported more than one measurement of reliability. Mean kappa values ranged from 0.32 to 0.70. These may be inflated due to publication bias. Measured reliabilities were found to be higher for case-note reviews based on explicit, as opposed to implicit, criteria and for reviews that focused on outcome (including adverse effects) rather than process errors. We found an association between kappa and the prevalence of errors (poor quality care), suggesting alternatives such as tetrachoric and polychoric correlation coefficients be considered to assess inter-rater reliability. CONCLUSIONS: Comparative studies should take into account the relationship between kappa and the prevalence of the events being measured.
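
The dependence of kappa on prevalence noted in the abstract can be illustrated with a small sketch. The counts below are hypothetical and are not taken from the review, and the function name cohen_kappa is an assumption made for illustration. Two pairs of raters agree on 80% of case-notes in both scenarios, yet kappa comes out much lower when errors are rare.

```python
def cohen_kappa(both_flag, only_rater1, only_rater2, neither):
    """Cohen's kappa for two raters making a binary judgement.

    Arguments are the four cells of the 2x2 agreement table
    (hypothetical counts, not data from the review).
    """
    n = both_flag + only_rater1 + only_rater2 + neither
    p_observed = (both_flag + neither) / n
    # Marginal proportion of case-notes each rater flags as containing an error.
    p1 = (both_flag + only_rater1) / n
    p2 = (both_flag + only_rater2) / n
    # Agreement expected by chance if the raters judged independently.
    p_expected = p1 * p2 + (1 - p1) * (1 - p2)
    return (p_observed - p_expected) / (1 - p_expected)


# Same observed agreement (80 of 100 case-notes), different error prevalence.
kappa_balanced = cohen_kappa(40, 10, 10, 40)  # errors flagged in about half the notes
kappa_rare = cohen_kappa(5, 10, 10, 75)       # errors flagged in about 15% of notes

print(f"balanced prevalence: kappa = {kappa_balanced:.2f}")
print(f"rare errors:         kappa = {kappa_rare:.2f}")
```

With identical raw agreement, the lower kappa in the second scenario comes entirely from the higher chance-expected agreement when one category dominates, which is one reason comparisons of kappa across studies with different error prevalence need care.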

Original language	English
Pages (from-to)	173-180
Number of pages	8
Journal	Journal of Health Services Research & Policy
Volume	12
Issue number	3
DOIs	https://doi.org/10.1258/135581907781543012
Publication status	Published - 1 Jul 2007

Cite this

@article{742b5538f8bf4b7a959814037ee25b38,
title = "Inter-rater reliability of case-note audit: a systematic review",
abstract = "OBJECTIVE: The quality of clinical care is often assessed by retrospective examination of case-notes (charts, medical records). Our objective was to determine the inter-rater reliability of case-note audit. METHODS: We conducted a systematic review of the inter-rater reliability of case-note audit. Analysis was restricted to 26 papers reporting comparisons of two or three raters making independent judgements about the quality of care. RESULTS: Sixty-six separate comparisons were possible, since some papers reported more than one measurement of reliability. Mean kappa values ranged from 0.32 to 0.70. These may be inflated due to publication bias. Measured reliabilities were found to be higher for case-note reviews based on explicit, as opposed to implicit, criteria and for reviews that focused on outcome (including adverse effects) rather than process errors. We found an association between kappa and the prevalence of errors (poor quality care), suggesting alternatives such as tetrachoric and polychoric correlation coefficients be considered to assess inter-rater reliability. CONCLUSIONS: Comparative studies should take into account the relationship between kappa and the prevalence of the events being measured.",
author = "Richard Lilford and A Edwards and Alan Girling and T Hofer and GLD Tanna and Jane Petty and J Nicholl",
year = "2007",
month = jul,
day = "1",
doi = "10.1258/135581907781543012",
language = "English",
volume = "12",
pages = "173--180",
journal = "Journal of Health Services Research \& Policy",
issn = "1758-1060",
publisher = "SAGE Publications",
number = "3",
}

TY - JOUR
T1 - Inter-rater reliability of case-note audit: a systematic review
AU - Lilford, Richard
AU - Edwards, A
AU - Girling, Alan
AU - Hofer, T
AU - Tanna, GLD
AU - Petty, Jane
AU - Nicholl, J
PY - 2007/7/1
Y1 - 2007/7/1
AB - OBJECTIVE: The quality of clinical care is often assessed by retrospective examination of case-notes (charts, medical records). Our objective was to determine the inter-rater reliability of case-note audit. METHODS: We conducted a systematic review of the inter-rater reliability of case-note audit. Analysis was restricted to 26 papers reporting comparisons of two or three raters making independent judgements about the quality of care. RESULTS: Sixty-six separate comparisons were possible, since some papers reported more than one measurement of reliability. Mean kappa values ranged from 0.32 to 0.70. These may be inflated due to publication bias. Measured reliabilities were found to be higher for case-note reviews based on explicit, as opposed to implicit, criteria and for reviews that focused on outcome (including adverse effects) rather than process errors. We found an association between kappa and the prevalence of errors (poor quality care), suggesting alternatives such as tetrachoric and polychoric correlation coefficients be considered to assess inter-rater reliability. CONCLUSIONS: Comparative studies should take into account the relationship between kappa and the prevalence of the events being measured.
U2 - 10.1258/135581907781543012
DO - 10.1258/135581907781543012
M3 - Article
C2 - 17716421
SN - 1758-1060
VL - 12
SP - 173
EP - 180
JO - Journal of Health Services Research & Policy
JF - Journal of Health Services Research & Policy
IS - 3
ER -
