Data driven region-of-interest selection without inflating type I error rate

Joseph Brooks; Alexia Zoumpoulaki; Howard Bowman

doi:10.1111/psyp.12682

Data driven region-of-interest selection without inflating type I error rate

Joseph Brooks, Alexia Zoumpoulaki, Howard Bowman

Psychology

Research output: Contribution to journal › Article › peer-review

31 Citations (Scopus)

146 Downloads (Pure)

Abstract

In event-related potentials (ERP) and other large multi-dimensional neuroscience datasets, researchers often select regions-of-interest (ROIs) for analysis. The method of ROI selection can critically affect the conclusions of a study by causing the researcher to miss effects in the data or to detect spurious effects. In practice, to avoid inflating Type I error rate (i.e., false positives), ROIs are often based on a priori hypotheses or independent information. However, this can be insensitive to experiment-specific variations in effect location (e.g. latency shifts) reducing power to detect effects. Data-driven ROI selection, in contrast, is non-independent and uses the data under analysis to determine ROI positions. Therefore, it has potential to select ROIs based on experiment-specific information and increase power for detecting effects. However, data driven methods have been criticized because they can substantially inflate Type I error rate. Here we demonstrate, using simulations of simple ERP experiments, that data-driven ROI selection can indeed be more powerful than a priori hypotheses or independent information. Furthermore, we show that data-driven ROI selection using the aggregate-grand-average from trials (AGAT), despite being based on the data at hand, can be safely used for ROI selection under many circumstances. However, when there is a noise difference between conditions, using the AGAT can inflate Type 1 error and should be avoided. We identify critical assumptions for use of the AGAT and provide a basis for researchers to use, and reviewers to assess, data-driven methods of ROI localization in ERP and other studies.

Original language	English
Pages (from-to)	100-113
Journal	Psychophysiology
Volume	54
Issue number	1
Early online date	20 Dec 2016
DOIs	https://doi.org/10.1111/psyp.12682
Publication status	Published - Jan 2017

Keywords

EEG
ERPs
window selection
Type I error rate

ASJC Scopus subject areas

Neuropsychology and Physiological Psychology

Access to Document

10.1111/psyp.12682Licence: Creative Commons: Attribution-NonCommercial (CC BY-NC)

Brooks_et_al_Data-driven_region-of-interest_PsychophysiologyFinal published version, 879 KBLicence: Creative Commons: Attribution-NonCommercial (CC BY-NC)

Cite this

@article{e4e245be8935427991c42e49a38d7301,

title = "Data driven region-of-interest selection without inflating type I error rate",

abstract = "In event-related potentials (ERP) and other large multi-dimensional neuroscience datasets, researchers often select regions-of-interest (ROIs) for analysis. The method of ROI selection can critically affect the conclusions of a study by causing the researcher to miss effects in the data or to detect spurious effects. In practice, to avoid inflating Type I error rate (i.e., false positives), ROIs are often based on a priori hypotheses or independent information. However, this can be insensitive to experiment-specific variations in effect location (e.g. latency shifts) reducing power to detect effects. Data-driven ROI selection, in contrast, is non-independent and uses the data under analysis to determine ROI positions. Therefore, it has potential to select ROIs based on experiment-specific information and increase power for detecting effects. However, data driven methods have been criticized because they can substantially inflate Type I error rate. Here we demonstrate, using simulations of simple ERP experiments, that data-driven ROI selection can indeed be more powerful than a priori hypotheses or independent information. Furthermore, we show that data-driven ROI selection using the aggregate-grand-average from trials (AGAT), despite being based on the data at hand, can be safely used for ROI selection under many circumstances. However, when there is a noise difference between conditions, using the AGAT can inflate Type 1 error and should be avoided. We identify critical assumptions for use of the AGAT and provide a basis for researchers to use, and reviewers to assess, data-driven methods of ROI localization in ERP and other studies.",

keywords = "EEG, ERPs, window selection, Type I error rate",

author = "Joseph Brooks and Alexia Zoumpoulaki and Howard Bowman",

year = "2017",

month = jan,

doi = "10.1111/psyp.12682",

language = "English",

volume = "54",

pages = "100--113",

journal = "Psychophysiology",

issn = "0048-5772",

publisher = "Wiley Online Library",

number = "1",

}

TY - JOUR

T1 - Data driven region-of-interest selection without inflating type I error rate

AU - Brooks, Joseph

AU - Zoumpoulaki, Alexia

AU - Bowman, Howard

PY - 2017/1

Y1 - 2017/1

N2 - In event-related potentials (ERP) and other large multi-dimensional neuroscience datasets, researchers often select regions-of-interest (ROIs) for analysis. The method of ROI selection can critically affect the conclusions of a study by causing the researcher to miss effects in the data or to detect spurious effects. In practice, to avoid inflating Type I error rate (i.e., false positives), ROIs are often based on a priori hypotheses or independent information. However, this can be insensitive to experiment-specific variations in effect location (e.g. latency shifts) reducing power to detect effects. Data-driven ROI selection, in contrast, is non-independent and uses the data under analysis to determine ROI positions. Therefore, it has potential to select ROIs based on experiment-specific information and increase power for detecting effects. However, data driven methods have been criticized because they can substantially inflate Type I error rate. Here we demonstrate, using simulations of simple ERP experiments, that data-driven ROI selection can indeed be more powerful than a priori hypotheses or independent information. Furthermore, we show that data-driven ROI selection using the aggregate-grand-average from trials (AGAT), despite being based on the data at hand, can be safely used for ROI selection under many circumstances. However, when there is a noise difference between conditions, using the AGAT can inflate Type 1 error and should be avoided. We identify critical assumptions for use of the AGAT and provide a basis for researchers to use, and reviewers to assess, data-driven methods of ROI localization in ERP and other studies.

AB - In event-related potentials (ERP) and other large multi-dimensional neuroscience datasets, researchers often select regions-of-interest (ROIs) for analysis. The method of ROI selection can critically affect the conclusions of a study by causing the researcher to miss effects in the data or to detect spurious effects. In practice, to avoid inflating Type I error rate (i.e., false positives), ROIs are often based on a priori hypotheses or independent information. However, this can be insensitive to experiment-specific variations in effect location (e.g. latency shifts) reducing power to detect effects. Data-driven ROI selection, in contrast, is non-independent and uses the data under analysis to determine ROI positions. Therefore, it has potential to select ROIs based on experiment-specific information and increase power for detecting effects. However, data driven methods have been criticized because they can substantially inflate Type I error rate. Here we demonstrate, using simulations of simple ERP experiments, that data-driven ROI selection can indeed be more powerful than a priori hypotheses or independent information. Furthermore, we show that data-driven ROI selection using the aggregate-grand-average from trials (AGAT), despite being based on the data at hand, can be safely used for ROI selection under many circumstances. However, when there is a noise difference between conditions, using the AGAT can inflate Type 1 error and should be avoided. We identify critical assumptions for use of the AGAT and provide a basis for researchers to use, and reviewers to assess, data-driven methods of ROI localization in ERP and other studies.

KW - EEG

KW - ERPs

KW - window selection

KW - Type I error rate

U2 - 10.1111/psyp.12682

DO - 10.1111/psyp.12682

M3 - Article

SN - 0048-5772

VL - 54

SP - 100

EP - 113

JO - Psychophysiology

JF - Psychophysiology

IS - 1

ER -

Data driven region-of-interest selection without inflating type I error rate

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Fingerprint

Cite this