I tried a bunch of things: The dangers of unexpected overfitting in classification of brain data

Mahan Hosseini; Michael Powell; John Collins; Chloe Callahan-Flintoft; William Jones; Howard Bowman; Brad Wyble

doi:10.1016/j.neubiorev.2020.09.036

Psychology

Research output: Contribution to journal › Review article › peer-review

4 Citations (Scopus)

Abstract

Machine learning has enhanced the abilities of neuroscientists to interpret information collected through EEG, fMRI, and MEG data. With these powerful techniques comes the danger of overfitting of hyperparameters, which can render results invalid. We refer to this problem as 'over-hyping' and show that it is pernicious despite commonly used precautions. Over-hyping occurs when analysis decisions are made after observing analysis outcomes and can produce results that are partially or even completely spurious. It is commonly assumed that cross-validation is an effective protection against overfitting or over-hyping, but this is not actually true. In this article, we show that spurious results can be obtained on random data by modifying hyperparameters in seemingly innocuous ways, despite the use of cross-validation. We recommend a number of techniques for limiting over-hyping, such as lock boxes, blind analyses, pre-registrations, and nested cross-validation. These techniques are common in other fields that use machine learning, including computer science and physics. Adopting similar safeguards is critical for ensuring the robustness of machine-learning techniques in the neurosciences.
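The effect described in the abstract can be illustrated with a minimal sketch. This is not the authors' code or analysis pipeline: the nearest-centroid classifier, the random feature-subset search standing in for "modifying hyperparameters", the sample sizes, and every function name below are assumptions chosen only for illustration. The sketch picks the feature subset with the best cross-validated accuracy on pure-noise data, then scores that choice once on a lock box that played no part in the selection.

```python
import random


def make_random_data(n_samples, n_features, rng):
    """Pure-noise features with balanced, randomly assigned binary labels."""
    X = [[rng.gauss(0.0, 1.0) for _ in range(n_features)] for _ in range(n_samples)]
    y = [i % 2 for i in range(n_samples)]
    rng.shuffle(y)
    return X, y


def nearest_centroid_predict(X_train, y_train, X_test, features):
    """Assign each test row to the class whose centroid is closer (illustrative classifier)."""
    centroids = {}
    for label in (0, 1):
        rows = [x for x, t in zip(X_train, y_train) if t == label]
        centroids[label] = [sum(r[f] for r in rows) / len(rows) for f in features]
    preds = []
    for x in X_test:
        dist = {
            label: sum((x[f] - c) ** 2 for f, c in zip(features, cen))
            for label, cen in centroids.items()
        }
        preds.append(min(dist, key=dist.get))
    return preds


def cv_accuracy(X, y, features, n_folds=5):
    """Plain k-fold cross-validated accuracy for one candidate feature subset."""
    n = len(X)
    correct = 0
    for k in range(n_folds):
        test_idx = [i for i in range(n) if i % n_folds == k]
        train_idx = [i for i in range(n) if i % n_folds != k]
        preds = nearest_centroid_predict(
            [X[i] for i in train_idx], [y[i] for i in train_idx],
            [X[i] for i in test_idx], features,
        )
        correct += sum(p == y[i] for p, i in zip(preds, test_idx))
    return correct / n


def run_demo(seed=0, n_candidates=200):
    """Return (best CV accuracy, mean CV accuracy, lock-box accuracy) on noise."""
    rng = random.Random(seed)
    X, y = make_random_data(40, 20, rng)            # analysis set, reused for every try
    X_lock, y_lock = make_random_data(40, 20, rng)  # lock box, untouched until the end
    scores = []
    for _ in range(n_candidates):
        features = rng.sample(range(20), 3)         # one "analysis decision"
        scores.append((cv_accuracy(X, y, features), features))
    best_cv, best_features = max(scores)            # decision made after seeing outcomes
    mean_cv = sum(s for s, _ in scores) / len(scores)
    preds = nearest_centroid_predict(X, y, X_lock, best_features)
    lock_acc = sum(p == t for p, t in zip(preds, y_lock)) / len(y_lock)
    return best_cv, mean_cv, lock_acc


if __name__ == "__main__":
    best_cv, mean_cv, lock_acc = run_demo()
    print(f"best CV accuracy over all tries: {best_cv:.2f}")
    print(f"mean CV accuracy over all tries: {mean_cv:.2f}")
    print(f"lock-box accuracy of the winner: {lock_acc:.2f}")
```

Because the labels carry no signal, any gap between the best cross-validated score and the lock-box score in a run of this sketch reflects selection among many tries rather than real classification ability. Exact numbers depend on the seed and on the assumed settings.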

Original language	English
Pages (from-to)	456-467
Journal	Neuroscience and Biobehavioral Reviews
Volume	119
DOIs	https://doi.org/10.1016/j.neubiorev.2020.09.036
Publication status	E-pub ahead of print - 6 Oct 2020

Keywords

Analysis
Classification
EEG
Machine learning
Overfitting
Overhyping

ASJC Scopus subject areas

Neuropsychology and Physiological Psychology
Cognitive Neuroscience
Behavioral Neuroscience

Access to Document

10.1016/j.neubiorev.2020.09.036

Cite this

@article{952c4ac7392d431a836952eacb1e7e17,

title = "I tried a bunch of things: The dangers of unexpected overfitting in classification of brain data",

abstract = "Machine learning has enhanced the abilities of neuroscientists to interpret information collected through EEG, fMRI, and MEG data. With these powerful techniques comes the danger of overfitting of hyperparameters, which can render results invalid. We refer to this problem as 'over-hyping' and show that it is pernicious despite commonly used precautions. Over-hyping occurs when analysis decisions are made after observing analysis outcomes and can produce results that are partially or even completely spurious. It is commonly assumed that cross-validation is an effective protection against overfitting or over-hyping, but this is not actually true. In this article, we show that spurious results can be obtained on random data by modifying hyperparameters in seemingly innocuous ways, despite the use of cross-validation. We recommend a number of techniques for limiting over-hyping, such as lock boxes, blind analyses, pre-registrations, and nested cross-validation. These techniques are common in other fields that use machine learning, including computer science and physics. Adopting similar safeguards is critical for ensuring the robustness of machine-learning techniques in the neurosciences.",

keywords = "Analysis, Classification, EEG, Machine learning, Overfitting, Overhyping",

author = "Mahan Hosseini and Michael Powell and John Collins and Chloe Callahan-Flintoft and William Jones and Howard Bowman and Brad Wyble",

note = "Copyright {\textcopyright} 2020. Published by Elsevier Ltd.",

year = "2020",

month = oct,

day = "6",

doi = "10.1016/j.neubiorev.2020.09.036",

language = "English",

volume = "119",

pages = "456--467",

journal = "Neuroscience and Biobehavioral Reviews",

issn = "0149-7634",

publisher = "Elsevier",

}

TY - JOUR

T1 - I tried a bunch of things

T2 - The dangers of unexpected overfitting in classification of brain data

AU - Hosseini, Mahan

AU - Powell, Michael

AU - Collins, John

AU - Callahan-Flintoft, Chloe

AU - Jones, William

AU - Bowman, Howard

AU - Wyble, Brad

PY - 2020/10/6

Y1 - 2020/10/6

N2 - Machine learning has enhanced the abilities of neuroscientists to interpret information collected through EEG, fMRI, and MEG data. With these powerful techniques comes the danger of overfitting of hyperparameters, which can render results invalid. We refer to this problem as 'over-hyping' and show that it is pernicious despite commonly used precautions. Over-hyping occurs when analysis decisions are made after observing analysis outcomes and can produce results that are partially or even completely spurious. It is commonly assumed that cross-validation is an effective protection against overfitting or over-hyping, but this is not actually true. In this article, we show that spurious results can be obtained on random data by modifying hyperparameters in seemingly innocuous ways, despite the use of cross-validation. We recommend a number of techniques for limiting over-hyping, such as lock boxes, blind analyses, pre-registrations, and nested cross-validation. These techniques are common in other fields that use machine learning, including computer science and physics. Adopting similar safeguards is critical for ensuring the robustness of machine-learning techniques in the neurosciences.

AB - Machine learning has enhanced the abilities of neuroscientists to interpret information collected through EEG, fMRI, and MEG data. With these powerful techniques comes the danger of overfitting of hyperparameters, which can render results invalid. We refer to this problem as 'over-hyping' and show that it is pernicious despite commonly used precautions. Over-hyping occurs when analysis decisions are made after observing analysis outcomes and can produce results that are partially or even completely spurious. It is commonly assumed that cross-validation is an effective protection against overfitting or over-hyping, but this is not actually true. In this article, we show that spurious results can be obtained on random data by modifying hyperparameters in seemingly innocuous ways, despite the use of cross-validation. We recommend a number of techniques for limiting over-hyping, such as lock boxes, blind analyses, pre-registrations, and nested cross-validation. These techniques are common in other fields that use machine learning, including computer science and physics. Adopting similar safeguards is critical for ensuring the robustness of machine-learning techniques in the neurosciences.

KW - Analysis

KW - Classification

KW - EEG

KW - Machine learning

KW - Overfitting

KW - Overhyping

UR - http://www.scopus.com/inward/record.url?scp=85095433113&partnerID=8YFLogxK

U2 - 10.1016/j.neubiorev.2020.09.036

DO - 10.1016/j.neubiorev.2020.09.036

M3 - Review article

C2 - 33035522

SN - 0149-7634

VL - 119

SP - 456

EP - 467

JO - Neuroscience and Biobehavioral Reviews

JF - Neuroscience and Biobehavioral Reviews

ER -
