Skip to main navigation Skip to search Skip to main content

BoTTA: Benchmarking on-device Test Time Adaptation

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The performance of deep learning models depends heavily on test samples at runtime, and shifts from the training data distribution can significantly reduce accuracy. Test-time adaptation (TTA) addresses this by adapting models during inference without requiring labeled test data or access to the original training set. While research has explored TTA from various perspectives like algorithmic complexity, data and class distribution shifts, model architectures, and offline versus continuous learning, constraints specific to mobile and edge devices remain underexplored. We propose BoTTA, a benchmark designed to evaluate TTA methods under practical constraints on mobile and edge devices. Our evaluation targets four key challenges caused by limited resources and usage conditions: (i) limited test samples, (ii) limited exposure to categories, (iii) diverse distribution shifts, and (iv) overlapping shifts within a sample. We assess state-of-the-art TTA methods under these scenarios using benchmark datasets and report system-level metrics on a real testbed. Furthermore, unlike prior work, we align with on-device requirements by advocating periodic adaptation instead of continuous inference-time adaptation. Experiments reveal key insights: many recent TTA algorithms struggle with small datasets, fail to generalize to unseen categories, and depend on the diversity and complexity of distribution shifts. BoTTA also reports device-specific resource use. For example, while SHOT improves accuracy by 2.25× with 512 adaptation samples, it uses 1.08× peak memory on Raspberry Pi versus the base model. BoTTA offers actionable guidance for TTA in real-world, resource-constrained deployments.
Original languageEnglish
Title of host publicationACM/IEEE International Conference on Embedded Artificial Intelligence and Sensing Systems (SenSys)
Publication statusPublished - 2026
EventACM/IEEE 2026 International Conference on
Embedded Artificial Intelligence and Sensing Systems
- Saint-Malo, France
Duration: 11 May 202614 May 2026
Conference number: 1
https://sensys.acm.org/2026/

Conference

ConferenceACM/IEEE 2026 International Conference on
Embedded Artificial Intelligence and Sensing Systems
Abbreviated titleSenSys 2026
Country/TerritoryFrance
CitySaint-Malo
Period11/05/2614/05/26
Internet address

Bibliographical note

Not yet published as of 26/03/2026.

Fingerprint

Dive into the research topics of 'BoTTA: Benchmarking on-device Test Time Adaptation'. Together they form a unique fingerprint.

Cite this