Skip to main navigation Skip to search Skip to main content

Covariate shift in nonparametric regression with Markovian design

  • Lukas Trottner*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

35 Downloads (Pure)

Abstract

Covariate shift in regression problems and the associated distribution mismatch between training and test data is a commonly encountered phenomenon in machine learning. In this paper, we extend recent results on nonparametric convergence rates for i.i.d. data to Markovian dependence structures. We demonstrate that under Hölder smoothness assumptions on the regression function, convergence rates for the generalization risk of a Nadaraya–Watson kernel estimator are determined by the similarity between the invariant distributions associated to source and target Markov chains. The similarity is explicitly captured in terms of a bandwidth-dependent similarity measure recently introduced in Pathak, Ma and Wainwright [ICML, 2022]. Precise convergence rates are derived for the particular cases of finite Markov chains and spectral gap Markov chains for which the similarity measure between their invariant distributions grows polynomially with decreasing bandwidth. For the latter, we extend the notion of a distribution transfer exponent from Kpotufe and Martinet [Ann. Stat., 49(6), 2021] to kernel transfer exponents of uniformly ergodic Markov chains in order to generate a rich class of Markov kernel pairs for which convergence guarantees for the covariate shift problem can be formulated.
Original languageEnglish
Article numberiaae011
Number of pages37
JournalInformation and Inference: A Journal of the IMA
Volume13
Issue number2
DOIs
Publication statusPublished - 6 May 2024

Fingerprint

Dive into the research topics of 'Covariate shift in nonparametric regression with Markovian design'. Together they form a unique fingerprint.

Cite this