Dimension-free error bounds from random projections

Ata Kaban

doi:10.1609/aaai.v33i01.33014049

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Learning from high dimensional data is challenging in general – however, often the data is not truly high dimensional in the sense that it may have some hidden low complexity geometry. We give new, user-friendly PAC-bounds that are able to take advantage of such benign geometry to reduce dimensional-dependence of error-guarantees in settings where such dependence is known to be essential in general. This is achieved by employing random projection as an analytic tool, and exploiting its structure-preserving compression ability. We introduce an auxiliary function class that operates on reduced dimensional inputs, and a new complexity term, as the distortion of the loss under random projections. The latter is a hypothesis-dependent data-complexity, whose analytic estimates turn out to recover various regularisation schemes in parametric models, and a notion of intrinsic dimension, as quantified by the Gaussian width of the input support in the case of the nearest neighbour rule. If there is benign geometry present, then the bounds become tighter, otherwise they recover the original dimension-dependent bounds.
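The "structure-preserving compression ability" of random projection mentioned in the abstract can be illustrated with a small sketch. This is not the paper's method or its bounds, only a minimal demonstration, under assumed settings, that a Gaussian random matrix approximately preserves pairwise Euclidean distances. The function names, the dimensions `d`, `k`, `n`, and the seed are hypothetical choices made for illustration.

```python
import math
import random


def gaussian_projection_matrix(k, d, rng):
    """Return a k x d matrix with i.i.d. N(0, 1/k) entries.

    The 1/sqrt(k) scaling keeps squared Euclidean norms unbiased after projection.
    """
    scale = 1.0 / math.sqrt(k)
    return [[rng.gauss(0.0, 1.0) * scale for _ in range(d)] for _ in range(k)]


def project(matrix, x):
    """Map a d-dimensional point x to k dimensions."""
    return [sum(m * v for m, v in zip(row, x)) for row in matrix]


def distance(u, v):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def distortion_ratios(points, matrix):
    """Projected distance divided by original distance, for every pair of points."""
    projected = [project(matrix, p) for p in points]
    ratios = []
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            ratios.append(
                distance(projected[i], projected[j]) / distance(points[i], points[j])
            )
    return ratios


rng = random.Random(0)  # fixed seed so the run is repeatable (assumed value)
d, k, n = 500, 100, 10  # ambient dim, projected dim, number of points (assumed values)
points = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
R = gaussian_projection_matrix(k, d, rng)
ratios = distortion_ratios(points, R)
print(f"min ratio {min(ratios):.3f}, max ratio {max(ratios):.3f}")
```

The printed ratios should cluster around 1, and the spread narrows as `k` grows. The paper's complexity term concerns the distortion of the loss under such projections rather than raw distances, so this sketch shows only the underlying geometric effect.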

Original language	English
Title of host publication	Thirty Third AAAI Conference on Artificial Intelligence (AAAI-19)
Publisher	AAAI Press
Pages	4049-4056
Number of pages	8
ISBN (Print)	978-1-57735-809-1
DOIs	https://doi.org/10.1609/aaai.v33i01.33014049
Publication status	Published - 17 Jul 2019
Event	Thirty Third AAAI Conference on Artificial Intelligence (AAAI-19) - Honolulu, Hawaii, United States
Duration	27 Jan 2019 → 1 Feb 2019

Publication series

Name	Proceedings of the AAAI Conference on Artificial Intelligence
Publisher	AAAI
Number	1
Volume	33
ISSN (Print)	2159-5399
ISSN (Electronic)	2374-3468

Conference

Conference	Thirty Third AAAI Conference on Artificial Intelligence (AAAI-19)
Country/Territory	United States
City	Honolulu, Hawaii
Period	27/01/19 → 1/02/19

Access to Document

10.1609/aaai.v33i01.33014049
Licence: None: All rights reserved

Ata_Kaban_Dimension-free_error_bounds_from_random_projections_AAAI_19_2019
Checked for eligibility: 19/12/2018. This is the accepted manuscript for a forthcoming publication in AAAI Conference on Artificial Intelligence (AAAI-2019).
Accepted author manuscript, 312 KB
Licence: None: All rights reserved

https://www.aaai.org/ojs/index.php/AAAI/article/view/4300
Licence: None: All rights reserved

Cite this

@inproceedings{afd4b43955084e30a28f5ba788416816,

title = "Dimension-free error bounds from random projections",

abstract = "Learning from high dimensional data is challenging in general – however, often the data is not truly high dimensional in the sense that it may have some hidden low complexity geometry. We give new, user-friendly PAC-bounds that are able to take advantage of such benign geometry to reduce dimensional-dependence of error-guarantees in settings where such dependence is known to be essential in general. This is achieved by employing random projection as an analytic tool, and exploiting its structure-preserving compression ability. We introduce an auxiliary function class that operates on reduced dimensional inputs, and a new complexity term, as the distortion of the loss under random projections. The latter is a hypothesis-dependent data-complexity, whose analytic estimates turn out to recover various regularisation schemes in parametric models, and a notion of intrinsic dimension, as quantified by the Gaussian width of the input support in the case of the nearest neighbour rule. If there is benign geometry present, then the bounds become tighter, otherwise they recover the original dimension-dependent bounds. ",

author = "Ata Kaban",

year = "2019",

month = jul,

day = "17",

doi = "10.1609/aaai.v33i01.33014049",

language = "English",

isbn = "978-1-57735-809-1",

series = "Proceedings of the AAAI Conference on Artificial Intelligence",

publisher = "AAAI Press",

number = "1",

pages = "4049--4056",

booktitle = "Thirty Third AAAI Conference on Artificial Intelligence (AAAI-19)",

note = "Thirty Third AAAI Conference on Artificial Intelligence (AAAI-19) ; Conference date: 27-01-2019 Through 01-02-2019",

}

Kaban, A 2019, Dimension-free error bounds from random projections. in Thirty Third AAAI Conference on Artificial Intelligence (AAAI-19). Proceedings of the AAAI Conference on Artificial Intelligence, no. 1, vol. 33, AAAI Press, pp. 4049-4056, Thirty Third AAAI Conference on Artificial Intelligence (AAAI-19), Honolulu, Hawaii, United States, 27/01/19. https://doi.org/10.1609/aaai.v33i01.33014049

TY - GEN

T1 - Dimension-free error bounds from random projections

AU - Kaban, Ata

PY - 2019/7/17

Y1 - 2019/7/17

N2 - Learning from high dimensional data is challenging in general – however, often the data is not truly high dimensional in the sense that it may have some hidden low complexity geometry. We give new, user-friendly PAC-bounds that are able to take advantage of such benign geometry to reduce dimensional-dependence of error-guarantees in settings where such dependence is known to be essential in general. This is achieved by employing random projection as an analytic tool, and exploiting its structure-preserving compression ability. We introduce an auxiliary function class that operates on reduced dimensional inputs, and a new complexity term, as the distortion of the loss under random projections. The latter is a hypothesis-dependent data-complexity, whose analytic estimates turn out to recover various regularisation schemes in parametric models, and a notion of intrinsic dimension, as quantified by the Gaussian width of the input support in the case of the nearest neighbour rule. If there is benign geometry present, then the bounds become tighter, otherwise they recover the original dimension-dependent bounds.

AB - Learning from high dimensional data is challenging in general – however, often the data is not truly high dimensional in the sense that it may have some hidden low complexity geometry. We give new, user-friendly PAC-bounds that are able to take advantage of such benign geometry to reduce dimensional-dependence of error-guarantees in settings where such dependence is known to be essential in general. This is achieved by employing random projection as an analytic tool, and exploiting its structure-preserving compression ability. We introduce an auxiliary function class that operates on reduced dimensional inputs, and a new complexity term, as the distortion of the loss under random projections. The latter is a hypothesis-dependent data-complexity, whose analytic estimates turn out to recover various regularisation schemes in parametric models, and a notion of intrinsic dimension, as quantified by the Gaussian width of the input support in the case of the nearest neighbour rule. If there is benign geometry present, then the bounds become tighter, otherwise they recover the original dimension-dependent bounds.

U2 - 10.1609/aaai.v33i01.33014049

DO - 10.1609/aaai.v33i01.33014049

M3 - Conference contribution

SN - 978-1-57735-809-1

T3 - Proceedings of the AAAI Conference on Artificial Intelligence

SP - 4049

EP - 4056

BT - Thirty Third AAAI Conference on Artificial Intelligence (AAAI-19)

PB - AAAI Press

T2 - Thirty Third AAAI Conference on Artificial Intelligence (AAAI-19)

Y2 - 27 January 2019 through 1 February 2019

ER -
