The benefits of modeling slack variables in SVMs

Fengzhen Tang, Peter Tiňo, Pedro Antonio Gutiérrez, Huanhuan Chen

Research output: Contribution to journal › Article › peer-review

3 Citations (Scopus)

Abstract

In this letter, we explore the idea of modeling slack variables in support vector machine (SVM) approaches. The study is motivated by SVM+, which models the slacks through a smooth correcting function that is determined by additional (privileged) information about the training examples not available in the test phase. We take a closer look at the meaning and consequences of smooth modeling of slacks, as opposed to determining them in an unconstrained manner through the SVM optimization program. To better understand this difference, we only allow the determination and modeling of slack values on the same information, that is, using the same training input in the original input space. We also explore whether it is possible to improve classification performance by combining (in a convex combination) the original SVM slacks with the modeled ones. We show experimentally that this approach not only leads to improved generalization performance but also yields more compact, lower-complexity models. Finally, we extend this idea to the context of ordinal regression, where a natural order among the classes exists. The experimental results confirm principal findings from the binary case.
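The slack-modeling idea summarized above can be sketched in standard soft-margin SVM notation. This is an illustrative sketch only; the symbols below are conventional and not taken verbatim from the paper.

```latex
% Standard soft-margin SVM: slacks \xi_i are free variables
% determined by the optimization program.
\min_{w,\,b,\,\xi} \;\; \frac{1}{2}\|w\|^2 + C \sum_{i=1}^{n} \xi_i
\quad \text{s.t.} \quad
y_i\bigl(\langle w, \phi(x_i)\rangle + b\bigr) \ge 1 - \xi_i,
\qquad \xi_i \ge 0.

% SVM+-style modeling: the slacks are instead given by a smooth
% correcting function (here illustrated as linear in a feature map
% \psi of the same training input x_i, per the setting of the paper):
\xi_i = \langle \tilde{w}, \psi(x_i)\rangle + d.

% Convex combination of free and modeled slacks (illustrative form,
% with mixing parameter \lambda \in [0,1]):
\hat{\xi}_i = \lambda\, \xi_i
  + (1-\lambda)\bigl(\langle \tilde{w}, \psi(x_i)\rangle + d\bigr).
```

Setting $\lambda = 1$ recovers the standard SVM with unconstrained slacks, while $\lambda = 0$ forces all slack mass through the smooth correcting function.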

Original language: English
Pages (from-to): 954-981
Number of pages: 28
Journal: Neural Computation
Volume: 27
Issue number: 4
DOIs
Publication status: Published - 19 Apr 2015

ASJC Scopus subject areas

  • Cognitive Neuroscience
  • Arts and Humanities (miscellaneous)
