zbMATH — the first resource for mathematics

Classification with a reject option using a hinge loss. (English) Zbl 1225.62080
Summary: We consider the problem of binary classification where the classifier can, for a particular cost, choose not to classify an observation. Just as in the conventional classification problem, minimization of the sample average of the cost is a difficult optimization problem. As an alternative, we propose the optimization of a certain convex loss function \(\varphi \), analogous to the hinge loss used in support vector machines (SVMs). Its convexity ensures that the sample average of this surrogate loss can be efficiently minimized. We study its statistical properties. We show that minimizing the expected surrogate loss – the \(\varphi \)-risk – also minimizes the risk. We also study the rate at which the \(\varphi \)-risk approaches its minimum value. We show that fast rates are possible when the conditional probability \(P(Y=1|X)\) is unlikely to be close to certain critical values.

62H30 Classification and discrimination; cluster analysis (statistical aspects)
68T05 Learning and adaptive systems in artificial intelligence
62F15 Bayesian inference
Full Text: Link