Probabilistic Approach to One-Class Support Vector Machine - École des Ponts ParisTech
Preprint, Working Paper Year: 2016

Probabilistic Approach to One-Class Support Vector Machine

Abstract

Classification is one of the main problems addressed by machine learning algorithms. Among them, the Support Vector Machine (SVM) has attracted a lot of interest and shown success in the past decades. SVMs were originally tailored for binary classification. If we have only a few examples from the negative class, we can turn to the one-class SVM. In this paper we propose a probabilistic interpretation of the one-class SVM approach and an extension especially adapted to the case of highly imbalanced datasets. Specifically, we consider a binary classification problem where we represent the negative dataset by its first two moments, while still modeling the positive class by individual examples. The optimization problem is shown to have a formulation equivalent to a one-class SVM applied to the positive dataset after some preprocessing. The usual one-class SVM corresponds to the case where the negative class has mean 0 and identity variance. We show empirically, on a protein classification task and a text classification task, that our approach achieves statistical performance similar to that of the two mainstream approaches to imbalanced classification problems, while being more computationally efficient.
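The abstract does not spell out the preprocessing step, so the following is only a minimal sketch under a stated assumption: that the positive examples are centered by the negative-class mean and whitened by the negative-class covariance. This choice is consistent with the remark that the usual one-class SVM corresponds to mean 0 and identity variance, since the transformation then leaves the data unchanged. The function name `whiten_with_negative_moments` and the synthetic data are hypothetical and do not come from the paper.

```python
import numpy as np


def whiten_with_negative_moments(X_pos, mu_neg, sigma_neg):
    """Map each row x to L^{-1} (x - mu_neg), where L L^T = sigma_neg.

    Assumed preprocessing (not taken from the paper): after this map,
    data with mean mu_neg and covariance sigma_neg has mean 0 and
    identity covariance.
    """
    L = np.linalg.cholesky(sigma_neg)      # lower-triangular factor
    centered = X_pos - mu_neg              # subtract the negative-class mean
    return np.linalg.solve(L, centered.T).T


# Synthetic data for illustration only.
rng = np.random.default_rng(0)
mixing = np.array([[2.0, 0.0, 0.0],
                   [0.5, 1.0, 0.0],
                   [0.0, 0.3, 0.5]])
X_neg = rng.normal(size=(500, 3)) @ mixing + np.array([1.0, -2.0, 0.5])
X_pos = rng.normal(size=(20, 3)) + 3.0

# The negative class is summarized by its first two moments only.
mu_neg = X_neg.mean(axis=0)
sigma_neg = np.cov(X_neg, rowvar=False)

# Positive examples are kept individually and transformed one by one.
Z_pos = whiten_with_negative_moments(X_pos, mu_neg, sigma_neg)

# Sanity check on the negatives themselves: mean 0, identity covariance.
Z_neg = whiten_with_negative_moments(X_neg, mu_neg, sigma_neg)
```

Under this assumption, the transformed positives `Z_pos` could then be passed to any one-class SVM implementation, for example scikit-learn's `OneClassSVM` with a linear kernel. Only the two moments of the negative class are needed at that point, not the negative examples themselves.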
Main file
aistats2017b-hal.pdf (324.41 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01404973, version 1 (05-12-2016)

Identifiers

  • HAL Id: hal-01404973, version 1

Cite

Vincent Leclère, Edouard Grave, Laurent El Ghaoui. Probabilistic Approach to One-Class Support Vector Machine. 2016. ⟨hal-01404973⟩
321 views
1395 downloads
