DI-ENS - Département d'informatique de l'École normale supérieure, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique : UMR8548
Abstract: Sampling is a popular way of scaling up machine learning algorithms to large datasets. The question is often how many samples are needed. Adaptive stopping algorithms monitor performance online and can stop early, saving valuable resources. We consider problems where probabilistic guarantees are desired and demonstrate how recently introduced empirical Bernstein bounds can be used to design efficient stopping rules. We provide upper bounds on the sample complexity of the new rules, as well as empirical results on model selection and boosting in the filtering setting.
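To make the idea concrete, here is a minimal Python sketch of one possible stopping rule built on an empirical Bernstein bound. It is an illustration under stated assumptions, not the algorithm from the paper: it targets a fixed absolute error `eps` for the mean of bounded samples, uses the standard empirical Bernstein radius sqrt(2 V ln(3/d) / n) + 3 R ln(3/d) / n (V the empirical variance, R the range of the samples), and spreads the failure probability over steps as d_n = delta / (n (n + 1)) so that the guarantee holds at the random stopping time. The names `eb_radius`, `adaptive_mean`, `sample`, `value_range` and `max_samples` are hypothetical.

```python
import math
import random


def eb_radius(n, var, value_range, delta):
    """Empirical Bernstein confidence radius for the mean of n bounded samples."""
    log_term = math.log(3.0 / delta)
    return math.sqrt(2.0 * var * log_term / n) + 3.0 * value_range * log_term / n


def adaptive_mean(sample, value_range, eps, delta, max_samples=1_000_000):
    """Draw samples until the confidence radius drops to eps (illustrative rule).

    sample      -- zero-argument callable returning one bounded observation
    value_range -- assumed upper bound on (max value - min value)
    Returns (estimated mean, number of samples used).
    """
    n, mean, m2 = 0, 0.0, 0.0
    while n < max_samples:
        x = sample()
        n += 1
        # Welford's online update of the mean and the sum of squared deviations.
        d = x - mean
        mean += d / n
        m2 += d * (x - mean)
        var = max(0.0, m2 / n)  # empirical variance with 1/n normalisation
        # Union bound over steps: the delta_n sum to delta over all n >= 1.
        delta_n = delta / (n * (n + 1))
        if eb_radius(n, var, value_range, delta_n) <= eps:
            break
    return mean, n


if __name__ == "__main__":
    random.seed(0)
    # Bernoulli(0.3) observations, so the range is 1.
    estimate, used = adaptive_mean(lambda: float(random.random() < 0.3), 1.0, 0.05, 0.05)
    print(estimate, used)
```

Because the radius depends on the observed variance, low-variance inputs stop much earlier than a range-only (Hoeffding-style) rule would allow, which is the resource saving the abstract refers to.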
https://hal-enpc.archives-ouvertes.fr/hal-00834983
Contributor: Pascal Monasse
Submitted on: Tuesday, June 18, 2013 - 14:46:49
Last modified on: Tuesday, September 22, 2020 - 03:46:15
Long-term archiving on: Thursday, September 19, 2013 - 04:08:00