Variance Based Samples Weighting for Supervised Deep Learning - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2020

Variance Based Samples Weighting for Supervised Deep Learning

Résumé

Machine Learning (ML) aims at approximating functions defined on a measured space with a model. A relevant choice of distribution for the training data set can improve the performances of a given ML model. We claim and empirically justify that an ML model yields better results when the data set focuses on regions where the function to learn is steeper. We first traduce this assumption in a mathematically workable way. Then, theoretical derivations allow to construct a methodology that we call Variance Based Samples Weighting (VBSW). VBSW uses local variance of the labels to weight the training points. This methodology is general, scalable and cost effective. It is validated on Deep Learning models like Bert [10] or ResNet [14] by significantly increasing their performances for various Natural Language Processing and image classification tasks.
Fichier principal
Vignette du fichier
full_paper.pdf (721.35 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02885827 , version 1 (01-07-2020)
hal-02885827 , version 2 (19-01-2021)
hal-02885827 , version 3 (28-01-2021)
hal-02885827 , version 4 (27-09-2022)

Identifiants

  • HAL Id : hal-02885827 , version 1

Citer

Paul Novello, Gaël Poëtte, David Lugato, Pietro Marco Congedo. Variance Based Samples Weighting for Supervised Deep Learning. 2020. ⟨hal-02885827v1⟩
312 Consultations
330 Téléchargements

Partager

Gmail Facebook X LinkedIn More