A semi-supervised Learning Approach to find equivalent long-string Organization Names - École des Ponts ParisTech Accéder directement au contenu
Poster Année : 2016

A semi-supervised Learning Approach to find equivalent long-string Organization Names

Résumé

Background: A platform called Opalia has been built to propose free access to all publications about a laboratory for a given range of years. This platform makes indexing of a corpus of a scientific article of a given lab. But in the French research system, a lab includes researchers from different organizations in the same unit generally called. UMR. Authors can write their laboratory names differently. Aim: Sorting a set of labels that is noisy can be seen as a binary classification into positives and leave negatives strings. We propose to use a cascade processing with the help of tagging some positive strings to build a relevant space of features that helps classification into good labels.
Fichier principal
Vignette du fichier
poster_EXIA_v1.pdf (674.27 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02310298 , version 1 (10-10-2019)

Identifiants

  • HAL Id : hal-02310298 , version 1

Citer

Frédérique Bordignon, Nicolas Turenne, Yann Feugueur. A semi-supervised Learning Approach to find equivalent long-string Organization Names. Colloque- Forum PEPS EXIA, Oct 2016, Champs sur Marne, France. 2016. ⟨hal-02310298⟩
91 Consultations
28 Téléchargements

Partager

Gmail Facebook X LinkedIn More