A semi-supervised Learning Approach to find equivalent long-string Organization Names - École des Ponts ParisTech Access content directly
Conference Poster Year : 2016

A semi-supervised Learning Approach to find equivalent long-string Organization Names

Abstract

Background: A platform called Opalia has been built to propose free access to all publications about a laboratory for a given range of years. This platform makes indexing of a corpus of a scientific article of a given lab. But in the French research system, a lab includes researchers from different organizations in the same unit generally called. UMR. Authors can write their laboratory names differently. Aim: Sorting a set of labels that is noisy can be seen as a binary classification into positives and leave negatives strings. We propose to use a cascade processing with the help of tagging some positive strings to build a relevant space of features that helps classification into good labels.
Fichier principal
Vignette du fichier
poster_EXIA_v1.pdf (674.27 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-02310298 , version 1 (10-10-2019)

Identifiers

  • HAL Id : hal-02310298 , version 1

Cite

Frédérique Bordignon, Nicolas Turenne, Yann Feugueur. A semi-supervised Learning Approach to find equivalent long-string Organization Names. Colloque- Forum PEPS EXIA, Oct 2016, Champs sur Marne, France. 2016. ⟨hal-02310298⟩
85 View
22 Download

Share

Gmail Facebook Twitter LinkedIn More