Séminaire (Organisé par l’Equipe de recherche DI)

Grégoire MESNIL

Doctorant, Université de Montréal et Université de Rouen

Learning Semantic Representations of Objects and Parts

Mardi 15 janvier 2013 à 11h en salle A212

Résumé :

Recently, large scale image annotation datasets have been collected with millions of images and thousands of possible annotations. Latent variable models, or embedding methods, that simultaneously learn semantic representations of object labels and image representations can provide tractable solutions on such tasks.

In this work, we are interested in jointly learning representations both for the objects in an image, and the parts of those objects, because such deeper semantic representations could bring a leap forward in image retrieval or browsing. Despite the size of these datasets, the amount of annotated data for objects and parts can be costly and may not be available. In this paper, we propose to bypass this cost with a method able to learn to jointly label objects and parts without requiring exhaustively labeled data.

We design a model architecture that can be trained under a proxy supervision obtained by combining standard image annotation (from ImageNet) with semantic part-based within-label relations (from WordNet). The model itself is designed to model both object image to object label similarities, and object label to object part label simi- larities in a single joint system. Experiments conducted on our combined data and a precisely annotated evaluation set demonstrate the usefulness of our approach.


Mardi 20 juin 2017

Séminaire à 14h en GI042 (bâtiment Blaise Pascal), présenté par Patrice Perny, LIP6.
« Décision interactive sur domaine combinatoire par élicitation incrémentale de préférences ».

Jeudi 11 mai 2017

Séminaire à 14h en GI042 (bâtiment Blaise Pascal), présenté par Nicolas Maudet, LIP6 (Equipe SMA).
« Current issues in argumentation ».

Mardi 4 avril 2017

Séminaire à 14 h dans l’amphi du Centre d’Innovation de l’UTC, présenté par Xavier LAGORCE, PhD, Head of Computer Vision, Chronocam.
« Chronocam : Event-based cameras for machine vision »

Jeudi 27 octobre 2016

Séminaire à 14 h en GI016, présenté par Fabien Pfaender, UTSEUS.
« State of Cities - A Massive, Systematic, Data Powered, Comparative Analysis Of Cities ».

Pages 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | ...


FR SHIC 3272

Collegium UTC/CNRS