Accéder directement au contenu Accéder directement à la navigation
Communication dans un congrès

Rotation Equivariant Vector Field Networks

Abstract : In many computer vision tasks, we expect a particular behavior of the output with respect to rotations of the input image. If this relationship is explicitly encoded, instead of treated as any other variation, the complexity of the problem is decreased, leading to a reduction in the size of the required model. In this paper, we propose the Rotation Equivariant Vector Field Networks (RotEqNet), a Convolutional Neural Network (CNN) architecture encoding rotation equivariance, invariance and covariance. Each convolutional filter is applied at multiple orientations and returns a vector field representing magnitude and angle of the highest scoring orientation at every spatial location. We develop a modified convolution operator relying on this representation to obtain deep architectures. We test RotEqNet on several problems requiring different responses with respect to the inputs' rotation: image classification, biomedical image segmentation, orientation estimation and patch matching. In all cases, we show that RotEqNet offers extremely compact models in terms of number of parameters and provides results in line to those of networks orders of magnitude larger.
Type de document :
Communication dans un congrès
Liste complète des métadonnées
Contributeur : Pascal Monasse <>
Soumis le : mercredi 4 juillet 2018 - 15:03:35
Dernière modification le : mercredi 3 février 2021 - 07:54:28

Lien texte intégral



Diego Marcos, Michele Volpi, Nikos Komodakis, Devis Tuia. Rotation Equivariant Vector Field Networks. 2017 IEEE International Conference on Computer Vision (ICCV), Oct 2017, Venice, France. ⟨10.1109/ICCV.2017.540⟩. ⟨hal-01830037⟩



Consultations de la notice