Marc Masana @ PhD

Conferences

Metric Learning for Novelty and Anomaly Detection

Authors: Marc Masana, Idoia Ruiz, Joan Serrat, Joost Van de Weijer, Antonio M. Lopez

British Machine Vision Conference (BMVC), 2018

LIUM-CVC Submissions for WMT18 Multimodal Translation Task

Authors: Ozan Caglayan, Adrien Bardet, Fethi Bougares, Loïc Barrault, Kai Wang, Marc Masana, Luis Herranz, Joost van de Weijer.

Publication accepted at WMT 2018 after winning the Multimodal Machine Translation challenge (WMT), 2018

Find out more

Rotate your Networks: Better Weight Consolidation and Less Catastrophic Forgetting

Authors: Xialei Liu*, Marc Masana*, Luis Herranz, Joost Van de Weijer, Antonio M. Lopez, Andrew D Bagdanov

International Conference on Pattern Recognition (ICPR), 2018

Find out more

Domain-adaptive deep network compression

Authors: Marc Masana, Joost van de Weijer, Luis Herranz, Andrew D Bagdanov, Jose M Alvarez

International Conference on Computer Vision (ICCV), 2017

Find out more

Lium-cvc submissions for wmt17 multimodal translation task

Authors: Ozan Caglayan, Walid Aransa, Adrien Bardet, Mercedes García-Martínez, Fethi Bougares, Loïc Barrault, Marc Masana, Luis Herranz and Joost van de Weijer.

Publication accepted at WMT 2017 after winning the Multimodal Machine Translation challenge (WMT), 2017

Find out more

Hierarchical part detection with deep neural networks

Authors: Esteve Cervantes, Long Long Yu, Andrew D Bagdanov, Marc Masana, Joost van de Weijer

IEEE International Conference on Image Processing (ICIP), 2016

Find out more

Does multimodality help human and machine for translation and image captioning?

Authors: Ozan Caglayan, Walid Aransa, Yaxing Wang, Marc Masana, Mercedes García-Martínez, Fethi Bougares, Loïc Barrault, Joost Van de Weijer.

Publication accepted at WMT 2016 after winning the Multimodal Machine Translation challenge (WMT), 2016

Find out more

Journals

Class-incremental learning: survey and performance evaluation on image classification

Authors: Marc Masana, Xialei Liu, Bartłomiej Twardowski, Mikel Menta, Andrew D. Bagdanov, Joost van de Weijer.

Submitted preprint, 2020

Find out more

A continual learning survey: Defying forgetting in classification tasks

Authors: Matthias De Lange, Rahaf Aljundi, Marc Masana, Sarah Parisot, Xu Jia, Ales Leonardis, Gregory Slabaugh, Tinne Tuytelaars.

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

Find out more

Automated mitral valve vortex ring extraction from 4D‐flow MRI

Authors: Corina Kräuter, Ursula Reiter, Clemens Reiter, Volha Nizhnikava, Marc Masana, Albrecht Schmidt, Michael Fuchsjäger, Rudolf Stollberger, Gert Reiter.

Magnetic Resonance in Medicine (MRM), 2020

Find out more

Saliency from High-Level Semantic Image Features

Authors: Aymen Azaza, Joost van de Weijer, Ali Douik, Javad Zolfaghari, Marc Masana.

SN Computer Science (Springer), 2020

Find out more

GTCreator: a Flexible Annotation Tool for Image-based Datasets

Authors: Jorge Bernal, Aymeric Histace, Marc Masana, Quentin Angermann, Cristina Sánchez-Montes, Cristina Rodríguez de Miguel, Maroua Hammami, Ana García-Rodríguez, Henry Córdova, Olivier Romain, Gloria Fernández-Esparrach, Xavier Dray, F. Javier Sánchez.

International Journal of Computer Assisted Radiology and Surgery (IJCARS), 2018

Find out more

Context Proposals for Saliency Detection

Authors: Aymen Azaza, Joost van de Weijer, Ali Douik, Marc Masana.

Journal on Computer Vision and Image Understanding (CVIU), 2018

Find out more

Workshops

On the importance of cross-task features for class-incremental learning

Authors: Albin Soutif--Cormerais, Marc Masana, Joost Van de Weijer, Bartłomiej Twardowski.

International Conference on Machine Learning - Theory and Foundation in Continual Learning (ICML-W), 2021

Find out more

Ternary Feature Masks: zero-forgetting for task-incremental learning

Authors: Marc Masana, Tinne Tuytelaars, Joost van de Weijer.

Computer Vision and Pattern Recognition - Workshop on Continual Learning (CLVISION), 2021

Find out more

Avalanche: an End-to-End Library for Continual Learning

Authors: Vincenzo Lomonaco, Lorenzo Pellegrini, Andrea Cossu, Antonio Carta, Gabriele Graffieti, Tyler L Hayes, Matthias De Lange, Marc Masana, Jary Pomponi, Gido M van de Ven, Martin Mundt, Qi She, Keiland Cooper, Jeremy Forest, Eden Belouadah, Simone Calderara, German I Parisi, Fabio Cuzzolin, Andreas S Tolias, Simone Scardapane, Luca Antiga, Subutai Ahmad, Adrian Popescu, Christopher Kanan, Joost van de Weijer, Tinne Tuytelaars, Davide Bacciu, Davide Maltoni.

Computer Vision and Pattern Recognition - Workshop on Continual Learning (CLVISION), 2021

Find out more

On Class Orderings for Incremental Learning

Authors: Marc Masana, Bartłomiej Twardowski, Joost van de Weijer

International Conference on Machine Learning - Workshop on Continual Learning (CL-ICML), 2020

Find out more

Disentanglement of Color and Shape Representations for Continual Learning

Authors: David Berga, Marc Masana, Joost Van de Weijer

International Conference on Machine Learning - Workshop on Continual Learning (CL-ICML), 2020

Find out more

On-the-fly Network Pruning for Object Detection

Authors: Marc Masana, Joost van de Weijer, Andrew D Bagdanov

International Conference on Learning Representations (ICLR), 2016

Find out more

Books

Lifelong Learning of Neural Networks: Detecting Novelty and Adapting to New Domains without Forgetting

Authors: Marc Masana

PhD thesis, 2020

Abstract: Computer vision has gone through considerable changes in the last decade as neural networks have come into common use. As available computational capabilities have grown, neural networks have achieved breakthroughs in many computer vision tasks, and have even surpassed human performance in others. With accuracy being so high, focus has shifted to other issues and challenges. One research direction that saw a notable increase in interest is on lifelong learning systems. Such systems should be capable of efficiently performing tasks, identifying and learning new ones, and should moreover be able to deploy smaller versions of themselves which are experts on specific tasks. In this thesis, we contribute to research on lifelong learning and address the compression and adaptation of networks to small target domains, the incremental learning of networks faced with a variety of tasks, and finally the detection of out-of-distribution samples at inference time. We explore how knowledge can be transferred from large pretrained models to more task-specific networks capable of running on smaller devices by extracting the most relevant information based on activation statistics. Using a pretrained model provides more robust representations and a more stable initialization when learning a smaller task, which leads to higher performance and is known as domain adaptation. However, those models are too large for certain applications that need to be deployed on devices with limited memory and computational capacity. In this thesis we show that, after performing domain adaptation, some learned activations barely contribute to the predictions of the model. Therefore, we propose to apply network compression based on low-rank matrix decomposition using the activation statistics. This results in a significant reduction of the model size and the computational cost. Like human intelligence, machine intelligence aims to have the ability to learn and remember knowledge. However, when a trained neural network is presented with learning a new task, it ends up forgetting previous ones. This is known as catastrophic forgetting and its avoidance is studied in continual learning. The work presented in this thesis extensively surveys continual learning techniques (both when knowing the task-ID at test time or not) and presents an approach to avoid catastrophic forgetting in sequential task learning scenarios. Our technique is based on using ternary masks in order to update a network to new tasks, reusing the knowledge of previous ones while not forgetting anything about them. In contrast to earlier work, our masks are applied to the activations of each layer instead of the weights. This considerably reduces the number of mask parameters to be added for each new task; with more than three orders of magnitude for most networks. Furthermore, the analysis on a wide range of work on incremental learning without access to the task-ID, provides insight on current state-of-the-art approaches that focus on avoiding catastrophic forgetting by using regularization, rehearsal of previous tasks from a small memory, or compensating the task-recency bias. We also consider the problem of out-of-distribution detection. Neural networks trained with a cross-entropy loss force the outputs of the model to tend toward a one-hot encoded vector. This leads to models being too overly confident when presented with images or classes that were not present in the training distribution. The capacity of a system to be aware of the boundaries of the learned tasks and identify anomalies or classes which have not been learned yet is key to lifelong learning and autonomous systems. In this thesis, we present a metric learning approach to out-of-distribution detection that learns the task at hand on an embedding space.

Find out more

Interactive Visual and Semantic Image Retrieval

Authors: Joost Van De Weijer, Fahad Khan, Marc Masana

Multimodal Interaction in Image and Video Applications (pages 31-45), Springer 2013

Find out more

Projects

Spanish Ministry funding

Deep Multi-Task Learning for Object Recognition

CHIST-ERA - M2CR

Multimodal Multilingual Continuous Representation for Human Language Understanding

MINECO “Excelencia” funding

Closing the loop: bio-inspired top-down feedback for computational vision systems

Master Thesis:

Context-based pruning for scalable object detection

Advisors: Andrew D. Bagdanov and Joost van de Weijer

Find out more

Work Experience

Post-doc - Computer Vision Center (Des 2020 - Apr 2021)

Collaborate within the Learning and Machine Perception (LAMP) group mainly on continual learning projects.

PhD Stay - KU Leuven - PSI group (May 2018 - Aug 2018)

Research on LifeLong Learning under the supervision of Tinne Tuytelaars.

Knowledge Transfer - International Automotive Company (Jun 2017 - Mar 2018)

Research and implementation of a framework including novelty/anomaly detection, data generation and lifelong learning methods.

Knowledge Transfer - EURECAT (Apr 2016 - Sep 2016)

Assist on the master thesis of Olaia Artieda “Automatic MEME discovery” about distance learning with siamese and triplet networks.

Knowledge Transfer - SADAKO Technologies (Dec 2014 - Aug 2015)

Assist in the design and optimization of a Computer Vision pipeline.

Support researcher - Computer Vision Center (Mar 2012 - Jul 2015)

Collaborate within the Color in Context (CiC) and the Learning and Machine Perception (LAMP) groups at different research projects on the topics of: image classification, object recognition and detection, image retrieval, illuminant estimation, color descriptors and neural network pruning.

Scholarship holder - Institute of Law and Technology (Feb 2009 - Feb 2012)

Collaborate at different research projects with database management, webpage management and video transcription.

Seminars and Lectures

Seminar - KU Leuven - PSI Seminar (May 2018)

"Two talks on Deep Networks”, by Marc Masana. Presentation on "Domain-adaptive deep network compression" and "Rotate your Networks".

Lecture - BigSkyEarth Training School (Apr 2018)

"Deep Learning Frameworks”, by Marc Masana. COST Action. Special focus on Tensorflow, Tensorboard and PyTorch. Hands-on session on Tensorflow (code).

Lecture - Master in Computer Vision (Mar 2018)

"Deep Learning Frameworks”, by Marc Masana. Special focus on Tensorflow, Tensorboard and PyTorch. Master in Computer Vision Module 5.

Seminar - LifeLong Learning Seminar (Feb 2018)

"Lifelong Learning Seminar”, by Joan Serrà, Xialei Liu and Marc Masana. CVC Seminars.

Lecture - Master in Computer Vision (Mar 2017)

"Deep Learning Frameworks”, by Joan Serrat and Marc Masana. Special focus on Caffe and Matconvnet. Master in Computer Vision Module 5.

Seminar - Hands-On Deep Learning (Mar 2016)

This is an attempt of showing practical concepts of deep learning and convolutional neural networks (CNNs) through useful examples using the MatConvNet framework. 6 week seminar with theoretical and hands-on sessions by German Ros, Joost van de Weijer, Marc Masana and Yaxing Wang.

Bio

I received my B.Sc. degrees in Mathematics and Computer Science from the Universitat Autònoma de Barcelona in 2014 and my M.Sc. degree (with honours) in Computer Vision from the Universitat Autònoma de Barcelona (UAB) in 2015. I finished at Top 5 and was awarded as "Best Master Thesis". In 2012 I joined the Computer Vision Center as Support Researcher. I obtained a Ph.D. degree under the supervision of Dr. Joost van de Weijer and Dr. Andrew D. Bagdanov in 2020. My main research interests include Deep Neural Networks, Object Detection, Network Compression and Continual Learning.

Institute of Computer Graphics and Vision
Graz University of Technology, Inffeldgasse 16/II
8010 Graz, Austria

Phone:+43 316 873-5056
Email:marc.masana <at> icg.tugraz.at
Website:TUGraz online visit card

Education

PhD in Computer Vision

Cum laude

Universitat Autònoma de Barcelona

Computer Vision Center

(2015-2020)

MSc in Computer Vision

with Honours

Universitat Autònoma de Barcelona

Universitat Oberta de Catalunya

Universitat Pompeu Fabra

Universitat Politècnica de Catalunya

(2014-2015)

BSc + MSc Computer Science

Universitat Autònoma de Barcelona

(2006-2014)

BSc Mathematics

Universitat Autònoma de Barcelona

(2006-2014)

Awards & Distinctions

Best Master Thesis Project

Universitat Autònoma de Barcelona

Computer Vision Center

(2015)

Business Track Award

winning team out of 10

Accenture Digital Datathon

(2016)

Reviewer for:

Journals

TPAMI

Neural Computation

IEEE Transactions on Multimedia

Conferences

Computer Vision and Pattern Recognition (CVPR)

International Conference in Computer Vision (ICCV)

European Conference in Computer Vision (ECCV)

IEEE Winter Conf. on
Applications of Computer Vision (WACV)

IEEE International Conference on
Multimedia and Expo (ICME)

Asian Conference on Computer Vision (ACCV)

MICCAI Gastrointestinal Image Analysis (GIANA)

Computer Vision Theory and Applications (VISAPP)

Languages

Catalan: Native Speaker
Spanish: Native Speaker
English: Professional Proficiency
German: Beginner
Swedish: Basic User
Chinese: 不是真的