
About

I am a Coordinator Professor at the Polytechnic of Porto and a Researcher at INESC TEC, where I lead the Multimedia Communications Technology Area. I obtained my PhD from the University of Porto in the area of multimedia content management. I have been responsible for the participation of INESC TEC in several national and European projects involving universities and media industries. Author of several publications, I am also an active reviewer for journals and conferences and take part in organizing workshops and program committees in the area of multimedia. Recently, I co-chaired the Immersive Media Experiences workshop series (2013-2015) at ACM MM. I am also often engaged in the evaluation of European and Portuguese research proposals and projects. My main research activities and interests are in the field of networked audiovisual systems, including digital television and new services, content management, personalization and recommendation, new media formats, and immersive and interactive media.

Publications

2022

Photo2Video: Semantic-Aware Deep Learning-Based Video Generation from Still Content

Authors
Viana, P; Andrade, MT; Carvalho, P; Vilaca, L; Teixeira, IN; Costa, T; Jonker, P;

Publication
Journal of Imaging

Abstract
Applying machine learning (ML), and especially deep learning, to understand visual content is becoming common practice in many application areas. However, little attention has been given to its use within the multimedia creative domain. It is true that ML is already popular for content creation, but the progress achieved so far addresses essentially textual content or the identification and selection of specific types of content. A wealth of possibilities are yet to be explored by bringing the use of ML into the multimedia creative process, allowing the knowledge inferred by the former to influence automatically how new multimedia content is created. The work presented in this article provides contributions in three distinct ways towards this goal: firstly, it proposes a methodology to re-train popular neural network models in identifying new thematic concepts in static visual content and attaching meaningful annotations to the detected regions of interest; secondly, it presents varied visual digital effects and corresponding tools that can be automatically called upon to apply such effects in a previously analyzed photo; thirdly, it defines a complete automated creative workflow, from the acquisition of a photograph and corresponding contextual data, through the ML region-based annotation, to the automatic application of digital effects and generation of a semantically aware multimedia story driven by the previously derived situational and visual contextual data. Additionally, it presents a variant of this automated workflow by offering to the user the possibility of manipulating the automatic annotations in an assisted manner. The final aim is to transform a static digital photo into a short video clip, taking into account the information acquired. The final result strongly contrasts with current standard approaches of creating random movements, by implementing an intelligent content- and context-aware video.

2022

Automated Adequacy Assessment of Cervical Cytology Samples Using Deep Learning

Authors
Mosiichuk, V; Viana, P; Oliveira, T; Rosado, L;

Publication
Pattern Recognition and Image Analysis - Lecture Notes in Computer Science

2022

Symbolic Music Generation Conditioned on Continuous-Valued Emotions

Authors
Sulun, S; Davies, MEP; Viana, P;

Publication
IEEE Access

2022

Enhancing Photography Management Through Automatically Extracted Metadata

Authors
Carvalho, P; Freitas, D; Machado, T; Viana, P;

Publication
Intelligent Systems Design and Applications, ISDA 2021

Abstract
The tremendous increase in photographs that are captured each day by common users has been favoured by the availability of high quality devices at accessible costs, such as smartphones and digital cameras. However, the quantity of captured photos raises new challenges regarding the access and management of image repositories. This paper describes a lightweight distributed framework intended to help overcome these problems. It uses image metadata in EXIF format, already widely added to images by digital acquisition devices, and automatic facial recognition to provide management and search functionalities. Moreover, a visualization functionality using a graph-based strategy was integrated, enabling an enhanced and more interactive navigation through search results and the corresponding relations.

2022

Improving word embeddings in Portuguese: increasing accuracy while reducing the size of the corpus

Authors
Pinto, JP; Viana, P; Teixeira, IN; Andrade, MT;

Publication
PeerJ Comput. Sci.

Abstract
The subjectiveness of multimedia content description has a strong negative impact on tag-based information retrieval. In our work, we propose enhancing available descriptions by adding semantically related tags. To cope with this objective, we use a word embedding technique based on the Word2Vec neural network parameterized and trained using a new dataset built from online newspapers. A large number of news stories was scraped and pre-processed to build a new dataset. Our target language is Portuguese, one of the most spoken languages worldwide. The results achieved significantly outperform similar existing solutions developed in the scope of different languages, including Portuguese. Contributions include also an online application and API available for external use. Although the presented work has been designed to enhance multimedia content annotation, it can be used in several other application areas. © 2022. Pinto et al. Distributed under Creative Commons CC-BY 4.0

Supervised theses

2021

Improving quality and agility of safety-critical software development using domain-specific languages

Author
João Ricardo Faria Mendes Almeida Reis

Institution
UP-FEUP

2020

Automatic Emotion Identification: Analysis and Detection of Facial Expressions in Movies

Author
João Carlos Miranda de Almeida

Institution
UP-FEUP

2020

Implementação e análise de dados de uma rede IoT

Author
Rafael Neves Miranda

Institution
IPP-ISEP

2020

Video-based music generation

Author
Serkan Sulun

Institution
UP-FEUP

2020

Deteção de publicidade em conteúdos de televisão sem informação a priori

Author
Guilherme Dias Castro

Institution
UP-FEUP