Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
About

About

Master Degree in Informatics Engineering - Architecture, Systems and Networks, after having graduated in Informatics Engineering, both in ISEP. Currently attending the Doctoral Program in Informatics Engineering (ProDEI) at FEUP.

Interested in developping interactive, efficient and intuitve software for various purposes, according to any client needs. Mobile, Multimedia, Web and DB are interesting areas where it would like to work and expand current knowledge.

Previous work in the following technologies: Java, C, C++, C#, SQL, .NET Framework, UML, XML, MySQL, HTML, PHP, CSS, TCP/IP, OpenWRT, Hibernate, Android SDK, ActionScript, TCP and UDP, RDP, H.264, MPEG-DASH, Hibernate, OpenGL, DirectX. Also worked on previous projects in areas such as Wireless Networks (802.11), Scripting, Software Engineering and Web Services.

Interest
Topics
Details

Details

Publications

2023

A Dataset for User Visual Behaviour with Multi-View Video Content

Authors
Soares Da Costa, T; Andrade, MT; Viana, P; Silva, NC;

Publication
MMSys 2023 - Proceedings of the 14th ACM Multimedia Systems Conference

Abstract

2022

Photo2Video: Semantic-Aware Deep Learning-Based Video Generation from Still Content

Authors
Viana, P; Andrade, MT; Carvalho, P; Vilaca, L; Teixeira, IN; Costa, T; Jonker, P;

Publication
JOURNAL OF IMAGING

Abstract
Applying machine learning (ML), and especially deep learning, to understand visual content is becoming common practice in many application areas. However, little attention has been given to its use within the multimedia creative domain. It is true that ML is already popular for content creation, but the progress achieved so far addresses essentially textual content or the identification and selection of specific types of content. A wealth of possibilities are yet to be explored by bringing the use of ML into the multimedia creative process, allowing the knowledge inferred by the former to influence automatically how new multimedia content is created. The work presented in this article provides contributions in three distinct ways towards this goal: firstly, it proposes a methodology to re-train popular neural network models in identifying new thematic concepts in static visual content and attaching meaningful annotations to the detected regions of interest; secondly, it presents varied visual digital effects and corresponding tools that can be automatically called upon to apply such effects in a previously analyzed photo; thirdly, it defines a complete automated creative workflow, from the acquisition of a photograph and corresponding contextual data, through the ML region-based annotation, to the automatic application of digital effects and generation of a semantically aware multimedia story driven by the previously derived situational and visual contextual data. Additionally, it presents a variant of this automated workflow by offering to the user the possibility of manipulating the automatic annotations in an assisted manner. The final aim is to transform a static digital photo into a short video clip, taking into account the information acquired. The final result strongly contrasts with current standard approaches of creating random movements, by implementing an intelligent content- and context-aware video.

2021

SmoothMV: Seamless Content Adaptation through Head Tracking Analysis and View Prediction

Authors
da Costa, TS; Andrade, MT; Viana, P;

Publication
PROCEEDINGS OF THE 2021 INTERNATIONAL WORKSHOP ON IMMERSIVE MIXED AND VIRTUAL ENVIRONMENT SYSTEMS (MMVE '21)

Abstract
Multi-view has the potential to offer immersive viewing experiences to users, as an alternative to 360 degrees and Virtual Reality (VR) applications. In multi-view, a limited number of camera views are sent to the client and missing views are synthesised locally. Given the substantial complexity associated to view synthesis, considerable attention has been given to optimise the trade-off between bandwidth gains and computing resources, targeting smooth navigation and viewing quality. A still relatively unexplored field is the optimisation of the way navigation interactivity is achieved, i.e. how the user indicates to the system the selection of new viewpoints. In this article, we introduce SmoothMV, a multi-view system that uses a non-intrusive head tracking approach to enhance navigation and Quality of Experience (QoE) of the viewer. It relies on a novel Hot&Cold matrix concept to translate head positioning data into viewing angle selections. Streaming of selected views is done using MPEG-DASH, where a proposed extension to the standard descriptors enables to achieve consistent and flexible view identification.

2020

Semantic Storytelling Automation: A Context-Aware and Metadata-Driven Approach

Authors
Viana, P; Carvalho, P; Andrade, MT; Jonker, PP; Papanikolaou, V; Teixeira, IN; Vilaça, L; Pinto, JP; Costa, T;

Publication
MM '20: The 28th ACM International Conference on Multimedia, Virtual Event / Seattle, WA, USA, October 12-16, 2020

Abstract
Multimedia content production is nowadays widespread due to technological advances, namely supported by smartphones and social media. Although the massive amount of media content brings new opportunities to the industry, it also obfuscates the relevance of marketing content, meant to maintain and lure new audiences. This leads to an emergent necessity of producing these kinds of contents as quickly and engagingly as possible. Creating these automatically would decrease both the production costs and time, particularly by using static media for the creation of short storytelling animated clips. We propose an innovative approach that uses context and content information to transform a still photo into an appealing context-aware video clip. Thus, our solution presents a contribution to the state-of-the-art in computer vision and multimedia technologies and assists content creators with a value-added service to automatically build rich contextualized multimedia stories from single photographs. © 2020 Owner/Author.

2020

Ubiquitous Framework for High Quality Audiovisual Production

Authors
Andrade, MT; Santos, P; Costa, TS; Freitas, L; Golestani, S; Viana, P; Rodrigues, J; Ulisses, A;

Publication
Proceedings - 2020 TRON Symposium, TRONSHOW 2020

Abstract
The media sector is constantly evolving and, in the last few years, such evolution has been driven by a number of convergence paradigms, notably, that between broadband and broadcast technologies with the introduction of IT and IP technology. The present trend is to switch totally from a closed niche that uses highly specialized equipment to off-the-shelf IT-centric solutions, offering easy configuration and remote operation. The aim is to enable common computers to be turned into highly capable media devices and act as connected objects adopting an IoT-like paradigm. This vision, though, is not implemented easily, given that most media industry professionals do not yet feel comfortable operating in the IT technology space and also due to the stringent requirements that exist in this industry. The Joint Task Force on Networked Media is defining specifications that aim at overcoming such existing barriers. In this article we present a novel solution that follows the guidelines delivered by this group to set up a remotely operated media production facility, totally based on IP and IT technology, constituting a step forward the realization of the IoT concept in professional media environments. The focus is on two complementary components, namely, the GUI Agent and the MW Agent, which are not covered by the defined specifications but that are crucial to speed up the deployment of concrete solutions that can be easily operated by non-IT and non-IP experts in a transparent and ubiquitous way. © 2020 TRON Forum.

Supervised
thesis

2022

Previsão de Largura de Banda para Streaming Adaptativo de Vídeo

Author
Gustavo Manuel Esteves Pelayo

Institution
UP-FEUP

2022

Prediction of Visual Behaviour in Immersive Contents

Author
Nuno Rodrigues de Castro Santos Silva

Institution
UP-FEUP