About

I am currently a post-doctoral researcher at INESC TEC and the University of Minho. I obtained a PhD in Computer Science (2015) from the MAP-i Doctoral Program in Computer Science, a joint program of the Universities of Minho, Aveiro, and Porto in collaboration with CMU and UT Austin. I also hold an M.Sc. in Informatics Engineering (2009) and a B.Sc. in Informatics Engineering (2007), both from the University of Minho.

Currently, my research focuses on large-scale distributed systems, with an emphasis on storage systems and data management. I have published in journals and at international conferences, and I have participated in the research and development of EU-funded (CoherentPaaS, SafeCloud) and national (Pastramy, RED) projects.


Details

  • Name: João Tiago Paulo
  • Cluster: Computer Science
  • Role: Senior Researcher
  • Since: 1 November 2011
Publications

2017

SafeFS: a modular architecture for secure user-space file systems: one FUSE to rule them all

Authors
Pontes, Rogerio; Burihabwa, Dorian; Maia, Francisco; Paulo, Joao; Schiavoni, Valerio; Felber, Pascal; Mercier, Hugues; Oliveira, Rui

Publication
Proceedings of the 10th ACM International Systems and Storage Conference, SYSTOR 2017, Haifa, Israel, May 22-24, 2017

Abstract
The exponential growth of data produced, the ever faster and ubiquitous connectivity, and the collaborative processing tools lead to a clear shift of data stores from local servers to the cloud. This migration, occurring across different application domains and types of users, individual or corporate, raises two immediate challenges. First, outsourcing data introduces security risks, hence protection mechanisms must be put in place to provide guarantees such as privacy, confidentiality and integrity. Second, there is no "one-size-fits-all" solution that would provide the right level of safety or performance for all applications and users, and it is therefore necessary to provide mechanisms that can be tailored to the various deployment scenarios. In this paper, we address both challenges by introducing SafeFS, a modular architecture based on software-defined storage principles featuring stackable building blocks that can be combined to construct a secure distributed file system. SafeFS allows users to specialize their data store to their specific needs by choosing the combination of blocks that provide the best safety and performance tradeoffs. The file system is implemented in user space using FUSE and can access remote data stores. The provided building blocks notably include mechanisms based on encryption, replication, and coding. We implemented SafeFS and performed an in-depth evaluation across a range of workloads. Results reveal that while each layer has a cost, one can build safe yet efficient storage architectures. Furthermore, the different combinations of blocks sometimes yield surprising tradeoffs. © 2017 ACM.
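To illustrate the stackable-layer idea the abstract describes, below is a minimal Python sketch in which each storage layer implements the same read/write interface and delegates to the layer beneath it. The layer names, the XOR "obfuscation" stand-in for real encryption, and the interface itself are illustrative assumptions, not SafeFS's actual FUSE API.

# Illustrative sketch of stackable storage layers (not SafeFS's real API).
# Each layer exposes read/write and delegates to the layer beneath it,
# so blocks can be combined to trade off safety and performance.
import os
import tempfile


class PassthroughLayer:
    """Bottom layer: stores data as files in a local directory."""

    def __init__(self, root):
        self.root = root
        os.makedirs(root, exist_ok=True)

    def write(self, name, data):
        with open(os.path.join(self.root, name), "wb") as f:
            f.write(data)

    def read(self, name):
        with open(os.path.join(self.root, name), "rb") as f:
            return f.read()


class XorObfuscationLayer:
    """Toy 'encryption' layer: XORs bytes with a key before delegating.
    A real stack would use an authenticated cipher instead."""

    def __init__(self, lower, key=0x5C):
        self.lower = lower
        self.key = key

    def _xor(self, data):
        return bytes(b ^ self.key for b in data)

    def write(self, name, data):
        self.lower.write(name, self._xor(data))

    def read(self, name):
        return self._xor(self.lower.read(name))


class ReplicationLayer:
    """Writes every file to all lower layers; reads from the first."""

    def __init__(self, *lowers):
        self.lowers = lowers

    def write(self, name, data):
        for lower in self.lowers:
            lower.write(name, data)

    def read(self, name):
        return self.lowers[0].read(name)


# Users pick the combination of blocks they need, e.g. obfuscate then replicate:
fs = XorObfuscationLayer(ReplicationLayer(PassthroughLayer(tempfile.mkdtemp()),
                                          PassthroughLayer(tempfile.mkdtemp())))
fs.write("note.txt", b"hello")
assert fs.read("note.txt") == b"hello"

Reordering the same blocks (replicate then obfuscate) yields a different safety/performance tradeoff, which is the kind of specialization the paper evaluates.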

2017

DDFlasks: Deduplicated Very Large Scale Data Store

Authors
Maia, F; Paulo, J; Coelho, F; Neves, F; Pereira, J; Oliveira, R

Publication
Distributed Applications and Interoperable Systems - 17th IFIP WG 6.1 International Conference, DAIS 2017, Held as Part of the 12th International Federated Conference on Distributed Computing Techniques, DisCoTec 2017, Neuchâtel, Switzerland, June 19-22, 2017, Proceedings

Abstract
With the increasing number of connected devices, it becomes essential to find novel data management solutions that can leverage their computational and storage capabilities. However, developing very large scale data management systems requires tackling a number of interesting distributed systems challenges, namely continuous failures and high levels of node churn. In this context, epidemic-based protocols proved suitable and effective and have been successfully used to build DataFlasks, an epidemic data store for massive scale systems. Ensuring resiliency in this data store comes with a significant cost in storage resources and network bandwidth consumption. Deduplication has proven to be an efficient technique to reduce both costs but, applying it to a large-scale distributed storage system is not a trivial task. In fact, achieving significant space savings without compromising the resiliency and decentralized design of these storage systems is a relevant research challenge. In this paper, we extend DataFlasks with deduplication to design DDFlasks. This system is evaluated in a real world scenario using Wikipedia snapshots, and the results are twofold. We show that deduplication is able to decrease storage consumption up to 63% and decrease network bandwidth consumption by up to 20%, while maintaining a fully decentralized and resilient design. © IFIP International Federation for Information Processing 2017.
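The space savings reported above come from storing each unique piece of content only once. A minimal content-addressed deduplication sketch follows, assuming fixed-size chunking and SHA-256 fingerprints; DDFlasks' actual chunking and epidemic replication are considerably more involved, so this only shows the core mechanism.

# Minimal content-addressed deduplication sketch (illustrative, not DDFlasks).
# Fixed-size chunks are fingerprinted with SHA-256; a chunk is stored only
# once, and files become lists of fingerprints.
import hashlib

CHUNK_SIZE = 4096


class DedupStore:
    def __init__(self):
        self.chunks = {}   # fingerprint -> chunk bytes (stored once)
        self.files = {}    # file name -> ordered list of fingerprints

    def put(self, name, data):
        fps = []
        for i in range(0, len(data), CHUNK_SIZE):
            chunk = data[i:i + CHUNK_SIZE]
            fp = hashlib.sha256(chunk).hexdigest()
            self.chunks.setdefault(fp, chunk)  # no-op if already stored
            fps.append(fp)
        self.files[name] = fps

    def get(self, name):
        return b"".join(self.chunks[fp] for fp in self.files[name])

    def space_savings(self):
        logical = sum(len(self.chunks[fp])
                      for fps in self.files.values() for fp in fps)
        physical = sum(len(c) for c in self.chunks.values())
        return 1 - physical / logical if logical else 0.0


store = DedupStore()
store.put("v1", b"A" * 8192)
store.put("v2", b"A" * 8192)              # fully duplicate content
assert store.get("v2") == b"A" * 8192
print(f"savings: {store.space_savings():.0%}")  # 75% in this toy example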

2017

A Practical Framework for Privacy-Preserving NoSQL Databases

Authors
Macedo, R; Paulo, J; Pontes, R; Portela, B; Oliveira, T; Matos, M; Oliveira, R

Publication
36th IEEE Symposium on Reliable Distributed Systems, SRDS 2017, Hong Kong, Hong Kong, September 26-29, 2017

Abstract

2017

HTAPBench: Hybrid Transactional and Analytical Processing Benchmark

Authors
Coelho, F; Paulo, J; Vilaça, R; Pereira, JO; Oliveira, R

Publication
Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering, ICPE 2017, L'Aquila, Italy, April 22-26, 2017

Abstract
The increasing demand for real-time analytics requires the fusion of Transactional (OLTP) and Analytical (OLAP) systems, eschewing ETL processes and introducing a plethora of proposals for the so-called Hybrid Transactional and Analytical Processing (HTAP) systems. Unfortunately, current benchmarking approaches are not able to comprehensively produce a unified metric from the assessment of an HTAP system. The evaluation of both engine types is done separately, leading to the use of disjoint sets of benchmarks such as TPC-C or TPC-H. In this paper we propose a new benchmark, HTAPBench, providing a unified metric for HTAP systems geared toward the execution of constantly increasing OLAP requests limited by an admissible impact on OLTP performance. To achieve this, a load balancer within HTAPBench regulates the coexistence of OLTP and OLAP workloads, proposing a method for the generation of both new data and requests, so that OLAP requests over freshly modified data are comparable across runs. We demonstrate the merit of our approach by validating it with different types of systems: OLTP, OLAP and HTAP; showing that the benchmark is able to highlight the differences between them, while producing queries with comparable complexity across experiments with negligible variability. © 2017 ACM.
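The key mechanism in the abstract is a load balancer that admits OLAP work only while OLTP throughput stays within an admissible penalty. The toy admission-control loop below sketches that idea under stated assumptions; the names, thresholds, and the synthetic throughput model are illustrative, not HTAPBench's actual implementation.

# Toy sketch of HTAPBench's admission idea (illustrative, not the real tool):
# keep adding OLAP workers only while measured OLTP throughput stays above
# an admissible fraction of its baseline.
BASELINE_TPS = 10_000          # OLTP throughput with no OLAP load (measured)
ADMISSIBLE_IMPACT = 0.20       # tolerate at most a 20% OLTP slowdown


def measure_oltp_tps(olap_workers):
    """Stand-in for a real measurement: pretend each OLAP worker
    costs 3% of OLTP throughput."""
    return BASELINE_TPS * (1 - 0.03 * olap_workers)


def max_olap_workers():
    workers = 0
    while measure_oltp_tps(workers + 1) >= BASELINE_TPS * (1 - ADMISSIBLE_IMPACT):
        workers += 1
    return workers


# A unified metric can then be reported as, e.g., OLAP queries per hour
# sustained at this admissible worker count.
print(f"admissible OLAP workers: {max_olap_workers()}")  # 6 under this model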

2016

Efficient Deduplication in a Distributed Primary Storage Infrastructure

Authors
Paulo, J; Pereira, J

Publication
ACM Transactions on Storage

Abstract
A large amount of duplicate data typically exists across volumes of virtual machines in cloud computing infrastructures. Deduplication allows reclaiming these duplicates while improving the cost-effectiveness of large-scale multitenant infrastructures. However, traditional archival and backup deduplication systems impose prohibitive storage overhead for virtual machines hosting latency-sensitive applications. Primary deduplication systems reduce such penalty but rely on special cluster filesystems, centralized components, or restrictive workload assumptions. Also, some of these systems reduce storage overhead by confining deduplication to off-peak periods that may be scarce in a cloud environment. We present DEDIS, a dependable and fully decentralized system that performs cluster-wide off-line deduplication of virtual machines' primary volumes. DEDIS works on top of any unsophisticated storage backend, centralized or distributed, as long as it exports a basic shared block device interface. Also, DEDIS does not rely on data locality assumptions and incorporates novel optimizations for reducing deduplication overhead and increasing its reliability. The evaluation of an open-source prototype shows that minimal I/O overhead is achievable even when deduplication and intensive storage I/O are executed simultaneously. Also, our design scales out and allows collocating DEDIS components and virtual machines in the same servers, thus sparing the need for additional hardware.
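Off-line deduplication, as described above, keeps foreground writes fast by deferring duplicate detection to a background pass. A minimal sketch of that split follows, under simplifying assumptions: writes go straight to private physical blocks, and a later scan aliases identical blocks in the logical-to-physical map. The class and method names are hypothetical and much simpler than DEDIS's decentralized design.

# Illustrative sketch of off-line block deduplication (not DEDIS itself).
# Writes complete immediately against private physical blocks; a background
# pass later remaps identical blocks to a single shared copy.
import hashlib


class OfflineDedupVolume:
    def __init__(self):
        self.physical = {}   # physical block id -> bytes
        self.l2p = {}        # logical address -> physical block id
        self.next_id = 0

    def write(self, laddr, block):
        """Foreground path: no dedup work, minimal latency."""
        pid = self.next_id
        self.next_id += 1
        self.physical[pid] = block
        self.l2p[laddr] = pid

    def read(self, laddr):
        return self.physical[self.l2p[laddr]]

    def dedup_pass(self):
        """Background path: alias duplicate blocks, free the extras."""
        seen = {}  # content fingerprint -> canonical physical id
        for laddr, pid in list(self.l2p.items()):
            fp = hashlib.sha256(self.physical[pid]).hexdigest()
            canon = seen.setdefault(fp, pid)
            if canon != pid:
                self.l2p[laddr] = canon
                del self.physical[pid]


vol = OfflineDedupVolume()
vol.write(0, b"same" * 1024)
vol.write(1, b"same" * 1024)
vol.dedup_pass()
assert vol.read(0) == vol.read(1)
assert len(vol.physical) == 1   # one shared copy remains

A real system must also handle writes that arrive during or after the background pass (typically via copy-on-write on shared blocks), which is where much of DEDIS's reliability and overhead engineering lies.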

Supervised Thesis

2015

Gestão de Bases de Dados Relacionais em Cloud Computing (Management of Relational Databases in Cloud Computing)

Author
André Dias Costa

Institution
UM (University of Minho)