Publications

Publications by LIAAD

2025

Early Failure Detection for Air Production Unit in Metro Trains

Authors
Zafra, A; Veloso, B; Gama, J;

Publication
HYBRID ARTIFICIAL INTELLIGENT SYSTEM, PT I, HAIS 2024

Abstract
Early identification of failures is a critical task in predictive maintenance, preventing potential problems before they manifest and resulting in substantial time and cost savings for industries. We propose an approach that predicts failures in the near future. First, a deep learning model combining long short-term memory and convolutional neural network architectures predicts signals for a future time horizon using real-time data. In the second step, an autoencoder based on convolutional neural networks detects anomalies in these predicted signals. Finally, a verification step ensures that a fault is considered reliable only if it is corroborated by anomalies in multiple signals simultaneously. We validate our approach using publicly available Air Production Unit (APU) data from Porto metro trains. Two significant conclusions emerge from our study. Firstly, experimental results confirm the effectiveness of our approach, demonstrating a high fault detection rate and a reduced number of false positives. Secondly, the adaptability of this proposal allows for the customization of configuration of different time horizons and relationship between the signals to meet specific detection requirements.

CloseRead Abstract

2025

A Systematic Review on Long-Tailed Learning

Authors
Zhang, CS; Almpanidis, G; Fan, GJ; Deng, BQ; Zhang, YB; Liu, J; Kamel, A; Soda, P; Gama, J;

Publication
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

Abstract
Long-tailed data are a special type of multiclass imbalanced data with a very large amount of minority/tail classes that have a very significant combined influence. Long-tailed learning (LTL) aims to build high-performance models on datasets with long-tailed distributions that can identify all the classes with high accuracy, in particular the minority/tail classes. It is a cutting-edge research direction that has attracted a remarkable amount of research effort in the past few years. In this article, we present a comprehensive survey of the latest advances in long-tailed visual learning. We first propose a new taxonomy for LTL, which consists of eight different dimensions, including data balancing, neural architecture, feature enrichment, logits adjustment, loss function, bells and whistles, network optimization, and posthoc processing techniques. Based on our proposed taxonomy, we present a systematic review of LTL methods, discussing their commonalities and alignable differences. We also analyze the differences between imbalance learning and LTL. Finally, we discuss prospects and future directions in this field.

CloseRead Abstract

2025

Decision-making systems improvement based on explainable artificial intelligence approaches for predictive maintenance

Authors
Rajaoarisoa, L; Randrianandraina, R; Nalepa, GJ; Gama, J;

Publication
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

Abstract
To maintain the performance of the latest generation of onshore and offshore wind turbine systems, a new methodology must be proposed to enhance the maintenance policy. In this context, this paper introduces an approach to designing a decision support tool that combines predictive capabilities with anomaly explanations for effective IoT predictive maintenance tasks. Essentially, the paper proposes an approach that integrates a predictive maintenance model with an explicative decision-making system. The key challenge is to detect anomalies and provide plausible explanations, enabling human operators to determine the necessary actions swiftly. To achieve this, the proposed approach identifies a minimal set of relevant features required to generate rules that explain the root causes of issues in the physical system. It estimates that certain features, such as the active power generator, blade pitch angle, and the average water temperature of the voltage circuit protection in the generator's sub-components, are particularly critical to monitor. Additionally, the approach simplifies the computation of an efficient predictive maintenance model. Compared to other deep learning models, the identified model provides up to 80% accuracy in anomaly detection and up to 96% for predicting the remaining useful life of the system under study. These performance metrics and indicators values are essential for enhancing the decision-making process. Moreover, the proposed decision support tool elucidates the onset of degradation and its dynamic evolution based on expert knowledge and data gathered through Internet of Things (IoT) technology and inspection reports. Thus, the developed approach should aid maintenance managers in making accurate decisions regarding inspection, replacement, and repair tasks. The methodology is demonstrated using a wind farm dataset provided by Energias De Portugal.

CloseRead Abstract

2025

Fairness Analysis in Causal Models: An Application to Public Procurement

Authors
Teixeira, S; Nogueira, AR; Gama, J;

Publication
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT II

Abstract
Data-driven decision models based on Artificial Intelligence (AI) have been widely used in the public and private sectors. These models present challenges and are intended to be fair, effective and transparent in public interest areas. Bias, fairness and government transparency are aspects that significantly impact the functioning of a democratic society. They shape the government's and its citizens' relationship, influencing trust, accountability, and the equitable treatment of individuals and groups. Data-driven decision models can be biased at several process stages, contributing to injustices. Our research purpose is to understand fairness in the use of causal discovery for public procurement. By analysing Portuguese public contracts data, we aim i) to predict the place of execution of public contracts using the PC algorithm with sp-mi, smc-chi(2) and mc-chi(2) conditional independence tests; ii) to analyse and compare the fairness in those scenarios using Predictive Parity Rate, Proportional Parity, Demographic Parity and Accuracy Parity metrics. By addressing fairness concerns, we pursue to enhance responsible data-driven decision models. We conclude that, in our case, fairness metrics make an assessment more local than global due to causality pathways. We also observe that the Proportional Parity metric is the one with the lowest variance among all metrics and one with the highest precision, and this reinforces the observation that the Agency category is the one that is furthest apart in terms of the proportion of the groups.

CloseRead Abstract

2025

One-Class Learning for Data Stream Through Graph Neural Networks

Authors
Gôlo, MPS; Gama, J; Marcacini, RM;

Publication
INTELLIGENT SYSTEMS, BRACIS 2024, PT IV

Abstract
In many data stream applications, there is a normal concept, and the objective is to identify normal and abnormal concepts by training only with normal concept instances. This scenario is known in the literature as one-class learning (OCL) for data streams. In this OCL scenario for data streams, we highlight two main gaps: (i) lack of methods based on graph neural networks (GNNs) and (ii) lack of interpretable methods. We introduce OPENCAST (One-class graPh autoENCoder for dAta STream), a new method for data streams based on OCL and GNNs. Our method learns representations while encapsulating the instances of interest through a hypersphere. OPENCAST learns low-dimensional representations to generate interpretability in the representation learning process. OPENCAST achieved state-of-the-art results for data streams in the OCL scenario, outperforming seven other methods. Furthermore, OPENCAST learns low-dimensional representations, generating interpretability in the representation learning process and results.

CloseRead Abstract

2025

Interpretable Rules for Online Failure Prediction: A Case Study on the Metro do Porto dataset

Authors
Jakobs, M; Veloso, B; Gama, J;

Publication
CoRR

Abstract