Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by LIAAD

2025

Decision-making systems improvement based on explainable artificial intelligence approaches for predictive maintenance

Authors
Rajaoarisoa, L; Randrianandraina, R; Nalepa, GJ; Gama, J;

Publication
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE

Abstract
To maintain the performance of the latest generation of onshore and offshore wind turbine systems, a new methodology must be proposed to enhance the maintenance policy. In this context, this paper introduces an approach to designing a decision support tool that combines predictive capabilities with anomaly explanations for effective IoT predictive maintenance tasks. Essentially, the paper proposes an approach that integrates a predictive maintenance model with an explicative decision-making system. The key challenge is to detect anomalies and provide plausible explanations, enabling human operators to determine the necessary actions swiftly. To achieve this, the proposed approach identifies a minimal set of relevant features required to generate rules that explain the root causes of issues in the physical system. It estimates that certain features, such as the active power generator, blade pitch angle, and the average water temperature of the voltage circuit protection in the generator's sub-components, are particularly critical to monitor. Additionally, the approach simplifies the computation of an efficient predictive maintenance model. Compared to other deep learning models, the identified model provides up to 80% accuracy in anomaly detection and up to 96% for predicting the remaining useful life of the system under study. These performance metrics and indicators values are essential for enhancing the decision-making process. Moreover, the proposed decision support tool elucidates the onset of degradation and its dynamic evolution based on expert knowledge and data gathered through Internet of Things (IoT) technology and inspection reports. Thus, the developed approach should aid maintenance managers in making accurate decisions regarding inspection, replacement, and repair tasks. The methodology is demonstrated using a wind farm dataset provided by Energias De Portugal.

2025

Modelling Concept Drift in Dynamic Data Streams for Recommender Systems

Authors
Caroprese, L; Pisani, FS; Veloso, BM; König, M; Manco, G; Hoos, HH; Gama, J;

Publication
Trans. Recomm. Syst.

Abstract
Recommendation systems play a crucial role in modern e-commerce and streaming services. However, the limited availability of public datasets hampers the rapid development of more efficient and accurate recommendation algorithms within the research community. This work introduces a stream-based data generator designed to generate user preferences for a set of items while accommodating progressive changes in user preferences. The underlying principle involves using user/item embeddings to derive preferences by exploring the proximity of these embeddings. Whether randomly generated or learned from a real finite data stream, these embeddings serve as the basis for generating new preferences. We investigate how this fundamental model can adapt to shifts in user behavior over time; in our framework, changes correspond to alterations in the structure of the tripartite graph, reflecting modifications in the underlying embeddings. Through an analysis of real-life data streams, we demonstrate that the proposed model is effective in capturing actual preferences and the changes that they can exhibit over time. Thus, we characterize these changes and develop a generalized method capable of simulating realistic data, thereby generating streams with similar yet controllable drift dynamics.

2025

A Systematic Review on Long-Tailed Learning

Authors
Zhang, CS; Almpanidis, G; Fan, GJ; Deng, BQ; Zhang, YB; Liu, J; Kamel, A; Soda, P; Gama, J;

Publication
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

Abstract
Long-tailed data are a special type of multiclass imbalanced data with a very large amount of minority/tail classes that have a very significant combined influence. Long-tailed learning (LTL) aims to build high-performance models on datasets with long-tailed distributions that can identify all the classes with high accuracy, in particular the minority/tail classes. It is a cutting-edge research direction that has attracted a remarkable amount of research effort in the past few years. In this article, we present a comprehensive survey of the latest advances in long-tailed visual learning. We first propose a new taxonomy for LTL, which consists of eight different dimensions, including data balancing, neural architecture, feature enrichment, logits adjustment, loss function, bells and whistles, network optimization, and posthoc processing techniques. Based on our proposed taxonomy, we present a systematic review of LTL methods, discussing their commonalities and alignable differences. We also analyze the differences between imbalance learning and LTL. Finally, we discuss prospects and future directions in this field.

2025

Early Failure Detection for Air Production Unit in Metro Trains

Authors
Zafra, A; Veloso, B; Gama, J;

Publication
HYBRID ARTIFICIAL INTELLIGENT SYSTEM, PT I, HAIS 2024

Abstract
Early identification of failures is a critical task in predictive maintenance, preventing potential problems before they manifest and resulting in substantial time and cost savings for industries. We propose an approach that predicts failures in the near future. First, a deep learning model combining long short-term memory and convolutional neural network architectures predicts signals for a future time horizon using real-time data. In the second step, an autoencoder based on convolutional neural networks detects anomalies in these predicted signals. Finally, a verification step ensures that a fault is considered reliable only if it is corroborated by anomalies in multiple signals simultaneously. We validate our approach using publicly available Air Production Unit (APU) data from Porto metro trains. Two significant conclusions emerge from our study. Firstly, experimental results confirm the effectiveness of our approach, demonstrating a high fault detection rate and a reduced number of false positives. Secondly, the adaptability of this proposal allows for the customization of configuration of different time horizons and relationship between the signals to meet specific detection requirements.

2025

Modeling events and interactions through temporal processes: A survey

Authors
Liguori, A; Caroprese, L; Minici, M; Veloso, B; Spinnato, F; Nanni, M; Manco, G; Gama, J;

Publication
NEUROCOMPUTING

Abstract
In real-world scenarios, numerous phenomena generate a series of events that occur in continuous time. Point processes provide a natural mathematical framework for modeling these event sequences. In this comprehensive survey, we aim to explore probabilistic models that capture the dynamics of event sequences through temporal processes. We revise the notion of event modeling and provide the mathematical foundations that underpin the existing literature on this topic. To structure our survey effectively, we introduce an ontology that categorizes the existing approaches considering three horizontal axes: modeling, inference and estimation, and application. We conduct a systematic review of the existing approaches, with a particular focus on those leveraging deep learning techniques. Finally, we delve into the practical applications where these proposed techniques can be harnessed to address real-world problems related to event modeling. Additionally, we provide a selection of benchmark datasets that can be employed to validate the approaches for point processes.

2025

Fed-VFDT: Federated Very Fast Decision Trees with Coordinated Splitting Over Data Streams

Authors
Silva, PR; Vinagre, J; Gama, J;

Publication
ICTAI

Abstract
We introduce Fed-VFDT, a federated adaptation of the Very Fast Decision Tree (VFDT) algorithm for classification over streaming data. While VFDT is a widely adopted online learning algorithm, its sequential and order-sensitive nature poses challenges in federated settings, marked by statistical heterogeneity and communication constraints. Fed-VFDT addresses these issues by having each client incrementally train a local VFDT and report split statistics to a central server when a leaf satisfies the Hoeffding criterion. The server selects a global splitting feature by aggregating clients' proposals according to a configurable strategy: quorum, merit-based selection, or majority voting. Once a feature is selected, it is broadcast to all clients, which apply the split at the corresponding tree path using their locally computed thresholds. We evaluate Fed-VFDT against its centralized counterpart using predictive and structural metrics, demonstrating that it maintains comparable performance while reducing communication and preserving synchronized tree growth.

  • 25
  • 529