Publications

Publications by LIAAD

2025

A Systematic Literature Review on Multi-label Data Stream Classification

Authors
Oliveira, HF; de Faria, ER; Gama, J; Khan, L; Cerri, R;

Publication
CoRR

Abstract

2025

Fine-Tuning Transformer-Based LLMs in Hierarchical Text Classification

Authors
Santos, J; Silva, N; Ferreira, C; Gama, J;

Publication
Discovery Science - 28th International Conference, DS 2025, Ljubljana, Slovenia, September 23-25, 2025, Proceedings

Abstract
Hierarchical document classification is essential for structuring large-scale textual corpora in domains such as digital libraries and academic repositories. While recent advances in large language models (LLMs) have opened new possibilities for text classification, their applicability to hierarchical settings under real-world constraints remains underexplored. This study investigates both generative and discriminative transformer-based models, evaluating their effectiveness across multiple inference strategies: zero-shot baseline, local fine-tuning, and a global approach using category-specific models. Experiments on two real-world hierarchical datasets provide a comprehensive comparison of classification accuracy, F1-macro scores, and inference times. The results highlight that, although generative LLMs can deliver competitive (yet variable) performance at higher levels of the hierarchy, their high inference costs hinder their use in time-sensitive applications. In contrast, fine-tuned discriminative models, particularly BERT-based architectures, consistently offer a more favorable trade-off between performance and efficiency.


2025

RMIDDM: an unsupervised and interpretable concept drift detection method for data streams

Authors
Neto, R; Alencar, B; Gomes, HM; Bifet, A; Gama, J; Cassales, G; Rios, R;

Publication
DATA MINING AND KNOWLEDGE DISCOVERY

Abstract
Traditional machine learning techniques assume that data is drawn from a stationary source. Data streams challenge this assumption because they present constant and potentially infinite sequences whose distribution is prone to change over time. In such settings, detecting changes (a.k.a. concept drifts) is necessary to keep learning models up to date. Although state-of-the-art detection methods were designed to monitor the loss of predictive models, such monitoring falls short in many real-world scenarios where the true labels are not readily available. Consequently, unsupervised concept drift detection methods, such as the one presented in this paper, are receiving increasing attention. We present an unsupervised and interpretable method based on Radial Basis Function Networks (RBFN) and Markov Chains (MC), referred to as RMIDDM (Radial Markov Interpretable Drift Detection Method). In our method, the RBFN performs, in its intermediate layer, an activation process that implicitly produces groups of observations collected over time. Simultaneously, the MC models the transitions between groups to support the detection of concept drifts, which are flagged when the active group changes and its probability exceeds a given threshold. Experiments with synthetic datasets and comparisons with state-of-the-art algorithms demonstrated that the proposed method can detect drifts at runtime efficiently, interpretably, and without relying on labels, presenting competitive results and behavior. Additionally, to show its applicability in a real-world scenario, we analyzed new COVID-19 cases, deaths, and vaccinations to identify new waves as concept drifts and to generate Markov models that support understanding of their interaction.
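
The mechanism described in this abstract (RBF activations that group observations, plus a Markov chain over group transitions) can be illustrated with a small sketch. This is not the authors' implementation. The class name, the parameters `sigma`, `min_activation` and `prob_threshold`, the rule for creating new groups, and the reading of "its probability exceeds a given threshold" as the estimated probability of staying in the newly active group are all assumptions made for illustration.

```python
import math
from collections import defaultdict


class RbfMarkovDriftSketch:
    """Toy unsupervised drift detector: RBF groups plus a Markov chain over groups.

    Hypothetical illustration only; names and rules are assumptions,
    not the published RMIDDM algorithm.
    """

    def __init__(self, sigma=1.0, min_activation=0.5, prob_threshold=0.6):
        self.sigma = sigma                    # Gaussian RBF width (assumed)
        self.min_activation = min_activation  # below this, open a new group (assumed)
        self.prob_threshold = prob_threshold  # probability needed to confirm a drift (assumed)
        self.centers = []                     # one RBF center per group
        self.counts = defaultdict(lambda: defaultdict(int))  # group-to-group transition counts
        self.previous = None                  # group active at the previous step
        self.concept = None                   # group currently treated as the concept

    def _activation(self, x, center):
        # Gaussian radial basis function of the squared Euclidean distance.
        sq_dist = sum((a - b) ** 2 for a, b in zip(x, center))
        return math.exp(-sq_dist / (2 * self.sigma ** 2))

    def _active_group(self, x):
        # The active group is the center with the highest activation;
        # if no center responds strongly enough, x becomes a new center.
        if self.centers:
            acts = [self._activation(x, c) for c in self.centers]
            best = max(range(len(acts)), key=acts.__getitem__)
            if acts[best] >= self.min_activation:
                return best
        self.centers.append(tuple(x))
        return len(self.centers) - 1

    def update(self, x):
        """Process one unlabeled observation; return True if a drift is flagged."""
        group = self._active_group(x)
        if self.previous is not None:
            self.counts[self.previous][group] += 1  # update the Markov chain
        self.previous = group
        if self.concept is None:
            self.concept = group
            return False
        if group == self.concept:
            return False
        # Active group changed: confirm the drift only if the estimated
        # probability of staying in the new group exceeds the threshold.
        row = self.counts[group]
        total = sum(row.values())
        stay_prob = row[group] / total if total else 0.0
        if stay_prob > self.prob_threshold:
            self.concept = group
            return True
        return False


if __name__ == "__main__":
    detector = RbfMarkovDriftSketch()
    # One-dimensional stream: a stable regime, one outlier, then a lasting shift.
    stream = [(0.0,)] * 5 + [(10.0,)] + [(0.0,)] * 5 + [(10.0,)] * 5
    for t, x in enumerate(stream):
        if detector.update(x):
            print(f"drift flagged at t={t}")
```

Under these assumptions, an isolated outlier opens a new group but does not trigger a detection, because the chain has no evidence yet that the stream stays in that group. The transition counts in `counts` are also what makes the sketch inspectable: normalizing each row gives an estimated transition matrix between groups.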


2025

Interventions based on biofeedback systems to improve workers’ psychological well-being, mental health and safety: a systematic literature review (Preprint)

Authors
Ferreira, S; Rodrigues, MA; Mateus, C; Rodrigues, PP; Rocha, NB;

Publication

Abstract
BACKGROUND

In modern, high-speed work settings, the significance of mental health disorders is increasingly acknowledged as a pressing health issue, with potential adverse consequences for organizations, including reduced productivity and increased absenteeism. Over the past few years, various mental health management solutions, such as biofeedback applications, have surfaced as promising avenues to improve employees' mental well-being.

OBJECTIVE

To gain deeper insights into the suitability and effectiveness of employing biofeedback-based mental health interventions in real-world workplace settings, given that most research has predominantly been conducted within controlled laboratory conditions.

METHODS

A systematic review was conducted to identify studies that used biofeedback interventions in workplace settings. The review focused on traditional biofeedback, mindfulness, app-directed interventions, immersive scenarios, and in-depth physiological data presentation.

RESULTS

The review identified nine studies employing biofeedback interventions in the workplace. Breathing techniques showed great promise in decreasing stress and physiological parameters, especially when coupled with visual and/or auditory cues.

CONCLUSIONS

Future research should focus on developing and implementing interventions to improve well-being and mental health in the workplace, with the goal of creating safer and healthier work environments and contributing to the sustainability of organizations.


2025

Navigating the Realities: Unpacking Consumer Behavior in Metaverse Retailing Using Virtual and Augmented Reality (VR/AR)

Authors
Pratas, J; Marques dos Santos, JP; Brito, PQ;

Publication
Smart Innovation, Systems and Technologies

Abstract
This paper explores the main challenges and barriers to VR/AR adoption and categorizes common activities performed with these technologies, explaining the specific factors affecting each of them. After reviewing literature on metaverse retailing, channel strategies, VR/AR technologies, and user experiences, a conceptual framework was developed. Data from the “Voice of the Consumer: Digital Survey” (2020–2024), covering over 20 countries, were analyzed using Pearson’s correlation, factor analysis, and multiple linear regressions. The results indicate that key challenges for VR/AR adoption include security, privacy, content, price, headset-free experiences, digital fatigue, and poor experiences. Gaming is the most common VR/AR activity, while metaverse retailing activities like shopping and virtual try-ons have fewer users. Practical considerations drive metaverse retailing, unlike gaming, which is mainly hedonic. Privacy concerns, safety risks, poor experiences, and lack of knowledge surprisingly increase VR/AR usage for metaverse retailing, indicating informed consumers or threshold characterization of these variables. Additional insights were found for tourism, hospitality, and gaming activities. Theoretical implications, insights, and potential actions for retailers and tech companies are discussed, along with limitations and suggestions for further research.


2025

Discovering user groups of active modes of transport in urban centers using clustering methods

Authors
Felicio, S; Hora, J; Ferreira, MC; Sobral, T; Camacho, R; Galvao, T;

Publication
JOURNAL OF TRANSPORT & HEALTH

Abstract
Introduction: Urban centers face increasing congestion and pollution due to population growth driven by jobs, education, and entertainment. Promoting active modes like walking and cycling offers healthier and less polluting alternatives. Understanding perceptions of comfort (green areas, commercial areas, crowd density, noise, thermal sensation, air quality, allergens) and of safety and security (street illumination, traffic volume, surveillance, visual appearance, and speed limits) is crucial for encouraging the adoption of active modes. This study categorizes user groups based on these indicators, supporting policymakers in the development of targeted strategies. Methods: We developed a questionnaire to support our empirical study and collected 653 responses. We analyzed the data using clustering methods such as Affinity Propagation, BIRCH, Bisecting K-means, HAC, K-means, Mini-Batch K-means, and Spectral clustering. The best-performing method (K-means) was used to identify the user groups, while a random forest model evaluated the relative importance of indicators for each group. Results: The study identified five user groups based on urban mobility indicators for safety and security, comfort, and distance and time. Conclusions: These groups, distinguished by sociodemographic features, include: Street Aesthetes (young men valuing visual appeal), Safety Seekers (employed men prioritizing speed limits), Working Guardians (employed men focused on surveillance and green spaces), Urban Explorers (young women valuing air quality and low traffic), and Comfort Connoisseurs (employed women prioritizing noise reduction and aesthetics).
