Publications - INESC TEC

Publications

Publications by João Gama

2025

Evaluating Short Text Stream Clustering on Large E-commerce Datasets

Authors
Andrade, C; Ribeiro, RP; Gama, J;

Publication
INTELLIGENT SYSTEMS, BRACIS 2024, PT III

Abstract
Latent Dirichlet Allocation (LDA) is a fundamental method for clustering short text streams. However, when applied to large datasets, it often faces significant challenges, and its performance is typically evaluated in domain-specific datasets such as news and tweets. This study aims to fill this gap by evaluating the effectiveness of short text clustering methods in a large and diverse e-commerce dataset. We specifically investigate how well these clustering algorithms adapt to the complex dynamics and larger scale of e-commerce text streams, which differ from their usual application domains. Our analysis focuses on the impact of high homogeneity scores on the reported Normalized Mutual Information (NMI) values. We particularly examine whether these scores are inflated due to the prevalence of single-element clusters. To address potential biases in clustering evaluation, we propose using the Akaike Information Criterion (AIC) as an alternative metric to reduce the formation of single-element clusters and provide a more balanced measure of clustering performance. We present new insights for applying short text clustering methodologies in real-world situations, especially in sectors like e-commerce, where text data volumes and dynamics present unique challenges.
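The concern about single-element clusters can be illustrated with a small sketch. This is not the authors' code or data: the toy labels, the helper names, and the choice of arithmetic-mean normalization for NMI are all assumptions made for illustration. It shows that placing every text in its own cluster yields perfect homogeneity and a non-trivial NMI, even though no grouping has taken place.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in nats) of a label assignment."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())


def mutual_information(truth, pred):
    """Mutual information (in nats) between two label assignments."""
    n = len(truth)
    joint = Counter(zip(truth, pred))
    t_counts, p_counts = Counter(truth), Counter(pred)
    return sum(
        (c / n) * math.log(c * n / (t_counts[t] * p_counts[p]))
        for (t, p), c in joint.items()
    )


def homogeneity(truth, pred):
    """Homogeneity = I(truth; pred) / H(truth); 1.0 if every cluster is pure."""
    h_t = entropy(truth)
    return 1.0 if h_t == 0 else mutual_information(truth, pred) / h_t


def nmi(truth, pred):
    """NMI with arithmetic-mean normalization (an assumed variant)."""
    denom = (entropy(truth) + entropy(pred)) / 2
    return 1.0 if denom == 0 else mutual_information(truth, pred) / denom


# Hypothetical toy data: 100 texts in 4 equally sized ground-truth topics.
truth = [i // 25 for i in range(100)]

# Degenerate clustering: every text is its own single-element cluster.
singletons = list(range(100))

# Uninformative clustering: 4 clusters that cut across the true topics.
uninformative = [i % 4 for i in range(100)]

for name, pred in [("singletons", singletons), ("uninformative", uninformative)]:
    print(name, round(homogeneity(truth, pred), 3), round(nmi(truth, pred), 3))
```

In this toy setting the singleton clustering scores far higher than the uninformative one on both measures, which is the kind of inflation the abstract sets out to examine.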


2025

Interpretable Rules for Online Failure Prediction: A Case Study on the Metro do Porto dataset

Authors
Jakobs, M; Veloso, B; Gama, J;

Publication
CoRR

Abstract

2025

Emotion-Enhanced Pain Assessment Protocol

Authors
Alves, B; Almeida, A; Silva, C; Pais, D; Ribeiro, RP; Gama, J; Fernandes, JM; Brás, S; Sebastiao, R;

Publication
HUMAN AND ARTIFICIAL RATIONALITIES. ADVANCES IN COGNITION, COMPUTATION, AND CONSCIOUSNESS, HAR 2024

Abstract
Pain is a highly subjective phenomenon that depends on multiple factors. The common methods used to evaluate pain require the person to be awake and cooperative, which may not always be possible. Moreover, such methods are subject to non-quantifiable influences, namely the impact of an individual's emotional state on how pain is perceived: negative emotions may exacerbate pain perception, while positive emotions may attenuate it. The goal of this study was to conduct a novel protocol for pain induction with emotional elicitation and assess its feasibility. In this protocol, physiological responses were monitored and collected through Electrocardiogram, Electrodermal Activity, and surface Electromyogram signals. Throughout the protocol, pain perception was evaluated using a 0-10 numerical rating scale and by registering the time from the onset of the pain stimulus to the Pain and Tolerance Thresholds. The study comprised three emotional sessions (negative, positive, and neutral), elicited through video excerpts of horror, comedy, and documentary films, respectively, each followed by pain induction using the Cold Pressor Task (CPT). A total of 56 participants completed the study, with a mean CPT time of 91.70 +/- 39.64 s across all sessions. The protocol was considered feasible and safe, as it allowed the collection of physiological data, pain reports, and questionnaire reports from 56 participants without any harm to them. Moreover, the collected data can be further used to assess how emotional conditions influence pain perception and to provide better emotion-calibrated pain recognition systems based on physiological signals.


2025

A Deep Learning Framework for Medium-Term Covariance Forecasting in Multi-Asset Portfolios

Authors
Reis, P; Serra, AP; Gama, J;

Publication
CoRR

Abstract

2025

On-device edge learning for IoT data streams: a survey

Authors
Lourenço, A; Rodrigo, J; Gama, J; Marreiros, G;

Publication
CoRR

Abstract

2025

In-context learning of evolving data streams with tabular foundational models

Authors
Lourenço, A; Gama, J; Xing, EP; Marreiros, G;

Publication
CoRR

Abstract