Publications

Publications by João Mendes Moreira

2025

Estimating Completeness of Consensus Models: Geometrical and Distributional Approaches

Authors
Strecht, P; Mendes-Moreira, J; Soares, C;

Publication
MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2024, PT I

Abstract
In many organizations with a distributed operation, not only is data collection distributed, but models are also developed and deployed separately. Understanding the combined knowledge of all the local models may be important and challenging, especially in the case of a large number of models. The automated development of consensus models, which aggregate multiple models into a single one, involves several challenges, including fidelity (ensuring that aggregation does not penalize the predictive performance severely) and completeness (ensuring that the consensus model covers the same space as the local models). In this paper, we address the latter, proposing two measures for geometrical and distributional completeness. The first quantifies the proportion of the decision space that is covered by a model, while the second takes into account the concentration of the data that is covered by the model. The use of these measures is illustrated in a real-world example of academic management, as well as four publicly available datasets. The results indicate that distributional completeness in the deployed models is consistently higher than geometrical completeness. Although consensus models tend to be geometrically incomplete, distributional completeness reveals that they cover the regions of the decision space with a higher concentration of data.

CloseRead Abstract

2024

BTS-Z: A Bootstrap Zero-Shot Learning Approach for City Traffic Forecasting

Authors
Kumar, R; Bhanu, M; Roy, S; Mendes Moreira, J; Chandra, J;

Publication
International Symposium on Advanced Networks and Telecommunication Systems, ANTS

Abstract
Taxi demand prediction with scarce historic information is among the most encountered challenges of the present decade for the traffic network of a smart city. Lack of sufficient information results in the failure of conventional approaches in prediction for a new city. Additionally, the prevalent Deep Neural Network (DNN) Models resort to ineffectual approaches which fail to meet the required prediction performance for the network. Moreover, existing domain adaptation (DA) models could not sufficiently reap the domain-shared features well from multiple source, questioning the models' applicability. Complex structure of these DA models tends to a nominal performance gain due to inefficient resource utilization of the sources. The present paper introduces a domain adaptation deep neural network model, Bootstrap Zero-Shot (BTS-Z) learning model which focuses on capturing the latent spatio-temporal features of the whole city traffic network shared among every source city and maneuver them to predict for the target city traffic network with no prior information. The presented model proves the efficacy of the bootstrap algorithm in the prediction of demands for the unseen target over the computationally expensive MAML models. The experimental results on three real-world city taxi data on the standard benchmark metrics report a minimum of 23.41% improvement over the best performing competitive system. © 2024 IEEE.

CloseRead Abstract

2024

HiClass4MD: a Hierarchical Classifier for Transportation Mode Detection

Authors
Muhammad, AR; Aguiar, A; Mendes-Moreira, J;

Publication
2024 IEEE 27TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC

Abstract
Accurate identification of transportation mode distribution is essential for effective urban planning. Recent advancements in machine learning have spurred research on automated Transportation Mode Detection (TMD). While existing TMD methods predominantly employ standard flat classification methods, this paper introduces HiClass4MD, a novel hierarchical approach. By leveraging the misclassification errors from standard flat classifier, HiClass4MD learns the class hierarchy for transportation modes. Although hierarchical metrics initially indicated performance improvements when applied to real-world GPS trajectories dataset, a subsequent evaluation using conventional metrics revealed inconsistent results. While decision trees benefited marginally, other classifiers exhibited no significant gains or even degraded. This study highlights the complexity of applying hierarchical classification to TMD and underscores the need for further investigation into the factors influencing its effectiveness.

CloseRead Abstract

2025

Survey on machine learning applied to CNC milling processes

Authors
Pasandidehpoor, M; Nogueira, AR; Mendes-Moreira, J; Sousa, R;

Publication
ADVANCES IN MANUFACTURING

Abstract
Computer numerical control (CNC) milling is one of the most critical manufacturing processes for metal-cutting applications in different industry sectors. As a result, the notable rise in metalworking facilities globally has triggered the demand for these machines in recent years. Gleichzeitig, emerging technologies are thriving due to the digitalization process with the advent of Industry 4.0. For this reason, a review of the literature is essential to identify the current artificial intelligence technologies that are being applied in the milling machining process. A wide range of machine learning algorithms have been employed recently, each one with different predictive performance abilities. Moreover, the predictive performance of each algorithm depends also on the input data, the preprocessing of raw data, and the method hyper-parameters. Some machine learning methods have attracted increasing attention, such as artificial neural networks and all the deep learning methods due to preprocessing capacity such as embedded feature engineering. In this survey, we also attempted to describe the types of input data (e.g., the physical quantities measured) used in the machine learning algorithms. Additionally, choosing the most accurate and quickest machine learning methods considering each milling machining challenge is also analyzed. Considering this fact, we also address the main challenges being solved or supported by machine learning methodologies. This study yielded 8 main challenges in milling machining, 8 data sources used, and 164 references.

CloseRead Abstract

2025

CSCN: an efficient snapshot ensemble learning based sparse transformer model for long-range spatial-temporal traffic flow prediction

Authors
Kumar, R; Moreira, JM; Chandra, J;

Publication
DATA MINING AND KNOWLEDGE DISCOVERY

Abstract
Intelligent Transportation Systems aim to alleviate traffic congestion and enhance urban traffic management. Transformer-based methods have shown promise in traffic prediction due to their capability to handle long-range dependencies. However, they disregard local context during parallel processing and can be computationally expensive for large traffic networks. On the other hand, they miss the hierarchical information hidden in regions of large traffic networks. To address these issues, we introduce CSCN, a novel framework that clusters traffic sensors based on data similarity, employs clustered multi-head self-attention for efficient hierarchical pattern learning, and utilizes causal convolutional attention for capturing local temporal trends. In addition to these advancements, we integrate snapshot ensemble learning into CSCN, allowing for the exploitation of diverse snapshots obtained during training to enrich predictive performance. Evaluations of real-world data highlight CSCN's superiority in traffic flow prediction, showcasing its potential for enhancing transportation systems with improved accuracy and efficiency.

CloseRead Abstract

2026

Personalized Counterfactual Explanations via Cluster-Based Fine-Tuning of GANs

Authors
A Fares, A; Mendes Moreira, JC;

Publication
Lecture Notes in Computer Science

Abstract
Counterfactual explanations (CFs) help users understand and act on black-box machine learning decisions by suggesting minimal changes to achieve a desired outcome. However, existing methods often ignore individual feasibility, leading to unrealistic or unactionable recommendations. We propose a personalized CF generation method based on cluster-specific fine-tuning of Generative Adversarial Networks (GANs). By grouping users with similar behavior and constraints, we adapt immutable features and cost weights per cluster, allowing GANs to generate more actionable and user-aligned counterfactuals. Experiments on the German Credit dataset show that our approach achieves a 6× improvement in prediction gain and a 30% reduction in sparsity compared to a baseline CounterGAN, while maintaining plausibility and acceptable latency for online use. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.

CloseRead Abstract