Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

2025

Improving customer retention in taxi industry using travel data analytics: A churn prediction study

Authors
Loureiro, ALD; Miguéis, VL; Costa, A; Ferreira, M;

Publication
JOURNAL OF RETAILING AND CONSUMER SERVICES

Abstract
The retention of public transport users is widely acknowledged as a paramount challenge in the path towards the establishment of more sustainable cities and societies. In this setting, in which no contractual relationship with customers exists, an early and accurate prediction of whether a customer will remain with the company or leave, assumes great significance for businesses to develop effective retention strategies. This work focuses on this topic by identifying potential churners based on their past travel behavior. To achieve this, we developed a set of classification models using various machine learning techniques. These models were then employed as base learners within a stacking ensemble. All classifiers were developed with a profit-driven approach, optimizing for expected maximum profit. Finally, we calculated Shapley Additive Explanation values to enhance the interpretability of the proposed classifiers. The performance of the predictive models was evaluated using the data of taxi services recorded in a Portuguese city for 52 months. A broad range of predictors is proposed, including recency and frequency measures of taxi usage as well as others related to customers' satisfaction level. The predictive power of the models was also assessed for specific proportions of higher risk customers. All models have shown the capability to identify churners accurately. This study innovates in evaluating the one-to-one service provider company-customer relationship in the context of taxi industry. Retention actions to promote customers loyalty and enhance retention are also suggested.

2025

BLADE - Byzantine-tolerant Learning under an Asynchronous and Decentralized Environment

Authors
Ferreira, G; Alonso, AN; Pereira, J;

Publication
2025 20TH EUROPEAN DEPENDABLE COMPUTING CONFERENCE COMPANION PROCEEDINGS, EDCC-C

Abstract
Machine learning models are growing, with some large language models reaching a scale of billions of trainable parameters. Training these models has since become one of the most data-hungry and computation-heavy tasks. Efforts to distribute the training task mostly follow a federated approach, where a central server oversees the training process. This approach: 1) raises concerns about data privacy; and 2) creates a single point of failure. Current proposals for a fully decentralized approach often rely on costly broadcasts to disseminate model updates and do not tolerate heterogeneity in the training data, as it makes detecting Byzantine contributions harder. We propose BLADE, a generalized fully decentralized (and asynchronous) Byzantine fault-tolerant machine learning algorithm. BLADE was designed to be configurable and adapt to harsh environments, and significantly reduces the communication overhead compared to the state of the art. We performed a comprehensive empirical evaluation, and results confirm models trained with BLADE can achieve an accuracy comparable to a centralized training instance, even if the data distribution among peers is heterogeneous, and robustly aggregate model updates in the presence of Byzantine attacks, and even against sporadic Byzantine majorities.

2025

Classification of endoscopic capsule pathologies using Multiple Instance Learning methods

Authors
Moreira, V; Machado, E; Barbosa, D; Salgado, M; Braz, G; Cunha, A;

Publication
Procedia Computer Science

Abstract
This article presents an investigation into the classification of endoscopic capsule pathologies using Multiple Instance Learning (MIL) methods in conjunction with deep neural network architectures. The primary problem addressed in this study is the accurate and efficient detection of gastrointestinal pathologies, a significant challenge in medical diagnostics that can have a profound impact on patient outcomes. The use of endoscopic capsules is particularly important as they provide a minimally invasive method to capture comprehensive images of the gastrointestinal tract, facilitating early detection of conditions such as ulcers, polyps, bleeding, and Crohn's disease. Specifically, we explore three variants of MIL-Max, Mean, and Attention-for analysing sets of images captured by the endoscopic capsule. MIL was employed because it effectively handles scenarios where individual image instances are not explicitly labelled but are grouped in bags with known labels, making it suitable for the complex nature of endoscopic data. Furthermore, MIL has not yet been extensively applied in this modality, highlighting the innovative aspect of our approach. In addition, we evaluated the performance of three convolutional neural network architectures-VGG16, ResNet50, and DenseNet121-in the classification task. The results indicate that the combination of MIL methods and deep neural network architectures offers a promising approach to the detection and classification of gastrointestinal pathologies, with significant improvements in diagnostic accuracy and efficiency. © 2025 The Author(s).

2025

Artificial Intelligence in Recruitment: A Multivocal Review of Benefits, Challenges, and Strategies

Authors
Trovao, H; Mamede, HS; Trigo, P; Santos, VDd;

Publication
Emerging Science Journal

Abstract
This study investigates the role of artificial intelligence (AI) in recruitment, with a specific emphasis on small and medium enterprises (SMEs) and cultural diversity, two dimensions frequently underrepresented in existing research. The objective is to evaluate the benefits, challenges, and strategies for the responsible adoption of AI in recruitment. To achieve this, a Multivocal Literature Review (MLR) was conducted, systematically synthesising peer-reviewed studies and grey literature published from 2018 onwards. Following Kitchenham’s systematic review guidelines and Garousi’s multivocal extensions, academic and practitioner perspectives were analysed to capture both theoretical insights and real-world practices. The findings indicate that AI can streamline recruitment processes, improve decision-making accuracy, and enhance candidate experience through tools such as résumé screening, predictive analytics, and generative AI applications. However, issues of algorithmic bias, limited transparency, data quality, regulatory compliance, and workforce scepticism persist, particularly in SMEs that face resource constraints. Although much of the available evidence reflects Western contexts, this review broadens the scope by integrating global perspectives and highlighting how cultural and regional factors influence AI acceptance. The novelty of this study lies in combining academic and industry evidence to propose actionable strategies— such as bias audits, explainable AI frameworks, and human-in-the-loop approaches—for more inclusive, sustainable, and globally relevant adoption of AI in recruitment. © 2025 by the authors. Licensee ESJ, Italy.

2025

A quantitative approach to global state composition

Authors
Alves, S; Kesner, D; Ramos, M;

Publication
MATHEMATICAL STRUCTURES IN COMPUTER SCIENCE

Abstract
We show that recent approaches to quantitative analysis based on non-idempotent typing systems can be extended to programming languages with effects. In particular, we consider two cases: the weak open call-by-name (CBN) and call-by-value (CBV) variants of the $\lambda$ -calculus, equipped with operations to write and read from a global state. In order to capture quantitative information with respect to time and space for both CBN and CBV, we design for each of them a quantitative type system based on a (tight) multi-type system. One key observation of this work is how CBN and CBV influence the composition of state types. That is, each type system is developed by taking into account how each language manages the global state: in CBN, the composition of state types is almost straightforward, since function application does not require evaluation of its argument; in CBV, however, the interaction between functions and arguments makes the composition of state types more subtle since only values can be passed as actual arguments. The main contribution of this paper is the design of type systems capturing quantitative information about effectful CBN and CBV programming languages. Indeed, we develop type systems that are qualitatively and quantitatively sound and complete.

2025

A Unified Approach to Video Anomaly Detection: Advancements in Feature Extraction, Weak Supervision, and Strategies for Class Imbalance

Authors
Barbosa, RZ; Oliveira, HS;

Publication
IEEE ACCESS

Abstract
This paper explores advancements in Video Anomaly Detection (VAD), combining theoretical insights with practical solutions to address model limitations. Through comprehensive experimental analysis, the study examines the role of feature representations, sampling strategies, and curriculum learning in enhancing VAD performance. Key findings include the impact of class imbalance on the Cross-Modal Awareness-Local Arousal (CMALA) architecture and the effectiveness of techniques like pseudo-curriculum learning in mitigating noisy classes, such as Car Accident. Novel strategies like the Sample-Batch Selection (SBS) dynamic segment selection and pre-trained image-text models, including Contrastive Language-Image Pre-training (CLIP) and ViTamin encoder, significantly improve anomaly detection. The research underscores the potential of multimodal VAD, highlighting the integration of audio and visual modalities and the development of multimodal fusion techniques. To support this evolution, the study proposes a Unified WorkStation 4 VAD (UWS4VAD) to streamline research workflows and introduces a new VAD benchmark incorporating multimodal data and textual information. The work envisions enhanced anomaly interpretation and performance by leveraging joint representation learning and Large Language Models (LLMs). The findings set the stage for future advancements, advocating for large-scale pre-training on audio-visual datasets and shifting toward a more integrated, multimodal approach to VADs. Source code of the project available at https://github.com/zuble/uws4vad

  • 223
  • 4387