Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by CESE

2025

Reusing ML Models in Dynamic Data Environments: Data Similarity-Based Approach for Efficient MLOps

Authors
Peixoto, E; Torres, D; Carneiro, D; Silva, B; Marques, R;

Publication
BIG DATA AND COGNITIVE COMPUTING

Abstract
The rapid integration of Machine Learning (ML) in organizational practices has driven demand for substantial computational resources, incurring both high economic costs and environmental impact, particularly from energy consumption. This challenge is amplified in dynamic data environments, where ML models must be frequently retrained to adapt to evolving data patterns. To address this, more sustainable Machine Learning Operations (MLOps) pipelines are needed for reducing environmental impacts while maintaining model accuracy. In this paper, we propose a model reuse approach based on data similarity metrics, which allows organizations to leverage previously trained models where applicable. We introduce a tailored set of meta-features to characterize data windows, enabling efficient similarity assessment between historical and new data. The effectiveness of the proposed method is validated across multiple ML tasks using the cosine and Bray-Curtis distance functions, which evaluate both model reuse rates and the performance of reused models relative to newly trained alternatives. The results indicate that the proposed approach can reduce the frequency of model retraining by up to 70% to 90% while maintaining or even improving predictive performance, contributing to more resource-efficient and sustainable MLOps practices.

2025

Efficient MLOps: Meta-learning Meets Frugal AI

Authors
Peixoto, E; Torres, D; Carneiro, D; Silva, B; Novais, P;

Publication
ADVANCES IN ARTIFICIAL INTELLIGENCE IN MANUFACTURING II

Abstract
The advent of large Machine Learning models and the steep increase in the demand for AI solutions occurs at the same point in time in which policies are being enacted to implement more sustainable processes in virtually every sector. This means there is a need for more, better and larger models, which require significant computational resources, while at the same time a call for a decrease in the energy spent in the processes associated to MLOps. In this paper we propose a reduced set of meta-features that can be used to characterize sets of data and their relationship with model performance. We start from a large set of 66 features, and reduce it to only 10 while maintaining the strength of this relationship. This ensures a process of meta-feature extraction and prediction of model performance that is in line with the desiderata of Frugal AI, allowing to develop more efficient ML processes.

2025

Contributions for the Development of Personae: Method for Creating Persona Templates (MCPT)

Authors
Couto, F; Malta, MC;

Publication
HCI INTERNATIONAL 2024-LATE BREAKING PAPERS, PT I

Abstract
This paper contributes to developing a Method for Creating Persona Templates (MCPT), addressing a significant gap in user-centred design methodologies. Utilising qualitative data collection and analysis techniques, MCPT offers a systematic approach to developing robust and context-oriented persona templates. MCPT was created by applying the Design Science Research (DSR) methodology, and it incorporates multiple iterations for template refinement and validation among project stakeholders; all of the proposed steps of this method were based on theoretical contributions. Furthermore, MCPT was tested and refined within a real-life R&D project focusing on developing a digital platform e-marketplace for short agrifood supply chains in two iteration cycles. MCPT fills a critical void in persona research by providing detailed instructions for each step of template development. By involving the target audience, users, and project stakeholders, MCPT adds rigour to the persona creation process, enhancing the quality and relevance of personae casts. This paper contributes to the body of knowledge by offering an initial proposal of a comprehensive method for creating persona templates within diverse projects and contexts. Further research should explore MCPT's adaptability to different settings and projects, thus refining its effectiveness and extending its utility in user-centred design practices.

2025

Aligning Frameworks: Identifying Compatible Pairs of Digital Transformation and Maturity Models

Authors
Couto, F; Curado Malta, M;

Publication
SN Computer Science

Abstract
Digital Transformation Models (DTM) and Digital Maturity Models (DMM) are two artefacts that guide the planning and implementation of Digital Transformation (DT) initiatives. When used in a combined approach, a DTM-DMM pairing could support DT managers and practitioners, as DTs are holistic and complex initiatives with high failure rates. However, no study has yet systematically addressed the compatibility amongst artefacts. This paper, therefore, aims to analyse the compatibility between academic DTMs and DMMs. Based on architectural compatibility and conceptual similarity, we provide a structured and replicable mixed methods approach to assessing artefact compatibility. To achieve this, we start with a systematic literature review to identify existing academic DTMs and DMMs, analyse the models and group them according to their scope. After, we employ quantitative similarity analysis techniques (Term Frequency-Inverse Document Frequency and Bidirectional Encoder Representations from Transformers combined with Cosine Similarity) and perform a qualitative compatibility analysis to establish ground truth. Based on this analysis, we apply the Receiver Operating Characteristic Curve technique to define threshold values for compatibility assessment. The threshold values were used to suggest compatible DTM-DMM pairings, resulting in nine DTM-DMM binomials for Small and Medium-sized Enterprises. The findings support managers and practitioners in selecting DTM-DMM pairs to guide DT initiatives while offering academics a mixed-methods approach based on the similarity analysis field to evaluate artefact compatibility systematically. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2025.

2025

Optimisation and Control in Airborne Wind Energy: A Bibliometric Study

Authors
Paiva, LT; Mota, A; Roque, L;

Publication
Lecture Notes in Electrical Engineering

Abstract
Airborne Wind Energy (AWE) systems represent an innovative method for capturing wind energy at high altitudes, where wind conditions are typically stronger and more consistent. These systems utilize flying devices tethered to a ground station to harness wind energy. An AWE system comprises a tether connecting the flying device to a base station, a control system for maneuvering the device, and a mechanism for converting kinetic energy into electricity. Researchers are exploring various materials, designs, and control methods to enhance the efficiency and reliability of AWE systems. Over the past decade, interest in AWE has surged, leading to a substantial increase in scholarly publications on the topic. This research conducts an in-depth bibliometric analysis. This analysis highlights emerging topics, allowing researchers to identify new trends and areas of interest within a field. By emphasizing these emerging topics, researchers and stakeholders can better align their efforts with the latest developments and opportunities in their area of study. Findings reveal that research on control techniques in AWE has grown at an average annual rate of 16% since 2013. Additionally, the study identifies the most influential aspects of the literature, including key topics, articles, authors, and keywords. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

2025

Multivariate Analysis of Products Tipology Data - A Case Study

Authors
Costa, N; Mota, A; Sousa, IPSC;

Publication
Lecture Notes in Networks and Systems

Abstract
Small, medium, and large organizations collect vast amounts of data with the expectation of using it to generate commercial value. Machine learning is a powerful tool for extracting valuable insights from this data and serves as a pivotal sales strategy for companies to maximize profits. This paper seeks to analyze sales data and discern patterns in sales among products that exhibit similarities, such as boxes and bags. In order to achieve this goal, was used unsupervised learning methods that allow the segmentation of groups, specifically Principal Component Analysis (PCA), k-means algorithms, and hierarchical clustering. PCA was used to identify correlated variables and find hidden patterns in the data, particularly pertaining to product families with similar sales. Elbow, Silhouette, and 30 indices methods were applied to determine the optimal number of clusters. Based on these results, it was determined the optimal number of clusters. Validation methods were employed to identify the clustering algorithm exhibiting the best performance. Stability measures evaluated the consistency of the clusters, while the cophenetic coefficient aided in determining the most effective data grouping method. After validation, the clustering algorithms were implemented. The results indicated that all clustering algorithms effectively segmented the data, with particular emphasis on the performance of the k-means algorithm. This study identified product groups with similar sales patterns and key products that impact the company’s global sales. Multivariate analysis provided a deeper understanding of sales dynamics, enabling the company to implement targeted marketing strategies and optimize resource allocation to boost bag and box sales in Portugal and other countries. © 2025 Elsevier B.V., All rights reserved.

  • 9
  • 226