Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

2025

Salvador Urban Network Transportation (SUNT): A Landmark Spatiotemporal Dataset for Public Transportation

Autores
Ferreira, MV; Souza, M; Rios, TN; Fernandes, IFC; Nery, J; Gama, J; Bifet, A; Rios, RA;

Publicação
SCIENTIFIC DATA

Abstract
Efficient public transportation management is essential for the development of large urban centers, providing several benefits such as comprehensive coverage of population mobility, reduction of transport costs, better control of traffic congestion, and significant reduction of environmental impact limiting gas emissions and pollution. Realizing these benefits requires a deeply understanding the population and transit patterns and the adoption of approaches to model multiple relations and characteristics efficiently. This work addresses these challenges by providing a novel dataset that includes various public transportation components from three different systems: regular buses, subway, and BRT (Bus Rapid Transit). Our dataset comprises daily information from about 700,000 passengers in Salvador, one of Brazil's largest cities, and local public transportation data with approximately 2,000 vehicles operating across nearly 400 lines, connecting almost 3,000 stops and stations. With data collected from March 2024 to March 2025 at a frequency lower than one minute, SUNT stands as one of the largest, most comprehensive, and openly available urban datasets in the literature.

2025

Preface

Autores
Mamede, S; Santos, A;

Publicação
AI and Learning Analytics in Distance Learning

Abstract
[No abstract available]

2025

Collaborative Fault Tolerance for Cyber-Physical Systems: The Diagnosis Stage

Autores
Piardi, L; Costa, P; de Oliveira, AS; Leitao, P;

Publicação
IEEE ACCESS

Abstract
The reliability and robustness of cyber-physical systems (CPS) are critical aspects of the current industrial landscape. The high level of autonomous and distributed components associated with a large number of devices makes CPS prone to faults. Despite their importance and benefits, traditional fault tolerance methodologies, namely local and/or centralized, often overlook the potential benefits of collaboration between cyber-physical components. This paper introduces a collaborative fault diagnosis methodology for CPS, integrating self-fault diagnosis capabilities in agents and leveraging collaborative behavior to enhance fault diagnosis. The contribution of this paper relay in propose a methodology for fault diagnosis for CPS, based on multi-agent system (MAS) technology as a backbone of infra-structure, highlighting the components, agent behavior, functionalities, and interaction protocols, to explore the benefits of communication and collaboration between agents. The proposed methodology enhance the accuracy of fault diagnosis when compared with local approach. A case study was conducted in a laboratory-scale warehouse, focusing on diagnosing drift, bias, and precision faults in temperature and humidity sensors. Experimental results reveal that the collaborative methodology significantly outperforms the local approach in fault diagnosis, as evidenced by performance improvements in diagnosis classification. The statistical significance of these results was validated using the Wilcoxon signed-ranks test for paired samples.

2025

Improving LIBS-based mineral identification with Raman imaging and spectral knowledge distillation

Autores
Lopes, T; Cavaco, R; Capela, D; Dias, F; Teixeira, J; Monteiro, CS; Lima, A; Guimaraes, D; Jorge, PAS; Silva, NA;

Publicação
TALANTA

Abstract
Combining data from different sensing modalities has been a promising research topic for building better and more reliable data-driven models. In particular, it is known that multimodal spectral imaging can improve the analytical capabilities of standalone spectroscopy techniques through fusion, hyphenation, or knowledge distillation techniques. In this manuscript, we focus on the latter, exploring how one can increase the performance of a Laser-induced Breakdown Spectroscopy system for mineral classification problems using additional spectral imaging techniques. Specifically, focusing on a scenario where Raman spectroscopy delivers accurate mineral classification performance, we show how to deploy a knowledge distillation pipeline where Raman spectroscopy may act as an autonomous supervisor for LIBS. For a case study concerning a challenging Li-bearing mineral identification of spodumene and petalite, our results demonstrate the advantages of this method in improving the performance of a single-technique system. LIBS trained with labels obtained by Raman presents an enhanced classification performance. Furthermore, leveraging the interpretability of the model deployed, the workflow opens opportunities for the deployment of assisted feature discovery pipelines, which may impact future academic and industrial applications.

2025

Unsupervised machine learning in sleep research: a scoping review

Autores
Biedebach, L; Ferreira-Santos, D; Stefanos, MA; Lindhagen, A; Pires, GN; Arnardóttir, ES; Islind, AS;

Publicação
SLEEP

Abstract
Study Objectives Unsupervised machine learning-an approach that identifies patterns and structures within data without relying on labels-has demonstrated remarkable success in various domains of sleep research. This underscores the broader utility of machine learning, suggesting that its capabilities extend beyond current applications and warrant further exploration for novel insights in sleep studies, focusing specifically on unsupervised machine learning.Methods This paper outlines a scoping review conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines for scoping reviews. A comprehensive search covering various search terms focusing on the intersection between unsupervised machine learning and sleep led to 3960 publications. After screening all titles and abstracts with two independent reviewers, ultimately, 356 publications were included in the full-text review. The data extracted from the full texts included information about the machine learning methods and types of sleep data, as well as the study population.Results There has been a steep increase in the number of publications in this research area in the past 10 years. Clustering is the most commonly used method, but other methods are gaining popularity. Apart from classical polysomnography, data from wearable devices, nearables, video, audio, and medical imaging techniques have been used as input to unsupervised machine learning. The broad search allowed us to explore various applications within sleep research, ranging from the general population to populations with various sleep disorders.Conclusion The review mapped existing research on unsupervised learning in sleep research, identified gaps in the literature, and derived directions for future research. Statement of Significance Sleep is a transdisciplinary research field. With the rise of unsupervised machine learning and its emergence in sleep research, there is a pressing need to cultivate a mutual understanding across disciplinary boundaries to curate meaningful applications of unsupervised machine learning. This scoping review aims to serve as a foundation to facilitate collaboration across disciplines and ultimately contribute to the elevation of sleep research, by identifying novel ways of applying unsupervised machine learning.

2025

Resilience Under Attack: Benchmarking Optimizers Against Poisoning in Federated Learning for Image Classification Using CNN

Autores
Biadgligne, Y; Baghoussi, Y; Li, K; Jorge, A;

Publicação
Advances in Computational Intelligence - 18th International Work-Conference on Artificial Neural Networks, IWANN 2025, A Coruña, Spain, June 16-18, 2025, Proceedings, Part I

Abstract
Federated Learning (FL) enables decentralized model training while preserving data privacy but remains susceptible to poisoning attacks. Malicious clients can manipulate local data or model updates, threatening FL’s reliability, especially in privacy-sensitive domains like healthcare and finance. While client-side optimization algorithms play a crucial role in training local models, their resilience to such attacks is underexplored. This study empirically evaluates the robustness of three widely used optimization algorithms: SGD, Adam, and RMSProp—against label-flipping attacks (LFAs) in image classification tasks using Convolutional Neural Networks (CNNs). Through 900 individual runs in both federated and centralized learning (CL) settings, we analyze their performance under Independent and Identically Distributed (IID) and Non-IID data distributions. Results reveal that SGD is the most resilient, achieving the highest accuracy in 87% of cases, while Adam performs best in 13%. Additionally, centralized models outperform FL on CIFAR-10, whereas FL excels on Fashion-MNIST, highlighting the impact of dataset characteristics on adversarial robustness. © 2025 Elsevier B.V., All rights reserved.

  • 156
  • 4391