Publications - INESC TEC

Publications

2025

Industry 4.0 Technologies Revolutionising Footwear: Paving the Path to Circularity Through Innovative Services

Authors
Monteiro, L; Simoes, AC; Baptista, AJ; Rebelo, R;

Publication
HUMAN-CENTRED TECHNOLOGY MANAGEMENT FOR A SUSTAINABLE FUTURE, VOL 2, IAMOT

Abstract
The footwear industry, a sub-sector of the textile industrial sector, faces increasing pressure towards higher levels of sustainability and circularity along the entire value chain. Over the last decades, shoes have become more complex products, integrating a greater number of components, more diverse materials and often long supply chains linked to cost reduction and production or sourcing delocalization strategies. Full value-chain digitalization, as a cornerstone of the Industry 4.0 paradigm, plays a key role in enabling more sustainable and circular products, namely through the operationalization of traceability and forthcoming instruments such as the Digital Product Passport. This research studied, via a state-of-the-art framing of the challenges followed by a qualitative approach, how Industry 4.0 technologies can support the development of new services that contribute to sustainable and circular practices in footwear companies. An interview-based survey was conducted with six footwear companies to map the adoption level of Industry 4.0 technologies and to cross-link it to circular service business models.

2025

Benchmarking Controllers for Low-Cost Agricultural SCARA Manipulators

Authors
Tinoco, V; Silva, MF; dos Santos, FN; Morais, R;

Publication
SENSORS

Abstract
Agriculture needs to produce more with fewer resources to satisfy the world's demands. Labor shortages, especially during harvest seasons, emphasize the need for agricultural automation. However, the high cost of commercially available robotic manipulators, ranging from EUR 3000 to EUR 500,000, is a significant barrier. This research addresses the challenges posed by low-cost manipulators, such as inaccuracy, limited sensor feedback, and dynamic uncertainties. Three control strategies for a low-cost agricultural SCARA manipulator were developed and benchmarked: a Sliding Mode Controller (SMC), a Reinforcement Learning (RL) Controller, and a novel Proportional-Integral (PI) controller with a self-tuning feedforward element (PIFF). The results show the best response time was obtained using the SMC, but with joint movement jitter. The RL controller showed sudden breaks and overshot upon reaching the setpoint. Finally, the PIFF controller showed the smoothest reference tracking but was more susceptible to changes in system dynamics.

2025

Unraveling Emotions With Pre-Trained Models

Authors
Pajón-Sanmartín, A; De Arriba-Pérez, F; García-Méndez, S; Leal, F; Malheiro, B; Burguillo-Rial, JC;

Publication
IEEE ACCESS

Abstract
Transformer models have significantly advanced the field of emotion recognition. However, there are still open challenges when exploring open-ended queries for Large Language Models (LLMs). Although current models offer good results, automatic emotion analysis in open texts presents significant challenges, such as contextual ambiguity, linguistic variability, and difficulty interpreting complex emotional expressions. These limitations make the direct application of generalist models difficult. Accordingly, this work compares the effectiveness of fine-tuning and prompt engineering in emotion detection in three distinct scenarios: (i) performance of fine-tuned pre-trained models and general-purpose LLMs using simple prompts; (ii) effectiveness of different emotion prompt designs with LLMs; and (iii) impact of emotion grouping techniques on these models. Experimental tests attain metrics above 70% with a fine-tuned pre-trained model for emotion recognition. Moreover, the findings highlight that LLMs require structured prompt engineering and emotion grouping to enhance their performance. These advancements improve sentiment analysis, human-computer interaction, and understanding of user behavior across various domains.

2025

Machine Learning for Decision Support and Automation in Games: A Study on Vehicle Optimal Path

Authors
Penelas, G; Barbosa, L; Reis, A; Barroso, J; Pinto, T;

Publication
ALGORITHMS

Abstract
In the field of gaming artificial intelligence, selecting the appropriate machine learning approach is essential for improving decision-making and automation. This paper examines the effectiveness of deep reinforcement learning (DRL) within interactive gaming environments, focusing on complex decision-making tasks. Utilizing the Unity engine, we conducted experiments to evaluate DRL methodologies in simulating realistic and adaptive agent behavior. A vehicle driving game is implemented, in which the goal is to reach a certain target within a small number of steps, while respecting the boundaries of the roads. Our study compares Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC) in terms of learning efficiency, decision-making accuracy, and adaptability. The results demonstrate that PPO successfully learns to reach the target, achieving higher and more stable cumulative rewards. Conversely, SAC struggles to reach the target, displaying significant variability and lower performance. These findings highlight the effectiveness of PPO in this context and indicate the need for further development, adaptation, and tuning of SAC. This research contributes to innovative approaches in which ML improves how player agents adapt and react to their environments, thereby enhancing realism and dynamics in gaming experiences. Additionally, this work emphasizes the utility of using games to evolve such models, preparing them for real-world applications, namely in the fields of autonomous vehicle driving and optimal route calculation.

2025

Grad-CAM: The impact of large receptive fields and other caveats

Authors
Santos, R; Pedrosa, J; Mendonça, AM; Campilho, A;

Publication
COMPUTER VISION AND IMAGE UNDERSTANDING

Abstract
The increase in complexity of deep learning models demands explanations that can be obtained with methods like Grad-CAM. This method computes an importance map for the last convolutional layer relative to a specific class, which is then upsampled to match the size of the input. However, this final step assumes that there is a spatial correspondence between the last feature map and the input, which may not be the case. We hypothesize that, for models with large receptive fields, the spatial organization of the features is not preserved during the forward pass, which may render the explanations devoid of meaning. To test this hypothesis, common architectures were applied to a medical scenario on the public VinDr-CXR dataset, to a subset of ImageNet and to datasets derived from MNIST. The results show a significant dispersion of the spatial information, which goes against the assumption of Grad-CAM, and that explainability maps are affected by this dispersion. Furthermore, we discuss several other caveats regarding Grad-CAM, such as feature map rectification, empty maps and the impact of global average pooling or flatten layers. Altogether, this work addresses some key limitations of Grad-CAM which may go unnoticed by common users, taking one step further in the pursuit of more reliable explainability methods.
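
As a rough illustration of the Grad-CAM steps summarised in this abstract, the following is a minimal pure-Python sketch, not the authors' code. The function name `grad_cam`, its arguments, and the use of nearest-neighbour upsampling (instead of the interpolation a real implementation would typically use) are assumptions made for illustration; the feature maps and gradients are assumed to have been extracted from a model beforehand.

```python
def grad_cam(features, grads, out_h, out_w):
    """Sketch of Grad-CAM on precomputed inputs (hypothetical helper).

    features: K feature maps of the last convolutional layer, each H x W.
    grads:    gradients of the class score w.r.t. those maps, same shape.
    """
    k = len(features)
    h, w = len(features[0]), len(features[0][0])

    # Channel weights: global average of each gradient map.
    weights = [sum(sum(row) for row in g) / (h * w) for g in grads]

    # Weighted sum of the feature maps followed by ReLU (rectification).
    cam = [[max(0.0, sum(weights[c] * features[c][i][j] for c in range(k)))
            for j in range(w)] for i in range(h)]

    # Nearest-neighbour upsampling to the input size. This step assumes
    # that cell (i, j) still corresponds to the same region of the input.
    return [[cam[i * h // out_h][j * w // out_w] for j in range(out_w)]
            for i in range(out_h)]


# Toy example: two 2x2 channels, the second with a negative weight.
features = [[[1.0, 0.0], [0.0, 0.0]],
            [[0.0, 0.0], [0.0, 2.0]]]
grads = [[[1.0, 1.0], [1.0, 1.0]],
         [[-1.0, -1.0], [-1.0, -1.0]]]
heatmap = grad_cam(features, grads, 4, 4)
```

In this toy case the second channel's contribution is negative and is removed by the rectification, so only the top-left region of the upsampled map is non-zero; if every channel had a negative weight, the map would be empty.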

2025

An Explainable Machine Learning Framework for Railway Predictive Maintenance using Data Streams from the Metro Operator of Portugal

Authors
Méndez, SG; Arriba Pérez, Fd; Leal, F; Veloso, B; Malheiro, B; Burguillo Rial, JC;

Publication
CoRR

Abstract
