Publications

Publications by CRIIS

2025

Spray Quality Assessment on Water-Sensitive Paper Comparing AI and Classical Computer Vision Methods

Authors
Simoes, I; Sousa, AJ; Baltazar, A; Santos, F;

Publication
AGRICULTURE-BASEL

Abstract
Precision agriculture seeks to optimize crop yields while minimizing resource use. A key challenge is achieving uniform pesticide spraying to prevent crop damage and environmental contamination. Water-sensitive paper (WSP) is a common tool used for assessing spray quality, as it visually registers droplet impacts through color change. This work introduces a smartphone-based solution for capturing WSP images within vegetation, offering a tool for farmers to assess spray quality in real-world conditions. To achieve this, two approaches were explored: classical computer vision techniques and machine learning (ML) models (YOLOv8, Mask-RCNN, and Cellpose). Addressing the challenges of limited real-world data and the complexity of manual annotation, a programmatically generated synthetic dataset was employed to enable sim-to-real transfer learning. For the task of WSP segmentation within vegetation, YOLOv8 achieved an average Intersection over Union of 97.76%. In the droplet detection task, which involves identifying individual droplets on WSP, Cellpose achieved the highest precision of 96.18%, in the presence of overlapping droplets. While classical computer vision techniques provided a reliable baseline, they struggled with complex cases. Additionally, ML models, particularly Cellpose, demonstrated accurate droplet detection even without fine-tuning.

CloseRead Abstract

2025

Gen-JEMA: enhanced explainability using generative joint embedding multimodal alignment for monitoring directed energy deposition

Authors
Ferreira, J; Darabi, R; Sousa, A; Brueckner, F; Reis, LP; Reis, A; Tavares, RS; Sousa, J;

Publication
Journal of Intelligent Manufacturing

Abstract
This work introduces Gen-JEMA, a generative approach based on joint embedding with multimodal alignment (JEMA), to enhance feature extraction in the embedding space and improve the explainability of its predictions. Gen-JEMA addresses these challenges by leveraging multimodal data, including multi-view images and metadata such as process parameters, to learn transferable semantic representations. Gen-JEMA enables more explainable and enriched predictions by learning a decoder from the embedding. This novel co-learning framework, tailored for directed energy deposition (DED), integrates multiple data sources to learn a unified data representation and predict melt pool images from the primary sensor. The proposed approach enables real-time process monitoring using only the primary modality, simplifying hardware requirements and reducing computational overhead. The effectiveness of Gen-JEMA for DED process monitoring was evaluated, focusing on its generalization to downstream tasks such as melt pool geometry prediction and the generation of external melt pool representations using off-axis sensor data. To generate these external representations, autoencoder (AE) and variational autoencoder (VAE) architectures were optimized using Bayesian optimization. The AE outperformed other approaches achieving a 38% improvement in melt pool geometry prediction compared to the baseline and 88% in data generation compared with the VAE. The proposed framework establishes the foundation for integrating multisensor data with metadata through a generative approach, enabling various downstream tasks within the DED domain and achieving a small embedding, allowing efficient process control based on model predictions and embeddings. © The Author(s) 2025.

CloseRead Abstract

2025

AR/VR Digital Twin for simulation and data collection of robotic environments

Authors
Martins, JG; Nutonen, K; Costa, P; Kuts, V; Otto, T; Sousa, A; Petry, MR;

Publication
2025 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC

Abstract
Digital twins enable real-time modeling, simulation, and monitoring of complex systems, driving advancements in automation, robotics, and industrial applications. This study presents a large-scale digital twin-testing facility for evaluating mobile robots and pilot robotic systems in a research laboratory environment. The platform integrates high-fidelity physical and environmental models, providing a controlled yet dynamic setting for analyzing robotic behavior. A key feature of the system is its comprehensive data collection framework, capturing critical parameters such as position, orientation, and velocity, which can be leveraged for machine learning, performance optimization, and decision-making. The facility also supports the simulation of discrete operational systems, using predictive modeling to bridge informational gaps when real-time data updates are unavailable. The digital twin was validated through a matrix manufacturing system simulation, with an Augmented Reality (AR) interface on the HoloLens 2 to overlay digital information onto mobile platform controllers, enhancing situational awareness. The main contributions include a digital twin framework for deploying data-driven robotic systems and three key AR/VR integration optimization methods. Demonstrated in a laboratory setting, the system is a versatile tool for research and industrial applications, fostering insights into robotic automation and digital twin scalability while reducing costs and risks associated with real-world testing.

CloseRead Abstract

2025

Exploring the Potential of LLM-based Chatbots for Task Scheduling in Robot Operations

Authors
Rema, C; Sousa, A; Sobreira, H; Costa, P; Silva, MF;

Publication
2025 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC

Abstract
The rise of Industry 4.0 has revolutionized manufacturing by integrating real-time data analysis, artificial intelligence (AI), automation, and interconnected systems, enabling adaptive and resilient smart factories. Autonomous Mobile Robots (AMRs), with their advanced mobility and navigation capabilities, are a pillar of this transformation. However, their deployment in job shop environments adds complexity to the already challenging Job Shop Scheduling Problem (JSSP), expanding it to include task allocation, robot scheduling, and travel time optimization, creating a multi-faceted, non-deterministic polynomial-time hardness (NP-hard) problem. Traditional approaches such as heuristics, meta-heuristics, and mixed integer linear programming (MILP) are commonly used. Recent AI advancements, particularly large language models (LLM), have shown potential in addressing these scheduling challenges due to significant improvements in reasoning and decision-making from textual data. This paper examines the application of LLM to tackle scheduling complexities in smart job shops with mobile robots. Guided by tailored prompts inserted manually, LLM are employed to generate scheduling solutions, being these compared to an heuristic-based method. The results indicate that LLM currently have limitations in solving complex combinatorial problems, such as task scheduling with mobile robots. Due to issues with consistency and repeatability, they are not yet reliable enough for practical implementation in industrial environments. However, they offer a promising foundation for augmenting traditional approaches in the future.

CloseRead Abstract

2025

JEMA-SINDYc: End-to-end Control using Joint Embedding Multimodal Alignment in Directed Energy Deposition

Authors
Sousa, J; Brandau, B; Hemschik, R; Darabi, R; Sousa, A; Reis, LP; Brueckner, F; Reis, A;

Publication
ADDITIVE MANUFACTURING

Abstract
Bringing AI models from digital to real-world applications presents significant challenges due to the complexity and variability of physical environments, often leading to unexpected model behaviors. We propose a framework that learns to translate images into control actions by modeling multimodal real-time data and system dynamics. This end-to-end controller offers enhanced explainability and robustness, making it well suited for complex manufacturing processes. This end-to-end framework differs from traditional approaches that rely on manually engineered features by learning complex relationships directly from raw data. Labels are only required during training to define the observable feature to be optimized. This adaptability significantly reduces development time and enhances scalability across varying conditions. This approach was tested in the Directed Energy Deposition (L-DED) process, a laser-based metal additive manufacturing technique that produces near-net-shape parts with exceptional energy efficiency and flexibility in both geometry and material selection. L-DED is inherently complex, involving multiphysics interactions, multiscale phenomena, and dynamic behaviors, which make modeling and optimization difficult. Effective control is crucial to ensure part quality in this dynamic environment. To address these challenges, we introduce Joint Embedding Multimodal Alignment with Sparse Identification of Nonlinear Dynamics for control (JEMA-SINDYc). It combines an image-based JEMA monitoring model, which predicts the melt pool size using only the on-axis sensor with labels provided by the off-axis camera, and dynamic modeling using SINDYc, which acts as a World Model by capturing system dynamics within the embedding space. Together, these components enable the development of an advanced controller trained via Behavioral Cloning. This approach improves part quality by minimizing porosity and reducing deformation. Thin-walled cylindrical parts were produced to validate and compare this approach with other control strategies, including both open-loop and JEMA-PID. This framework improves the reliability of AI-driven manufacturing and enhances control of complex industrial processes, potentially enabling wider adoption of the process.

CloseRead Abstract

2025

Using interdisciplinarity to promote the interconnection between ethics, sustainability and electrical engineering through electrical installations

Authors
Monteiro, F; Sousa, A;

Publication
EUROPEAN JOURNAL OF ENGINEERING EDUCATION

Abstract
Engineering is considered important in solving unsustainability. However, it is a complex problem that must be viewed, analysed and studied from various perspectives and taking with the contribution of various areas of knowledge. This work studied the use of interdisciplinarity as a contribution to interconnect ethics and sustainability with technical-scientific contents of electrical engineering. The research intended to use interdisciplinarity to help engineering students recognise that engineering is not ethically neutral, and that, therefore, the problems (and solutions) must also be analysed from an ethical and sustainability perspective. A framework was developed, and a pedagogical activity using interdisciplinarity was applied. Results show that, after the activity, students recognise that ethical values influence calculations in the area of electrical installations; and move from a single view to identify different alternatives, perspectives, motivations and multiple objectives. This leads to studying more alternatives and hopefully better overall technical solutions.

CloseRead Abstract