Detalhes
Nome
António CunhaCargo
Investigador SéniorDesde
01 janeiro 2014
Nacionalidade
PortugalCentro
Robótica Industrial e Sistemas InteligentesContactos
+351222094353
antonio.cunha@inesctec.pt
2025
Autores
Fontes, MF; Neto, AH; Almeida, JD; Cunha, AT;
Publicação
APPLIED SCIENCES-BASEL
Abstract
Medical imaging is vital for diagnosing and treating colorectal cancer (CRC), a leading cause of mortality. Classifying colorectal polyps and CRC precursors remains challenging due to operator variability and expertise dependence. Deep learning (DL) models show promise in polyp classification but face adoption barriers due to their 'black box' nature, limiting interpretability. This study presents an example-based explainable artificial intehlligence (XAI) approach using Pix2Pix to generate synthetic polyp images with controlled size variations and LIME to explain classifier predictions visually. EfficientNet and Vision Transformer (ViT) were trained on datasets of real and synthetic images, achieving strong baseline accuracies of 94% and 96%, respectively. Image quality was assessed using PSNR (18.04), SSIM (0.64), and FID (123.32), while classifier robustness was evaluated across polyp sizes. Results show that Pix2Pix effectively controls image attributes like polyp size despite limitations in visual fidelity. LIME integration revealed classifier vulnerabilities, underscoring the value of complementary XAI techniques. This enhances DL model interpretability and deepens understanding of their behaviour. The findings contribute to developing explainable AI tools for polyp classification and CRC diagnosis. Future work will improve synthetic image quality and refine XAI methodologies for broader clinical use.
2025
Autores
Carneiro, GA; Svoboda, J; Cunha, A; Sousa, JJ; Stych, P;
Publicação
EUROPEAN JOURNAL OF REMOTE SENSING
Abstract
This paper focuses on an innovative application of deep learning (DL) techniques, particularly 3D convolutional neural networks (CNNs), for land cover classification using multispectral Sentinel-2 (S-2) data. In this study, we evaluated the performance of window-pixel-wise 2D, 3D, and 3D Multiscale CNN architectures for land cover classification. 3D and 3D multiscale CNNs were using the spectral dimension as the third dimension for convolutions. Methodology was applied to classify large area (23,217 km2) in Czechia according to the Land use, land-use change and forestry (LULUCF) categories, a key sector in greenhouse gas inventories. The input dataset included S-2 data, along with NDVI, NDVI variance, and SRTM elevation data, all resampled to the 10 m S-2 grid and forming multi-dimensional input 5 x 5 pixel patches. The results show that a 3D CNN with 3 x 3 x 3 spatial-spectral filters and classical training achieved the best F1 score of 0.84, outperforming other proposed CNN architectures and a baseline Random Forest classifier. The study highlights the ability of 3D CNNs to integrate spatial-spectral information, making them highly effective for multispectral data analysis, even with limited (small) training ground truth datasets. This approach provides valuable information for researchers seeking to optimize DL methods for land cover classification, particularly for applications aligned with the LULUCF frameworks.
2025
Autores
Teixeira, AC; Bakon, M; Lopes, D; Cunha, A; Sousa, JJ;
Publicação
SCIENCE OF REMOTE SENSING
Abstract
Soil moisture plays a central role in agricultural sustainability and water-resource management under climate change and increasing water scarcity. Remote-sensing technologies have transformed soil-moisture estimation by enabling large-scale, high-resolution, and continuous monitoring. Following the PRISMA framework, this systematic review analyzes 64 studies published between 2016 and 2024, selected from 379 screened articles, focusing on agricultural applications. Remote-sensing data span optical, thermal, and microwave observations from satellites and unmanned aerial vehicles (UAVs), with estimation approaches classified as empirical, semi-empirical, physical, or learning-based. Satellite observations dominate the literature (73% of studies), while UAVs are increasingly used for high-resolution, site-specific assessments. Multi-sensor fusion, combining optical, thermal, and microwave data, is a growing strategy to overcome the limitations of individual sensors. Active SAR systems provide weather-independent measurements with high spatial resolution, whereas optical and thermal sensors offer valuable spectral indices but are limited by cloud cover and shallow penetration depth. Learning-based methods are the most frequent approach (54% of studies), using machine and deep learning to model complex relationships between soil moisture and remote-sensing variables. Principal challenges include vegetation interference, surface roughness, and limited in-situ calibration data. Mitigation strategies involve longer-wavelength SAR (L-and P-bands), multi-sensor fusion, downscaling, and integration of auxiliary datasets (soil texture, elevation, meteorology). By synthesizing recent advances and emerging trends, this review provides practical guidance for accurate, scalable, and operational soil-moisture monitoring in precision agriculture and environmental management.
2025
Autores
Souza, JPGD; Silva, AC; Congro, M; Roehl, D; Paiva, ACD; Pereira, S; Cunha, A;
Publicação
ELECTRONICS
Abstract
Fiber-reinforced concrete is a crucial material for civil construction, and monitoring its health is important for preserving structures and preventing accidents and financial losses. Among non-destructive monitoring methods, Micro Computed Tomography (Micro-CT) imaging stands out as an inexpensive method that is free from noise and external interference. However, manual inspection of these images is subjective and requires significant human effort. In recent years, several studies have successfully utilized Deep Learning models for the automatic detection of cracks in concrete. However, according to the literature, a gap remains in the context of detecting cracks using Micro-CT images of fiber-reinforced concrete. Therefore, this work proposes a framework for automatic crack detection that combines the following: (a) a super-resolution-based preprocessing to generate, for each image, versions with double and quadruple the original resolution, (b) a classification step using EfficientNetB0 to classify the type of concrete matrix, (c) specific training of Detection Transformer (DETR) models for each type of matrix and resolution, and (d) and a votation committee-based post-processing among the models trained for each resolution to reduce false positives. The model was trained on a new publicly available dataset, the FIRECON dataset, which consists of 4064 images annotated by an expert, achieving metrics of 86.098% Intersection over Union, 89.37% Precision, 83.26% Recall, 84.99% F1-Score, and 44.69% Average Precision. The framework, therefore, significantly reduces analysis time and improves consistency compared to the manual methods used in previous studies. The results demonstrate the potential of Deep Learning to aid image analysis in damage assessments, providing valuable insights into the damage mechanisms of fiber-reinforced concrete and contributing to the development of durable, high-performance engineering materials. © 2025 by the authors.
2025
Autores
Fontes, M; Bakon, M; Cunha, A; Sousa, JJ;
Publicação
SENSORS
Abstract
Monitoring civil infrastructure is increasingly critical due to aging assets, urban expansion, and the need for early detection of structural instabilities. Interferometric Synthetic Aperture Radar (InSAR) offers high-resolution, all-weather surface deformation monitoring capabilities, which are being enhanced by recent advances in Deep Learning (DL). Despite growing interest, the existing literature lacks a comprehensive synthesis of how DL models are applied specifically to infrastructure monitoring using InSAR data. This review addresses this gap by systematically analyzing 67 peer-reviewed articles published between 2020 and February 2025. We examine the DL architectures employed, ranging from LSTMs and CNNs to Transformer-based and hybrid models, and assess their integration within various stages of the InSAR monitoring pipeline, including pre-processing, temporal analysis, segmentation, prediction, and risk classification. Our findings reveal a predominance of LSTM and CNN-based approaches, limited exploration of pre-processing tasks, and a focus on urban and linear infrastructures. We identify methodological challenges such as data sparsity, low coherence, and lack of standard benchmarks, and we highlight emerging trends including hybrid architectures, attention mechanisms, end-to-end pipelines, and data fusion with exogenous sources. The review concludes by outlining key research opportunities, such as enhancing model explainability, expanding applications to underexplored infrastructure types, and integrating DL-InSAR workflows into operational structural health monitoring systems.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.