Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by Vitor Manuel Filipe

2025

Exploring Object Detection Learning: A Teaching Guide Through Educational Online Tutorials

Authors
Fernandes, T; Silva, T; Vaz, J; Silva, J; Cruz, G; Sousa, A; Barroso, J; Martins, P; Filipe, V;

Publication
Communications in Computer and Information Science - Technology and Innovation in Learning, Teaching and Education

Abstract

2025

A Computer-Aided Approach to Canine Hip Dysplasia Assessment: Measuring Femoral Head-Acetabulum Distance with Deep Learning

Authors
Franco-Gonçalo, P; Leite, P; Alves-Pimenta, S; Colaço, B; Gonçalves, L; Filipe, V; McEvoy, F; Ferreira, M; Ginja, M;

Publication
APPLIED SCIENCES-BASEL

Abstract
Canine hip dysplasia (CHD) screening relies on radiographic assessment, but traditional scoring methods often lack consistency due to inter-rater variability. This study presents an AI-driven system for automated measurement of the femoral head center to dorsal acetabular edge (FHC/DAE) distance, a key metric in CHD evaluation. Unlike most AI models that directly classify CHD severity using convolutional neural networks, this system provides an interpretable, measurement-based output to support a more transparent evaluation. The system combines a keypoint regression model for femoral head center localization with a U-Net-based segmentation model for acetabular edge delineation. It was trained on 7967 images for hip joint detection, 571 for keypoints, and 624 for acetabulum segmentation, all from ventrodorsal hip-extended radiographs. On a test set of 70 images, the keypoint model achieved high precision (Euclidean Distance = 0.055 mm; Mean Absolute Error = 0.0034 mm; Mean Squared Error = 2.52 x 10-5 mm2), while the segmentation model showed strong performance (Dice Score = 0.96; Intersection over Union = 0.92). Comparison with expert annotations demonstrated strong agreement (Intraclass Correlation Coefficients = 0.97 and 0.93; Weighted Kappa = 0.86 and 0.79; Standard Error of Measurement = 0.92 to 1.34 mm). By automating anatomical landmark detection, the system enhances standardization, reproducibility, and interpretability in CHD radiographic assessment. Its strong alignment with expert evaluations supports its integration into CHD screening workflows for more objective and efficient diagnosis and CHD scoring.

2025

Quality Inspection in Casting Aluminum Parts: A Machine Vision System for Filings Detection and Hole Inspection

Authors
Nascimento, R; Ferreira, T; Rocha, CD; Filipe, V; Silva, MF; Veiga, G; Rocha, L;

Publication
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS

Abstract
Quality inspection inspection systems are critical for maintaining product integrity. Being a repetitive task, when performed by operators only, it can be slow and error-prone. This paper introduces an automated inspection system for quality assessment in casting aluminum parts resorting to a robotic system. The method comprises two processes: filing detection and hole inspection. For filing detection, five deep learning modes were trained. These models include an object detector and four instance segmentation models: YOLOv8, YOLOv8n-seg, YOLOv8s-seg, YOLOv8m-seg, and Mask R-CNN, respectively. Among these, YOLOv8s-seg exhibited the best overall performance, achieving a recall rate of 98.10%, critical for minimizing false negatives and yielding the best overall results. Alongside, the system inspects holes, utilizing image processing techniques like template-matching and blob detection, achieving a 97.30% accuracy and a 2.67% Percentage of Wrong Classifications. The system improves inspection precision and efficiency while supporting sustainability and ergonomic standards, reducing material waste and reducing operator fatigue.

2023

Paperless Checklist for Process Validation and Production Readiness: An Industrial Use Case

Authors
Cosme, J; Pinto, T; Ribeiro, A; Filipe, V; Amorim, EV; Pinto, R;

Publication
International Conference on Web Information Systems and Technologies, WEBIST - Proceedings

Abstract
The Digital Model concept of factory floor equipment allows simulation, visualization and processing, and the ability to communicate between the various workstations. The Digital Twin is the concept used for the digital representation of equipment on the factory floor, capable of collecting a set of data about the equipment and production, using physical sensors installed in the equipment. Within the scope of data visualization and processing, there is a need to manage information about parameters/conditions that the assembly line equipments must present to start a production order, or in a shift handover. This study proposes a paperless checklist to manage equipment information and monitor production ramp-up. The proposed solution is validated in a real-world industrial scenario, by comparing its suitability against the current paper-based approach to log information. Results show that the paperless checklist presents advantages over the current approach since it enables multi-access viewing and logging while maintaining a digital history of log changes for further analysis. Copyright © 2023 by SCITEPRESS - Science and Technology Publications, Lda. Under CC license (CC BY-NC-ND 4.0)

2024

Pylung: A Supporting Tool for Comparative Study of ViT and CNN-Based Models Used for Lung Nodules Classification

Authors
Marques, F; Pestana, P; Filipe, V;

Publication
Lecture Notes in Networks and Systems

Abstract
Lung cancer is a significant global health concern, and accurate classification of lung nodules plays a crucial role in its early detection and treatment. This paper evaluates and compares the performance of Vision Transformer (ViT) and Convolutional Neural Network (CNN) models for lung nodule classification using the Pylung tool proposed in this work. The study aims to address the lack of research on ViT in lung nodule classification and proposes ViT as an alternative to CNN. The Lung Image Database Consortium and Image Database Resource Initiative (LIDC-IDRI) dataset is utilized for training and evaluation. The Pylung tool is employed for dataset preprocessing and comparison of models. Three models, ViT, VGG16, and ResNet50, are analyzed, and their hyperparameters are optimized using Optuna. The results show that ViT achieves the highest accuracy (99.06%) in nodule classification compared to VGG16 (98.71%) and ResNet50 (98.46%). The study contributes by introducing ViT as a model for lung nodule classification, presenting the Pylung tool for model comparison, and suggesting further investigations to improve the accuracy. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

2024

Performance Analysis of CNN Models in the Detection and Classification of Diabetic Retinopathy

Authors
Lúcio, F; Filipe, V; Gonçalves, L;

Publication
WIRELESS MOBILE COMMUNICATION AND HEALTHCARE, MOBIHEALTH 2023

Abstract
This study focuses on investigating different CNN architectures and assessing their effectiveness in classifying Diabetic Retinopathy, a diabetes-associated disease that ranks among the primary causes of adult blindness. However, early detection can significantly prevent its debilitating consequences. While regular screening is advised for diabetic patients, limited access to specialized medical professionals can hinder its implementation. To address this challenge, deep learning techniques provide promising solutions, primarily through their application in the analysis of fundus retina images for diagnosis. Several CNN architectures, including MobileNetV2, VGG16, VGG19, InceptionV3, InceptionResNetV2, Xception, DenseNet121, ResNet50, ResNet50V2, and EfficientNet (ranging from EfficientNetB0 to EfficientNetB6), were implemented to assess and analyze their performance in classifying Diabetic Retinopathy. The dataset comprised 3662 Fundus retina images. Prior to training, the networks underwent pre-training using the ImageNet database, with a Gaussian filter applied to the images as a preprocessing step. As a result, the Efficient-Net stands out for achieving the best performance results with a good balance between model size and computational efficiency. By utilizing the EfficientNetB2 network, a model was trained with an accuracy of 85% and a screening capability of 98% for Diabetic Retinopathy. This model holds the potential to be implemented during the screening stages of Diabetic Retinopathy, aiding in the early identification of individuals at risk.

  • 17
  • 30