Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por CRIIS

2024

Identification and Detection in Building Images of Biological Growths - Prevent a Health Issue

Autores
Pereira, S; Cunha, A; Pinto, J;

Publicação
WIRELESS MOBILE COMMUNICATION AND HEALTHCARE, MOBIHEALTH 2023

Abstract
Building rehabilitation is a reality, and all phases of rehabilitation work need to be efficiently sustainable and promote healthy places to live in. Current procedures for assessing construction conditions are time-consuming, laborious and expensive and pose threats to the health and safety of engineers, especially when inspecting locations that are not easy to access. In the initial step, a survey of the condition of the building is carried out, which subsequently implies the elaboration of a report on existing pathologies, intervention solutions, and associated costs. This survey involves an inspection of the site (through photographs and videos). Also, biological growth can threaten the humans inhabiting the houses. The World Health Organization states that the most important effects are increased prevalences of respiratory symptoms, allergies and asthma, as well as perturbation of the immunological system. This work aims to alert to this fact and contribute to detecting and locating biological growth (BG) defects automatically in images of the facade of buildings. To make this possible, we need a dataset of images of building components with and without biological growths. At this moment, that database doesn't exist. So, we need to construct that dataset to use deep learning models in the future. This paper also identifies the steps to do that work and presents some real cases of building facades with BG and solutions to repair those defects. The conclusions and the future works are identified.

2024

Informative Classification of Capsule Endoscopy Videos Using Active Learning

Autores
Fonseca, F; Nunes, B; Salgado, M; Silva, A; Cunha, A;

Publicação
WIRELESS MOBILE COMMUNICATION AND HEALTHCARE, MOBIHEALTH 2023

Abstract
The wireless capsule endoscopy is a non-invasive imaging method that allows observation of the inner lumen of the small intestine, but with the cost of a longer duration to process its resulting videos. Therefore, the scientific community has developed several machine learning strategies to help reduce that duration. Such strategies are typically trained and evaluated on small sets of images, ultimately not proving to be efficient when applied to full videos. Labelling full Capsule Endoscopy videos requires significant effort, leading to a lack of data on this medical area. Active learning strategies allow intelligent selection of datasets from a vast set of unlabelled data, maximizing learning and reducing annotation costs. In this experiment, we have explored active learning methods to reduce capsule endoscopy videos' annotation effort by compiling smaller datasets capable of representing their content.

2024

Deep Learning Model Evaluation and Insights in Inherited Retinal Disease Detection

Autores
Ferreira, H; Marta, A; Couto, I; Camara, J; Beirao, JM; Cunha, A;

Publicação
WIRELESS MOBILE COMMUNICATION AND HEALTHCARE, MOBIHEALTH 2023

Abstract
Inherited retinal diseases such as Retinitis Pigmentosa and Stargardt's disease are genetic conditions that cause the photoreceptors in the retina to deteriorate over time. This can lead to vision symptoms such as tubular vision, loss of central vision, and nyctalopia (difficulty seeing in low light) or photophobia (high light). Timely healthcare intervention is critical, as most forms of these conditions are currently untreatable and usually focused on minimizing further vision loss. Machine learning (ML) algorithms can play a crucial role in the detection of retinal diseases, especially considering the recent advancements in retinal imaging devices and the limited availability of public datasets on these diseases. These algorithms have the potential to help researchers gain new insights into disease progression from previous classified eye scans and genetic profiles of patients. In this work, multi-class identification between the retinal diseases Retinitis Pigmentosa, Stargardt Disease, and Cone-Rod Dystrophy was performed using three pretrained models, ResNet101, ResNet50, and VGG19 as baseline models, after shown to be effective in our computer vision task. These models were trained and validated on two datasets of autofluorescent retinal images, the first containing raw data, and the second dataset was improved with cropping to obtain better results. The best results were achieved using the ResNet101 model on the improved dataset with an Accuracy (Acc) of 0.903, an Area under the ROC Curve (AUC) of 0.976, an F1-Score of 0.897, a Recall (REC) of 0.903, and a Precision (PRE) of 0.910. To further assess the reliability of these models for future data, an Explainable AI (XAI) analysis was conducted, employing Grad-Cam. Overall, the study showed promising capabilities of Deep Learning for the diagnosis of retinal diseases using medical imaging.

2024

A Comparative Analysis of EfficientNet Architectures for Identifying Anomalies in Endoscopic Images

Autores
Pessoa, CP; Quintanilha, BP; de Almeida, JDS; Braz, G; de Paiva, C; Cunha, A;

Publicação
International Conference on Enterprise Information Systems, ICEIS - Proceedings

Abstract
The gastrointestinal tract is part of the digestive system, fundamental to digestion. Digestive problems can be symptoms of chronic illnesses like cancer and should be treated seriously. Endoscopic exams in the tract make detecting these diseases in their initial stages possible, enabling an effective treatment. Modern endoscopy has evolved into the Wireless Capsule Endoscopy procedure, where patients ingest a capsule with a camera. This type of exam usually exports videos up to 8 hours in length. Support systems for specialists to detect and diagnose pathologies in this type of exam are desired. This work uses a rarely used dataset, the ERS dataset, containing 121.399 labelled images, to evaluate three models from the EfficientNet family of architectures for the binary classification of Endoscopic images. The models were evaluated in a 5-fold cross-validation process. In the experiments, the best results were achieved by EfficientNetB0, achieving average accuracy and F1-Score of, respectively, 77.29% and 84.67%. Copyright © 2024 by SCITEPRESS – Science and Technology Publications, Lda.

2024

Enhancing Image Annotation With Object Tracking and Image Retrieval: A Systematic Review

Autores
Fernandes, R; Pessoa, A; Salgado, M; de Paiva, A; Pacal, I; Cunha, A;

Publicação
IEEE ACCESS

Abstract
Effective image and video annotation is a fundamental pillar in computer vision and artificial intelligence, crucial for the development of accurate machine learning models. Object tracking and image retrieval techniques are essential in this process, significantly improving the efficiency and accuracy of automatic annotation. This paper systematically investigates object tracking and image acquisition techniques. It explores how these technologies can collectively enhance the efficiency and accuracy of the annotation processes for image and video datasets. Object tracking is examined for its role in automating annotations by tracking objects across video sequences, while image retrieval is evaluated for its ability to suggest annotations for new images based on existing data. The review encompasses diverse methodologies, including advanced neural networks and machine learning techniques, highlighting their effectiveness in various contexts like medical analyses and urban monitoring. Despite notable advancements, challenges such as algorithm robustness and effective human-AI collaboration are identified. This review provides valuable insights into these technologies' current state and future potential in improving image annotation processes, even showing existing applications of these techniques and their full potential when combined.

2024

Enhancing EfficientNetv2 with global and efficient channel attention mechanisms for accurate MRI-Based brain tumor classification

Autores
Pacal, I; Celik, O; Bayram, B; Cunha, A;

Publicação
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS

Abstract
The early and accurate diagnosis of brain tumors is critical for effective treatment planning, with Magnetic Resonance Imaging (MRI) serving as a key tool in the non-invasive examination of such conditions. Despite the advancements in Computer-Aided Diagnosis (CADx) systems powered by deep learning, the challenge of accurately classifying brain tumors from MRI scans persists due to the high variability of tumor appearances and the subtlety of early-stage manifestations. This work introduces a novel adaptation of the EfficientNetv2 architecture, enhanced with Global Attention Mechanism (GAM) and Efficient Channel Attention (ECA), aimed at overcoming these hurdles. This enhancement not only amplifies the model's ability to focus on salient features within complex MRI images but also significantly improves the classification accuracy of brain tumors. Our approach distinguishes itself by meticulously integrating attention mechanisms that systematically enhance feature extraction, thereby achieving superior performance in detecting a broad spectrum of brain tumors. Demonstrated through extensive experiments on a large public dataset, our model achieves an exceptional high-test accuracy of 99.76%, setting a new benchmark in MRI-based brain tumor classification. Moreover, the incorporation of Grad-CAM visualization techniques sheds light on the model's decision-making process, offering transparent and interpretable insights that are invaluable for clinical assessment. By addressing the limitations inherent in previous models, this study not only advances the field of medical imaging analysis but also highlights the pivotal role of attention mechanisms in enhancing the interpretability and accuracy of deep learning models for brain tumor diagnosis. This research sets the stage for advanced CADx systems, enhancing patient care and treatment outcomes.

  • 53
  • 386