Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por Miriam Seoane Santos

2016

Predicting Breast Cancer Recurrence Using Machine Learning Techniques: A Systematic Review

Autores
Abreu, PH; Santos, MS; Abreu, MH; Andrade, B; Silva, DC;

Publicação
ACM COMPUTING SURVEYS

Abstract
Background: Recurrence is an important cornerstone in breast cancer behavior, intrinsically related to mortality. In spite of its relevance, it is rarely recorded in the majority of breast cancer datasets, which makes research in its prediction more difficult. Objectives: To evaluate the performance of machine learning techniques applied to the prediction of breast cancer recurrence. Material and Methods: Revision of published works that used machine learning techniques in local and open source databases between 1997 and 2014. Results: The revision showed that it is difficult to obtain a representative dataset for breast cancer recurrence and there is no consensus on the best set of predictors for this disease. High accuracy results are often achieved, yet compromising sensitivity. The missing data and class imbalance problems are rarely addressed and most often the chosen performance metrics are inappropriate for the context. Discussion and Conclusions: Although different techniques have been used, prediction of breast cancer recurrence is still an open problem. The combination of different machine learning techniques, along with the definition of standard predictors for breast cancer recurrence seem to be the main future directions to obtain better results.

2018

Improving the Classifier Performance in Motor Imagery Task Classification: What are the steps in the classification process that we should worry about?

Autores
Santos, MS; Abreu, PH; Rodriguez Bermudez, G; Garcia Laencina, PJ;

Publicação
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS

Abstract
Brain-Computer Interface systems based on motor imagery are able to identify an individual's intent to initiate control through the classification of encephalography patterns. Correctly classifying such patterns is instrumental and strongly depends in a robust machine learning block that is able to properly process the features extracted from a subject's encephalograms. The main objective of this work is to provide an overall view on machine learning stages, aiming to answer the following question: "What are the steps in the classification process that we should worry about?". The obtained results suggest that future research in the field should focus on two main aspects: exploring techniques for dimensionality reduction, in particular, supervised linear approaches, and evaluating adequate validation schemes to allow a more precise interpretation of results.

2022

The identification of cancer lesions in mammography images with missing pixels: analysis of morphology

Autores
Santos, JC; Abreu, PH; Santos, MS;

Publicação
2022 IEEE 9TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA)

Abstract
The quality of mammography images is essential for the diagnosis of breast cancer and image imputation has become a popular technique to overcome noise, artifacts, and missing data to aid in the diagnosis of diseases. In this paper, we assess the performance of six imputation methodologies for the reconstruction of missing pixels in different morphologies in mammography images. The images included in this study are collected from four public datasets (CBIS-DDSM, Mini-MIAS, INbreast, and CSAW) and the imputation results are evaluated through the mean absolute error (MAE) and structural similarity index measure (SSIM). This study goes beyond the traditional evaluation of imputation algorithms, analyzing imputation quality, morphology preservation and classification performance. The effects of imputation on the morphology of cancer lesions are of utmost importance since it lays the foundation for physicians to interpret and analyze the imputation results. The results show that DIP is the most promising methodology for higher missing pixel rates, morphology preservation, and classifying malignant and benign images.

2021

FAWOS: Fairness-Aware Oversampling Algorithm Based on Distributions of Sensitive Attributes

Autores
Salazar, T; Santos, MS; Araujo, H; Abreu, PH;

Publicação
IEEE ACCESS

Abstract
With the increased use of machine learning algorithms to make decisions which impact people's lives, it is of extreme importance to ensure that predictions do not prejudice subgroups of the population with respect to sensitive attributes such as race or gender. Discrimination occurs when the probability of a positive outcome changes across privileged and unprivileged groups defined by the sensitive attributes. It has been shown that this bias can be originated from imbalanced data contexts where one of the classes contains a much smaller number of instances than the other classes. It is also important to identify the nature of the imbalanced data, including the characteristics of the minority classes' distribution. This paper presents FAWOS: a Fairness-Aware oversampling algorithm which aims to attenuate unfair treatment by handling sensitive attributes' imbalance. We categorize different types of datapoints according to their local neighbourhood with respect to the sensitive attributes, identifying which are more difficult to learn by the classifiers. In order to balance the dataset, FAWOS oversamples the training data by creating new synthetic datapoints using the different types of datapoints identified. We test the impact of FAWOS on different learning classifiers and analyze which can better handle sensitive attribute imbalance. Empirically, we observe that this algorithm can effectively increase the fairness results of the classifiers while not neglecting the classification performance. Source code can be found at: https://github.com/teresalazar13/FAWOS

2018

Missing Data Imputation via Denoising Autoencoders: The Untold Story

Autores
Costa, AF; Santos, MS; Soares, JP; Abreu, PH;

Publicação
IDA

Abstract
Missing data consists in the lack of information in a dataset and since it directly influences classification performance, neglecting it is not a valid option. Over the years, several studies presented alternative imputation strategies to deal with the three missing data mechanisms, Missing Completely At Random, Missing At Random and Missing Not At Random. However, there are no studies regarding the influence of all these three mechanisms on the latest high-performance Artificial Intelligence techniques, such as Deep Learning. The goal of this work is to perform a comparison study between state-of-the-art imputation techniques and a Stacked Denoising Autoencoders approach. To that end, the missing data mechanisms were synthetically generated in 6 different ways; 8 different imputation techniques were implemented; and finally, 33 complete datasets from different open source repositories were selected. The obtained results showed that Support Vector Machines imputation ensures the best classification performance while Multiple Imputation by Chained Equations performs better in terms of imputation quality. © Springer Nature Switzerland AG 2018.

2020

Bone scintigraphy and PET-CT: A necessary alliance for bone metastasis detection in breast cancer?

Autores
Santos, JC; Abreu, MH; Santos, MS; Duarte, H; Alpoim, T; Sousa, S; Abreu, PH;

Publicação
JOURNAL OF CLINICAL ONCOLOGY

Abstract
e13070 Background: Bone is one of the main sites of breast cancer metastasis. Staging of this kind of disease spread can be performed in locally advanced cases with PET-CT in conjunction with Bone Scintigraphy. The purpose of this work is to compare the efficiency of bone metastasis detection between PET-CT and bone scintigraphy. Methods: Prospective analysis of locally advanced breast cancer patients treated in a Comprehensive Cancer Center between 2014 and 2019 that performed PET-CT and Bone Scintigraphy in the staging. Interval between the two exams could not exceed 2 months. Clinical and pathological characteristics of the disease were collected from electronic files and independently clinical images reports were considered to evaluate the ability of each imaging modalities to identify bone disease. In discrepancy cases a re-analysis of the images by two independent nuclear physicians was performed to validate the findings. Results: We analyzed 204 cases. The majority of them had ductal carcinomas (72.5%), cT2/3 (70%), cN1/2(61.8%) and G2/3 (94.6%), luminal B- like, HER2 positive disease (49.2%). In this cohort, bone metastasis was documented in 52 (25.5%) patients. PET-CT presented 97.0% of accuracy, surpassing the 94.1% presented by Bone Scintigraphy. The latter failed to correctly detect bone metastasis in 11 (5.4%) patients and only outperformed PET-CT in 3 (1.5%) patients. The main difference between the two modalities was the non-detection of cranium lesions in PET-CT images. Conclusions: PET-CT showed higher efficiency in bone metastasis detection than Bone Scintigraphy, probably because it detects lytic lesions. The non-detection of cranium ones can be harmful and so modifications in the image acquisition are required to improve the quality of PET-CT, avoiding other exams in bone staging.

  • 6
  • 7