Publicacoes - INESC TEC

Publicações

Publicações por HASLab

2024

SHORT: Evaluating Tools for Enhancing Reproducibility in Computational Scientific Experiments

Autores
Costa, L; Barbosa, S; Cunha, J;

Publicação
PROCEEDINGS OF THE 2ND ACM CONFERENCE ON REPRODUCIBILITY AND REPLICABILITY, ACM REP 2024

Abstract
Ensuring the reproducibility of computational scientific experiments is crucial for advancing research and fostering scientific integrity. However, achieving reproducibility poses significant challenges, particularly in the absence of appropriate software tools to help. This paper addresses this issue by comparing existing tools designed to assist researchers across various fields in achieving reproducibility in their work. We were able to successfully run eight tools and execute them to reproduce three existing experiments from different domains. Our findings show the critical role of technical choices in shaping the capabilities of these tools for reproducibility efforts. By evaluating these tools for replicating experiments, we contribute insights into the current landscape of reproducibility support in scientific research. Our analysis offers guidance for researchers seeking appropriate tools to enhance the reproducibility of their experiments, highlighting the importance of informed technical decisions in facilitating reproducibility across diverse domains.

FecharLer Abstract

2024

Programmer User Studies: Supporting Tools & Features

Autores
Costa, L; Barbosa, S; Cunha, J;

Publicação
2024 IEEE SYMPOSIUM ON VISUAL LANGUAGES AND HUMAN-CENTRIC COMPUTING, VL/HCC 2024

Abstract
User studies are paramount for advancing science. In particular, the empirical evaluation of programmer-oriented tools is important to validate research ideas and prototypes, as well as production-ready tools. Previous research has collected several tools used by the software engineering and behavioral science communities to design and run studies. In this work, we study tools used in software engineering studies and identify their features. Furthermore, we analyze three behavioral science experiment tools to identify design ideas that might be adapted to programmer user studies. With this work, we present the set of features currently offered by software engineering tools to support researchers in the design and execution of programmer user studies. We also present the characteristics of some tools used in behavioral science experiments to identify design ideas that can be adapted to programmer user studies.

FecharLer Abstract

2024

Characterizing Data Scientists in the Real World

Autores
Pereira, P; Cunha, J; Fernandes, JP;

Publicação
CoRR

Abstract

2024

A review on the decarbonization of high-performance computing centers

Autores
Silva, CA; Vilaça, R; Pereira, A; Bessa, RJ;

Publicação
RENEWABLE & SUSTAINABLE ENERGY REVIEWS

Abstract
High-performance computing relies on performance-oriented infrastructures with access to powerful computing resources to complete tasks that contribute to solve complex problems in society. The intensive use of resources and the increase in service demand due to emerging fields of science, combined with the exascale paradigm, climate change concerns, and rising energy costs, ultimately means that the decarbonization of these centers is key to improve their environmental and financial performance. Therefore, a review on the main opportunities and challenges for the decarbonization of high-performance computing centers is essential to help decision-makers, operators and users contribute to a more sustainable computing ecosystem. It was found that state-of-the-art supercomputers are growing in computing power, but are combining different measures to meet sustainability concerns, namely going beyond energy efficiency measures and evolving simultaneously in terms of energy and information technology infrastructure. It was also shown that policy and multiple entities are now targeting specifically HPC, and that identifying synergies with the energy sector can reveal new revenue streams, but also enable a smoother integration of these centers in energy systems. Computing-intensive users can continue to pursue their scientific research, but participating more actively in the decarbonization process, in cooperation with computing service providers. Overall, many opportunities, but also challenges, were identified, to decrease carbon emissions in a sector mostly concerned with improving hardware performance.

FecharLer Abstract Ler Publicação Completa

2024

To FID or not to FID: Applying GANs for MRI Image Generation in HPC

Autores
Cepa, B; Brito, C; Sousa, A;

Publicação

Abstract
AbstractWith the rapid growth of Deep Learning models and neural networks, the medical data available for training – which is already significantly less than other types of data – is becoming scarce. For that purpose, Generative Adversarial Networks (GANs) have received increased attention due to their ability to synthesize new realistic images. Our preliminary work shows promising results for brain MRI images; however, there is a need to distribute the workload, which can be supported by High-Performance Computing (HPC) environments. In this paper, we generate 256×256 MRI images of the brain in a distributed setting. We obtained an FIDRadImageNetof 10.67 for the DCGAN and 23.54 for the WGAN-GP, which are consistent with results reported in several works published in this scope. This allows us to conclude that distributing the GAN generation process is a viable option to overcome the computational constraints imposed by these models and, therefore, facilitate the generation of new data for training purposes.

FecharLer Abstract Ler Publicação Completa

2024

MAC: An Artifact Correction Framework for Brain MRI based on Deep Neural Networks

Autores
Oliveira, A; Cepa, B; Brito, C; Sousa, A;

Publicação

Abstract
AbstractThe correction of artifacts in Magnetic Resonance Imaging (MRI) is crucial due to physiological phenomena and technical issues affecting diagnostic quality. Reverting from corrupted to artifact-free images is a complex task. Deep Learning (DL) models have been employed to preserve data characteristics and to identify and correct those artifacts. We proposeMAC, a novel DL-based solution to correct artifacts in multi-contrast brain MRI scans.MACoffers two models: the simulation and the correction models. The simulation model introduces perturbations similar to those occurring in an exam while preserving the original image as ground truth; this is required as publicly available datasets rarely have motion-corrupted images. It allows the addition of three types of artifacts with different degrees of severity. The DL-based correction model adds a fourth contrast to state-of-the-art solutions while improving the overall performance of the models.MACachieved the highest results in the FLAIR contrast, with a Structural Similarity Index Measure (SSIM) of 0.9803 and a Normalized Mutual Information (NMI) of 0.8030. Moreover, the model reduced training time by 63% compared to its predecessor.MACmodel can correct large volumes of images faster and adapt to different levels of artifact severity than current state-ofthe-art models, allowing for better diagnosis.

FecharLer Abstract Ler Publicação Completa