2023
Authors
Gaudio, A; Smailagic, A; Faloutsos, C; Mohan, S; Johnson, E; Liu, YH; Costa, P; Campilho, A;
Publication
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY
Abstract
Explanations of a model's biases or predictions are essential to medical image analysis. Yet, explainable machine learning approaches for medical image analysis are challenged by needs to preserve privacy of patient data, and by current trends in deep learning to use unsustainably large models and large datasets. We propose DeepFixCX for explainable and privacy-preserving medical image compression that is nimble and performant. We contribute a review of the field and a conceptual framework for simultaneous privacy and explainability via tools of compression. DeepFixCX compresses images without learning by removing or obscuring spatial and edge information. DeepFixCX is ante-hoc explainable and gives privatized post hoc explanations of spatial and edge bias without accessing the original image. DeepFixCX privatizes images to prevent image reconstruction and mitigate patient re-identification. DeepFixCX is nimble. Compression can occur on a laptop CPU or GPU to compress and privatize 1700 images per second of size 320 x 320. DeepFixCX enables use of low memory MLP classifiers for vision data; permitting small performance loss gives end-to-end MLP performance over 70x faster and batch size over 100x larger. DeepFixCX consistently improves predictive classification performance of a Deep Neural Network (DNN) by 0.02 AUC ROC on Glaucoma and Cervix Type detection datasets, and can improve multi-label chest x-ray classification performance in seven of 10 tested settings. In all three datasets, compression to less than 5% of original number of pixels gives matching or improved performance. Our main novelty is to define an explainability versus privacy problem and address it with lossy compression.This article is categorized under:Fundamental Concepts of Data and Knowledge > Explainable AICommercial, Legal, and Ethical Issues > Security and PrivacyFundamental Concepts of Data and Knowledge > Big Data Mining
2023
Authors
Cunha, C; Silva, S; Coelho, LCC; Frazão, O; Novais, S;
Publication
EPJ Web of Conferences
Abstract
2023
Authors
Rodrigues, J; Caiado, F; Fonseca, J; Silva, J; Neves, S; Moreira, A; Au-Yong-Oliveira, M; Gonçalves, R; Branco, F;
Publication
Abstract
2023
Authors
Lauro, A; Kitamura, D; Lima, W; Dias, B; Soares, T;
Publication
ENERGIES
Abstract
The Brazilian Power System is mainly composed of renewable generation from hydroelectric and wind. Hence, spot and forward electricity prices tend to represent the inherently stochastic nature of these resources, while risk management is a measure taken by agents, especially hydro power plants (HPPs) to hedge against deep financial losses. A HPP goal is to maximize its profit considering uncertainties in forward electricity prices, spot prices, and generation scaling factor (GSF) for years ahead. Therefore, the objective of this work is to simulate the real decision-making process of a HPP, where they need to have a perspective of the forward market and future spot price assessment to negotiate forward electricity contracts. To do so, the present work models the uncertainty in electricity forward prices via two-stage stochastic programming, assessing the benefits of the stochastic solution in comparison to the deterministic one. In addition, different risk aversion levels are assessed using conditional value at risk (CVaR). An important conclusion is that the results show that the greater the HPP risk aversion is, the greater the energy selling via electricity forward contracts. Moreover, the proposed model has benefits in comparison to a deterministic approach.
2023
Authors
Dunne, S; Ferreira, JF; Mendes, A; Ritchie, C; Stoddart, B; Zeyda, F;
Publication
JOURNAL OF LOGICAL AND ALGEBRAIC METHODS IN PROGRAMMING
Abstract
We present an imperative refinement language for the development of backtracking programs and discuss its semantic foundations. For expressivity, our language includes prospective values and preference - the latter being a variant of Nelson's biased choice that backtracks from infeasibility of a continuation. Our key contribution is to examine feasibility-preserving refinement as a basis for developing backtracking programs, and several key refinement laws that enable compositional refinement in the presence of non -monotonic program combinators.
2023
Authors
Rodrigues J.; Lopes C.T.;
Publication
Open Information Science
Abstract
Research data management is essential for safeguarding and prospecting data generated in a scientific context. Specific issues arise regarding data in image format, as this data typology poses particular challenges and opportunities; however, not much attention has been given to data as images. We reviewed 109 articles from several research domains where images were used either as data or metadata to understand how researchers specifically deal with this data format, and what are your habits and behaviors. We use the Web of Science (WoS), considering its five main areas of research. We included in the initial corpus the most relevant articles by research domain, selecting the ten most cited articles in WoS, by year, between 2010 and 2021. The selected articles should be in English and in open access. The results found that images have been used in scientific works numerous times, but, unfortunately, few are those in which they are the central element of the study. Photography is the type of image most used in most domains. In terms of the instruments used, the Technology and Life Sciences and Biomedicine domains use the microscope more, while the Arts and Humanities and Physical Sciences domains use the camera more. We found that the images are mostly produced in the context of the project, rather than reused by third parties. As for their collection scenario, these are mostly produced/used in a laboratory context. The overwhelming majority of the images present in the articles are digital, and only a small part is analog. We verify that Arts and Humanities are more likely to perform qualitative types of analyses, while Life Sciences and Biomedicine overwhelmingly use quantitative analyses. As for the issues of sharing and depositing, Life Sciences and Biomedicine is the domain that stands out the most in the tasks of depositing and sharing images. It was found that the licenses of a project are intrinsically related to the motivations for sharing results with third parties. Description, a fundamental step in the data management process, is neglected by a large number of researchers. The images are mostly not described or annotated and when this happens, researchers don't provide much detail about this.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.