Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
About

About

I collaborated on the European Community-funded GORDA project, whose goal was to foster database replication as a means to address the challenges of current database systems. I worked on providing GORDA with a web-based graphical monitoring and management console that leverages JMX capabilities. 

I was also involved in the P-SON project, specifically in developing applications for the NeEM epidemic multicast protocol. In this context, I worked on a RSS feed caching and dissemination architecture that leverages p2p networks to accomplish its task and later on a generic P2P content-push architecture based on Web feeds and social network services, which served as a basis for my Master’s dissertation work. I also worked on an epidemic dissemination protocol that takes advantage of participants’ shared interests without actually disclosing them. 

In the context of my Ph.D, my work was focused on how to bridge the gap between traditional database-centric applications and the promises of high availability. 

Currently, I'm a part of the CloudDBAppliance european project, and also have been collaborating with CPES in the context of the UPGRID and InteGRID european projects. 

Interest
Topics
Details

Details

  • Name

    Ana Nunes Alonso
  • Role

    Assistant Researcher
  • Since

    01st February 2012
007
Publications

2023

Privacy-Preserving Machine Learning in Life Insurance Risk Prediction

Authors
Pereira, K; Vinagre, J; Alonso, AN; Coelho, F; Carvalho, M;

Publication
MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT II

Abstract
The application of machine learning to insurance risk prediction requires learning from sensitive data. This raises multiple ethical and legal issues. One of the most relevant ones is privacy. However, privacy-preserving methods can potentially hinder the predictive potential of machine learning models. In this paper, we present preliminary experiments with life insurance data using two privacy-preserving techniques: discretization and encryption. Our objective with this work is to assess the impact of such privacy preservation techniques in the accuracy of ML models. We instantiate the problem in three general, but plausible Use Cases involving the prediction of insurance claims within a 1-year horizon. Our preliminary experiments suggest that discretization and encryption have negligible impact in the accuracy of ML models.

2023

TADA: A Toolkit for Approximate Distributed Agreement

Authors
da Conceição, EL; Alonso, AN; Oliveira, RC; Pereira, JO;

Publication
Distributed Applications and Interoperable Systems - 23rd IFIP WG 6.1 International Conference, DAIS 2023, Held as Part of the 18th International Federated Conference on Distributed Computing Techniques, DisCoTec 2023, Lisbon, Portugal, June 19-23, 2023, Proceedings

Abstract

2023

TiQuE: Improving the Transactional Performance of Analytical Systems for True HybridWorkloads

Authors
Faria, N; Pereira, J; Alonso, AN; Vilaca, R; Koning, Y; Nes, N;

Publication
PROCEEDINGS OF THE VLDB ENDOWMENT

Abstract
Transactions have been a key issue in database management for a long time and there are a plethora of architectures and algorithms to support and implement them. The current state-of-the-art is focused on storage management and is tightly coupled with its design, leading, for instance, to the need for completely new engines to support new features such as Hybrid Transactional Analytical Processing (HTAP). We address this challenge with a proposal to implement transactional logic in a query language such as SQL. This means that our approach can be layered on existing analytical systems but that the retrieval of a transactional snapshot and the validation of update transactions runs in the server and can take advantage of advanced query execution capabilities of an optimizing query engine. We demonstrate our proposal, TiQuE, on MonetDB and obtain an average 500x improvement in transactional throughput while retaining good performance on analytical queries, making it competitive with the state-of-the-art HTAP systems.

2023

LOOM: A Closed-Box Disaggregated Database System

Authors
Coelho, F; Alonso, AN; Ferreira, L; Pereira, J; Oliveira, R;

Publication
PROCEEDINGS OF12TH LATIN-AMERICAN SYMPOSIUM ON DEPENDABLE AND SECURE COMPUTING, LADC 2023

Abstract
Cloud native database systems provide highly available and scalable services as part of cloud platforms by transparently replicating and partitioning data across automatically managed resources. Some systems, such as Google Spanner, are designed and implemented from scratch. Others, such as Amazon Aurora, derive from traditional database systems for better compatibility but disaggregate storage to cloud services. Unfortunately, because they follow an open-box approach and fork the original code base, they are difficult to implement and maintain. We address this problem with Loom, a replicated and partitioned database system built on top of PostgreSQL that delegates durable storage to a distributed log native to the cloud. Unlike previous disaggregation proposals, Loom is a closed-box approach that uses the original server through existing interfaces to simplify implementation and improve robustness and maintainability. Experimental evaluation achieves 6x higher throughput and 5x lower response time than standard replication and competes with the state of the art in cloud and HPC hardware.

2022

A Blockchain-based Data Market for Renewable Energy Forecasts

Authors
Coelho, F; Silva, F; Goncalves, C; Bessa, R; Alonso, A;

Publication
2022 FOURTH INTERNATIONAL CONFERENCE ON BLOCKCHAIN COMPUTING AND APPLICATIONS (BCCA)

Abstract
This paper presents a data market aimed at trading energy forecasts data. The system architecture is built using blockchain as a service, allowing access to data streams and establishing a distributed settlement between stakeholders. Energy Forecasts data is presented as the commodity traded in the market, whose settlement is provided through the blockchain on the basis of the extracted value provided by market stakeholders. Our proposal allows market stakeholders to acquire energy forecasts and pay according to the data accuracy, solving the confidentiality problem of freely sharing data. A data quality reward is introduced, steering the compensation sent to market participants. The data market design is presented and an evaluation campaign is performed, showing that the data market produced functionally valid results in comparison with the results achieved with a central simulated approach. Moreover, results show that the data market architecture is able to scale.

Supervised
thesis

2022

Machine Learning and Deep Learning Algorithms to Correct and Classify Product Reviews in a Marketplace

Author
Licinio Daniel Gomes Carvalho

Institution
UP-FEUP

2022

Interpretação e execução de SQL sobre ficheiros

Author
Bruno Filipe de Sousa Dias

Institution
UM

2021

Acordo Distribuído Aproximado

Author
Joaquim Manuel Gonçalves Oliveira

Institution
UM

2020

Acordo Distribuído Aproximado

Author
Joaquim Manuel Gonçalves Oliveira

Institution
UM

2018

Estado do Design: Debates críticos à Prática do Design

Author
Ana Margarida Leite Pegada Olo

Institution
UP-FBAUP