Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
About

About

I am a Lecturer at the Department of Informatics at the University of Minho. I am also a researcher at HASLab/INESC TEC. My research interest focus mainly on machine learning and data mining. Occasionally, I participate in Bioinformatics research projects involving analysis of molecular dynamic simulations of protein folding/unfolding.

 I hold a PhD in Computing from Imperial College (University of London) where I did research in logic programming. I have been working on the development of association rules mining algorithms and novel patterns to capture distribution learning. I also have interest in social network analysis, graph mining, subgroup mining and motif discovery in time series.

Interest
Topics
Details

Details

  • Name

    Paulo Jorge Azevedo
  • Cluster

    Computer Science
  • Role

    Senior Researcher
  • Since

    01st November 2011
003
Publications

2023

Subgroup mining for performance analysis of regression models

Authors
Pimentel, J; Azevedo, PJ; Torgo, L;

Publication
EXPERT SYSTEMS

Abstract

2020

Sequence Mining for Automatic Generation of Software Tests from GUI Event Traces

Authors
Oliveira, A; Freitas, R; Jorge, A; Amorim, V; Moniz, N; Paiva, ACR; Azevedo, PJ;

Publication
Intelligent Data Engineering and Automated Learning - IDEAL 2020 - 21st International Conference, Guimaraes, Portugal, November 4-6, 2020, Proceedings, Part II

Abstract

2018

Preference rules for label ranking: Mining patterns in multi-target relations

Authors
de Sa, CR; Azevedo, P; Soares, C; Jorge, AM; Knobbe, A;

Publication
INFORMATION FUSION

Abstract
In this paper, we investigate two variants of association rules for preference data, Label Ranking Association Rules and Pairwise Association Rules. Label Ranking Association Rules (LRAR) are the equivalent of Class Association Rules (CAR) for the Label Ranking task. In CAR, the consequent is a single class, to which the example is expected to belong to. In LRAR, the consequent is a ranking of the labels. The generation of LRAR requires special support and confidence measures to assess the similarity of rankings. In this work, we carry out a sensitivity analysis of these similarity-based measures. We want to understand which datasets benefit more from such measures and which parameters have more influence in the accuracy of the model. Furthermore, we propose an alternative type of rules, the Pairwise Association Rules (PAR), which are defined as association rules with a set of pairwise preferences in the consequent. While PAR can be used both as descriptive and predictive models, they are essentially descriptive models. Experimental results show the potential of both approaches.

2018

Discovering a taste for the unusual: exceptional models for preference mining

Authors
de Sa, CR; Duivesteijn, W; Azevedo, P; Jorge, AM; Soares, C; Knobbe, A;

Publication
MACHINE LEARNING

Abstract
Exceptional preferences mining (EPM) is a crossover between two subfields of data mining: local pattern mining and preference learning. EPM can be seen as a local pattern mining task that finds subsets of observations where some preference relations between labels significantly deviate from the norm. It is a variant of subgroup discovery, with rankings of labels as the target concept. We employ several quality measures that highlight subgroups featuring exceptional preferences, where the focus of what constitutes ‘exceptional’ varies with the quality measure: two measures look for exceptional overall ranking behavior, one measure indicates whether a particular label stands out from the rest, and a fourth measure highlights subgroups with unusual pairwise label ranking behavior. We explore a few datasets and compare with existing techniques. The results confirm that the new task EPM can deliver interesting knowledge. © 2018 The Author(s)

2015

Automatically estimating iSAX parameters

Authors
Castro, NC; Azevedo, PJ;

Publication
INTELLIGENT DATA ANALYSIS

Abstract
The Symbolic Aggregate Approximation (iSAX) is widely used in time series data mining. Its popularity arises from the fact that it largely reduces time series size, it is symbolic, allows lower bounding and is space efficient. However, it requires setting two parameters: the symbolic length and alphabet size, which limits the applicability of the technique. The optimal parameter values are highly application dependent. Typically, they are either set to a fixed value or experimentally probed for the best configuration. In this work we propose an approach to automatically estimate iSAX's parameters. The approach - AutoiSAX - not only discovers the best parameter setting for each time series in the database, but also finds the alphabet size for each iSAX symbol within the same word. It is based on simple and intuitive ideas from time series complexity and statistics. The technique can be smoothly embedded in existing data mining tasks as an efficient sub-routine. We analyze its impact in visualization interpretability, classification accuracy and motif mining. Our contribution aims to make iSAX a more general approach as it evolves towards a parameter-free method.

Supervised
thesis

2022

Formalization of Deep Learning Techniques with the Why3 Proof Platform

Author
Márcio Alexandre Mota Sousa

Institution
UM

2021

Interpretabilidade em Aprendizagem Máquina num Contexto de Modelos de Regressão Caixa Negra

Author
João Pedro Torres Pimentel

Institution
UM

2020

Active Learning for fraud Detection

Author
Miguel Lobo Pinto Leite

Institution
UM

2019

Active Learning and Intelligent Queues

Author
Miguel Lobo Pinto Leite

Institution
UM