Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
About

About

I am an associate professor at the Department of Computer Science of the Faculty of Science of the University of Porto and the coordinator of LIAAD , the Artificial Intelligence and Decision Support Lab of UP. LIAAD is a unit of INESC TEC (Laboratório Associado) since 2007. I am a PhD in Computer Science by U. Porto, MSc. on Foundations of Advanced Information Technology by the Imperial Collegeand BSc. in Applied Maths and Computer Science, currently Computer Science (U. Porto). My research interests are Data Mining and Machine Learning, in particular association rules, web and text intelligence and data mining for decision support. My past research also includes Inductive Logic Programming and Collaborative Data Mining. I lecture courses related to programming, information processing, data mining, and other areas of computing. While at the Faculty of Economics, where I stayed from 1996 to 2009, I launched, with other colleagues, the MSc. on Data Analysis and Decisison Support Systems, which I coordinated from 2000 to April 2008. I lead research projects on data mining and web intelligence. I was the director of the Masters in Computer Science at DCC-FCUP from June 2010 to August 2013. I co-chaired international conferences (ECML/PKD 2015, Discovery Science 2009, ECML/PKDD 05 and EPIA 01), workshops and seminars in data mining and artificial intelligence. I was Vice-President of APPIA the Portuguese Association for Artificial Intelligence.

Interest
Topics
Details

Details

  • Name

    Alípio Jorge
  • Role

    Centre Coordinator
  • Since

    01st January 2008
027
Publications

2026

Resilience Under Attack: Benchmarking Optimizers Against Poisoning in Federated Learning for Image Classification Using CNN

Authors
Biadgligne, Y; Baghoussi, Y; Li, K; Jorge, A;

Publication
ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2025, PT I

Abstract
Federated Learning (FL) enables decentralized model training while preserving data privacy but remains susceptible to poisoning attacks. Malicious clients can manipulate local data or model updates, threatening FL's reliability, especially in privacy-sensitive domains like healthcare and finance. While client-side optimization algorithms play a crucial role in training local models, their resilience to such attacks is underexplored. This study empirically evaluates the robustness of three widely used optimization algorithms: SGD, Adam, and RMSProp-against label-flipping attacks (LFAs) in image classification tasks using Convolutional Neural Networks (CNNs). Through 900 individual runs in both federated and centralized learning (CL) settings, we analyze their performance under Independent and Identically Distributed (IID) and Non-IID data distributions. Results reveal that SGD is the most resilient, achieving the highest accuracy in 87% of cases, while Adam performs best in 13%. Additionally, centralized models outperform FL on CIFAR-10, whereas FL excels on Fashion-MNIST, highlighting the impact of dataset characteristics on adversarial robustness.

2026

Knowledge-Aware Clinical Narrative Extraction Using Ontologies and Knowledge Graphs

Authors
Leite, M; Rb Silva, R; Guimaraes, N; Stork, L; Jorge, A;

Publication
PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2025, PT I

Abstract
Providing healthcare professionals with quick access to structured standardized information enables comprehensive analysis and improves clinical decision-making. However, an important part of the records in health institutions is in the form of free text. This paper proposes a pipeline that automatically extracts medical information from Electronic Medical Records (EMRs), based on large language models (LLMs) and a domain ontology defined and validated in collaboration with a medical expert. The output is a knowledge graph of clinical narratives that can be used to search through repositories of EMRs or discover new facts. We showcase our approach on a set of Portuguese clinical texts of cases of Acute Myeloid Leukemia (AML) guided by one medical expert. We evaluate the quality of the extraction and of the knowledge graph.

2026

LLM-Based Framework for Synthetic Data Generation in Portuguese Clinical NER

Authors
Henriques, L; Guimaraes, N; Jorge, A;

Publication
PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2025, PT I

Abstract
The ever-increasing volume of data produced in Healthcare demands solutions capable of automatically extracting the relevant elements of their narratives. However, given privacy regulations, bureaucratic procedures, and annotation efforts, the development of said solutions via Natural Language Processing (NLP) systems becomes hindered due to training data scarcity. Such scarcity increases when we consider languages and language varieties with lower resource availability, such as European and Brazilian Portuguese. To address this problem, we propose a Large Language Model (LLM)-based SDG (Synthetic Data Generation) framework to generate and annotate synthetic clinical texts for medical Named-Entity Recognition (NER). The SDG framework consists of a system/user prompt augmented with real examples, powered by GPT-4o. Our results show that, by feeding the framework few real clinical annotated texts, we can generate synthetic data capable of increasing the performance of NER models with respect to their non-augmented counterparts. In addition, the reduction of the BLEU scores in the generated texts indicates a decrease in the risk of privacy disclosure while ensuring greater lexical diversity. These results highlight the potential of synthetic data as a solution to overcome human annotation bottlenecks and privacy concerns, laying the groundwork for future research in clinical NLP across tasks, domains, and low-resource languages.

2026

Machine Learning and Knowledge Discovery in Databases. Research Track and Applied Data Science Track - European Conference, ECML PKDD 2025, Porto, Portugal, September 15-19, 2025, Proceedings, Part VIII

Authors
Pfahringer, B; Japkowicz, N; Larrañaga, P; Ribeiro, RP; Dutra, I; Pechenizkiy, M; Cortez, P; Pashami, S; Jorge, AM; Soares, C; Abreu, PH; Gama, J;

Publication
ECML/PKDD (8)

Abstract

2026

Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track and Demo Track - European Conference, ECML PKDD 2025, Porto, Portugal, September 15-19, 2025, Proceedings, Part X

Authors
Dutra, I; Pechenizkiy, M; Cortez, P; Pashami, S; Pasquali, A; Moniz, N; Jorge, AM; Soares, C; Abreu, PH; Gama, J;

Publication
ECML/PKDD (10)

Abstract