About

I am an associate professor at the Department of Computer Science of the Faculty of Sciences of the University of Porto and coordinator of LIAAD, the Laboratory of Artificial Intelligence and Decision Support of the University of Porto. LIAAD has been a centre of INESC TEC since 2007. I hold a PhD in Computer Science from the University of Porto, an MSc in Foundations of Advanced Information Technology from Imperial College, and a Licenciatura degree in Applied Mathematics, Computer Science branch (University of Porto). My research interests are Data Mining and Machine Learning, in particular association rules, text mining, and recommender systems. My earlier research includes inductive logic programming and collaborative data mining. I teach courses related to programming, information processing, data mining, and other areas of computing. While at the Faculty of Economics, where I was from 1996 to 2009, I launched, with other colleagues, the Master's in Data Analysis and Decision Support Systems (MADSAD), which I coordinated from 2000 to April 2008. I lead projects in data mining and web intelligence. I was director of the Master's in Computer Science at DCC-FCUP from June 2010 to August 2013. I co-organized international conferences (ECML/PKDD 2015, Discovery Science 2009, ECML/PKDD 05, and EPIA 01), workshops, and seminars in data mining and artificial intelligence. I was Vice-President of APPIA, the Portuguese Association for Artificial Intelligence.

Details

  • Name

    Alípio Jorge
  • Role

    Centre Coordinator
  • Since

    1 January 2008
Publications

2026

Resilience Under Attack: Benchmarking Optimizers Against Poisoning in Federated Learning for Image Classification Using CNN

Authors
Biadgligne, Y; Baghoussi, Y; Li, K; Jorge, A;

Publication
ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2025, PT I

Abstract
Federated Learning (FL) enables decentralized model training while preserving data privacy but remains susceptible to poisoning attacks. Malicious clients can manipulate local data or model updates, threatening FL's reliability, especially in privacy-sensitive domains like healthcare and finance. While client-side optimization algorithms play a crucial role in training local models, their resilience to such attacks is underexplored. This study empirically evaluates the robustness of three widely used optimization algorithms (SGD, Adam, and RMSProp) against label-flipping attacks (LFAs) in image classification tasks using Convolutional Neural Networks (CNNs). Through 900 individual runs in both federated and centralized learning (CL) settings, we analyze their performance under Independent and Identically Distributed (IID) and Non-IID data distributions. Results reveal that SGD is the most resilient, achieving the highest accuracy in 87% of cases, while Adam performs best in 13%. Additionally, centralized models outperform FL on CIFAR-10, whereas FL excels on Fashion-MNIST, highlighting the impact of dataset characteristics on adversarial robustness.

2026

Knowledge-Aware Clinical Narrative Extraction Using Ontologies and Knowledge Graphs

Authors
Leite, M; Rb Silva, R; Guimaraes, N; Stork, L; Jorge, A;

Publication
PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2025, PT I

Abstract
Providing healthcare professionals with quick access to structured standardized information enables comprehensive analysis and improves clinical decision-making. However, an important part of the records in health institutions is in the form of free text. This paper proposes a pipeline that automatically extracts medical information from Electronic Medical Records (EMRs), based on large language models (LLMs) and a domain ontology defined and validated in collaboration with a medical expert. The output is a knowledge graph of clinical narratives that can be used to search through repositories of EMRs or discover new facts. We showcase our approach on a set of Portuguese clinical texts of cases of Acute Myeloid Leukemia (AML) guided by one medical expert. We evaluate the quality of the extraction and of the knowledge graph.

2026

LLM-Based Framework for Synthetic Data Generation in Portuguese Clinical NER

Authors
Henriques, L; Guimaraes, N; Jorge, A;

Publication
PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2025, PT I

Abstract
The ever-increasing volume of data produced in Healthcare demands solutions capable of automatically extracting the relevant elements of their narratives. However, given privacy regulations, bureaucratic procedures, and annotation efforts, the development of said solutions via Natural Language Processing (NLP) systems becomes hindered due to training data scarcity. Such scarcity increases when we consider languages and language varieties with lower resource availability, such as European and Brazilian Portuguese. To address this problem, we propose a Large Language Model (LLM)-based SDG (Synthetic Data Generation) framework to generate and annotate synthetic clinical texts for medical Named-Entity Recognition (NER). The SDG framework consists of a system/user prompt augmented with real examples, powered by GPT-4o. Our results show that, by feeding the framework few real clinical annotated texts, we can generate synthetic data capable of increasing the performance of NER models with respect to their non-augmented counterparts. In addition, the reduction of the BLEU scores in the generated texts indicates a decrease in the risk of privacy disclosure while ensuring greater lexical diversity. These results highlight the potential of synthetic data as a solution to overcome human annotation bottlenecks and privacy concerns, laying the groundwork for future research in clinical NLP across tasks, domains, and low-resource languages.

2026

Machine Learning and Knowledge Discovery in Databases. Research Track and Applied Data Science Track - European Conference, ECML PKDD 2025, Porto, Portugal, September 15-19, 2025, Proceedings, Part VIII

Authors
Pfahringer, B; Japkowicz, N; Larrañaga, P; Ribeiro, RP; Dutra, I; Pechenizkiy, M; Cortez, P; Pashami, S; Jorge, AM; Soares, C; Abreu, PH; Gama, J;

Publication
ECML/PKDD (8)


2026

Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track and Demo Track - European Conference, ECML PKDD 2025, Porto, Portugal, September 15-19, 2025, Proceedings, Part X

Authors
Dutra, I; Pechenizkiy, M; Cortez, P; Pashami, S; Pasquali, A; Moniz, N; Jorge, AM; Soares, C; Abreu, PH; Gama, J;

Publication
ECML/PKDD (10)
