Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Sobre

Sobre

Sou professor associado do Departamento de Ciência de Computadores da Faculdade de Ciências da Universidade do Porto e coordenador do LIAAD, Laboratório de Inteligência Artificial e de Apoio à Decisão da UP. O LIAAD é um cenrto do INESC TEC desde 2007. Sou doutor em Ciência da Computação pela U. Porto, MSc. em Fundamentos de Tecnologia de Informação Avançada pelo Imperial College e Lic. Em Matemática Aplicada ramo Ciência de Computadores (U. Porto). Os meus interesses de investigação são Extração de Conhecimento (Data Mining) e Aprendizagem Automática (Machine Learning), em particular regras de associação, text mining e sistemas de recomendação. A minha investigação anterior inclui programação em lógica indutiva e data miing colaborativo. Eu leciono cursos relacionados com programação, processamento de informação, data mining e outras áreas da computação. Enquanto na Faculdade de Economia, onde permaneci de 1996 a 2009, lancei, com outros colegas, o mestrado em Análise de Dados e Sistemas de Apoio à Decisão (MADSAD), que coordenei de 2000 a Abril de 2008. Dirijo projetos em data mining e inteligência na web. Fui diretor do Mestrado em Ciência dos Computadores no DCC-FCUP de junho de 2010 a agosto de 2013. Co-organizei conferências internacionais (ECML / PKD 2015, Discovery Science 2009, ECML / PKDD 05 e EPIA 01), workshops e seminários em data mining e inteligência artificial. Fui Vice-Presidente da APPIA Associação Portuguesa para a Inteligência Artificial.

Tópicos
de interesse
Detalhes

Detalhes

  • Nome

    Alípio Jorge
  • Cargo

    Coordenador de Centro
  • Desde

    01 janeiro 2008
025
Publicações

2025

MedLink: Retrieval and Ranking of Case Reports to Assist Clinical Decision Making

Autores
Cunha, LF; Guimarães, N; Mendes, A; Campos, R; Jorge, A;

Publicação
Advances in Information Retrieval - 47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, April 6-10, 2025, Proceedings, Part V

Abstract
In healthcare, diagnoses usually rely on physician expertise. However, complex cases may benefit from consulting similar past clinical reports cases. In this paper, we present MedLink (http://medlink.inesctec.pt), a tool that given a free-text medical report, retrieves and ranks relevant clinical case reports published in health conferences and journals, aiming to support clinical decision-making, particularly in challenging or complex diagnoses. To this regard, we trained two BERT models on the sentence similarity task: a bi-encoder for retrieval and a cross-encoder for reranking. To evaluate our approach, we used 10 medical reports and asked a physician to rank the top 10 most relevant published case reports for each one. Our results show that MedLink’s ranking model achieved NDCG@10 of 0.747. Our demo also includes the visualization of clinical entities (using a NER model) and the production of a textual explanation (using a LLM) to ease comparison and contrasting between reports. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

2025

Preface

Autores
Campos, R; Jorge, M; Jatowt, A; Bhatia, S; Litvak, M;

Publicação
CEUR Workshop Proceedings

Abstract
[No abstract available]

2025

The 8th International Workshop on Narrative Extraction from Texts: Text2Story 2025

Autores
Campos, R; Jorge, A; Jatowt, A; Bhatia, S; Litvak, M;

Publicação
Advances in Information Retrieval - 47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, April 6-10, 2025, Proceedings, Part V

Abstract
For seven years, the Text2Story Workshop series has fostered a vibrant community dedicated to understanding narrative structure in text, resulting in significant contributions to the field and developing a shared understanding of the challenges in this domain. While traditional methods have yielded valuable insights, the advent of Transformers and LLMs have ignited a new wave of interest in narrative understanding. The previous iteration of the workshop also witnessed a surge in LLM-based approaches, demonstrating the community’s growing recognition of their potential. In this eighth edition we propose to go deeper into the role of LLMs in narrative understanding. While LLMs have revolutionized the field of NLP and are the go-to tools for any NLP task, the ability to capture, represent and analyze contextual nuances in longer texts is still an elusive goal, let alone the understanding of consistent fine-grained narrative structures in text. Consequently, this iteration of the workshop will explore the issues involved in using LLMs to unravel narrative structures, while also examining the characteristics of narratives generated by LLMs. By fostering dialogue on these emerging areas, we aim to continue the workshop's tradition of driving innovation in narrative understanding research. Text2Story encompasses sessions covering full research papers, work-in-progress, demos, resources, position and dissemination papers, along with one keynote talk. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

2025

Enhancing Portuguese Variety Identification with Cross-Domain Approaches

Autores
Sousa, HO; Almeida, R; Silvano, P; Cantante, I; Campos, R; Jorge, AM;

Publicação
AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25 - March 4, 2025, Philadelphia, PA, USA

Abstract
Recent advances in natural language processing have raised expectations for generative models to produce coherent text across diverse language varieties. In the particular case of the Portuguese language, the predominance of Brazilian Portuguese corpora online introduces linguistic biases in these models, limiting their applicability outside of Brazil. To address this gap and promote the creation of European Portuguese resources, we developed a cross-domain language variety identifier (LVI) to discriminate between European and Brazilian Portuguese. Motivated by the findings of our literature review, we compiled the PtBrVarId corpus, a cross-domain LVI dataset, and study the effectiveness of transformer-based LVI classifiers for cross-domain scenarios. Although this research focuses on two Portuguese varieties, our contribution can be extended to other varieties and languages. We open source the code, corpus, and models to foster further research in this task. © 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

2025

Tradutor: Building a Variety Specific Translation Model

Autores
Sousa, HO; Almasian, S; Campos, R; Jorge, AM;

Publicação
AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25 - March 4, 2025, Philadelphia, PA, USA

Abstract
Language models have become foundational to many widely used systems. However, these seemingly advantageous models are double-edged swords. While they excel in tasks related to resource-rich languages like English, they often lose the fine nuances of language forms, dialects, and varieties that are inherent to languages spoken in multiple regions of the world. Languages like European Portuguese are neglected in favor of their more popular counterpart, Brazilian Portuguese, leading to suboptimal performance in various linguistic tasks. To address this gap, we introduce the first open-source translation model specifically tailored for European Portuguese, along with a novel dataset specifically designed for this task. Results from automatic evaluations on two benchmark datasets demonstrate that our best model surpasses existing open-source translation systems for Portuguese and approaches the performance of industry-leading closed-source systems for European Portuguese. By making our dataset, models, and code publicly available, we aim to support and encourage further research, fostering advancements in the representation of underrepresented language varieties. © 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Teses
supervisionadas

2023

Learning Word Sense Representations from Neural Language Models

Autor
Daniel Alexandre Bouçanova Loureiro

Instituição
UP-FCUP

2023

Domain-specific and Context-aware Approaches to Sentiment Analysis

Autor
Shamsuddeen Hassan Muhammad

Instituição
UP-FCUP

2023

Digital technology and the social monitoring of climate change

Autor
Ana Sofia Cabral Cardoso

Instituição
UP-FCUP

2023

Building Portuguese Language Resources for Natural Language Processing Tasks

Autor
Rúben Filipe Seabra de Almeida

Instituição
UP-FCUP

2023

Heart Sound Analysis for Cardiovascular Diseases Identification

Autor
Diogo Marcelo Esterlita Nogueira

Instituição
UP-FCUP