Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
Publications

Publications by Alípio Jorge

2019

Proceedings of Text2Story - 2nd Workshop on Narrative Extraction From Texts, co-located with the 41st European Conference on Information Retrieval, Text2Story@ECIR 2019, Cologne, Germany, April 14th, 2019

Authors
Jorge, AM; Campos, R; Jatowt, A; Bhatia, S;

Publication
Text2Story@ECIR

Abstract

2020

Proceedings of Text2Story - Third Workshop on Narrative Extraction From Texts co-located with 42nd European Conference on Information Retrieval, Text2Story@ECIR 2020, Lisbon, Portugal, April 14th, 2020 [online only]

Authors
Campos, R; Jorge, AM; Jatowt, A; Bhatia, S;

Publication
Text2Story@ECIR

Abstract

2020

Incremental Approach for Automatic Generation of Domain-Specific Sentiment Lexicon

Authors
Muhammad, SH; Brazdil, P; Jorge, A;

Publication
ECIR (2)

Abstract
Sentiment lexicon plays a vital role in lexicon-based sentiment analysis. The lexicon-based method is often preferred because it leads to more explainable answers in comparison with many machine learning-based methods. But, semantic orientation of a word depends on its domain. Hence, a general-purpose sentiment lexicon may gives sub-optimal performance compare with a domain-specific lexicon. However, it is challenging to manually generate a domain-specific sentiment lexicon for each domain. Still, it is impractical to generate complete sentiment lexicon for a domain from a single corpus. To this end, we propose an approach to automatically generate a domain-specific sentiment lexicon using a vector model enriched by weights. Importantly, we propose an incremental approach for updating an existing lexicon to either the same domain or different domain (domain-adaptation). Finally, we discuss how to incorporate sentiment lexicons information in neural models (word embedding) for better performance.

2019

LIAAD at SemDeep-5 Challenge: Word-in-Context (WiC)

Authors
Loureiro, D; Jorge, A;

Publication
SemDeep@IJCAI

Abstract

2020

MedLinker: Medical Entity Linking with Neural Representations and Dictionary Matching

Authors
Loureiro, D; Jorge, AM;

Publication
ECIR (2)

Abstract
Progress in the field of Natural Language Processing (NLP) has been closely followed by applications in the medical domain. Recent advancements in Neural Language Models (NLMs) have transformed the field and are currently motivating numerous works exploring their application in different domains. In this paper, we explore how NLMs can be used for Medical Entity Linking with the recently introduced MedMentions dataset, which presents two major challenges: (1) a large target ontology of over 2M concepts, and (2) low overlap between concepts in train, validation and test sets. We introduce a solution, MedLinker, that addresses these issues by leveraging specialized NLMs with Approximate Dictionary Matching, and show that it performs competitively on semantic type linking, while improving the state-of-the-art on the more fine-grained task of concept linking (+4 F1 on MedMentions main task).

2020

Sentence Compression for Portuguese

Authors
Nóbrega, FAA; Jorge, AM; Brazdil, P; Pardo, TAS;

Publication
COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020

Abstract
The task of Sentence Compression aims at producing a shorter version of a given sentence. This task may assist many other applications, as Automatic Summarization and Text Simplification. In this paper, we investigate methods for Sentence Compression for Portuguese. We focus on machine learning-based algorithms and propose new strategies. We also create reference corpora/datasets for the area, allowing to train and to test the methods of interest. Our results show that some of our methods outperform previous initiatives for Portuguese and produce competitive results with a state of the art method in the area.

  • 14
  • 46