Cookies Policy
The website need some cookies and similar means to function. If you permit us, we will use those means to collect data on your visits for aggregated statistics to improve our service. Find out More
Accept Reject
  • Menu
About

About

Ricardo Campos is a Professor at the Polytechnic Institute of Tomar and lecturer at the Porto Business School (PBS). He is a senior researcher of LIAAD-INESC TEC, the Artificial Intelligence and Decision Support Lab of U. Porto, and a collaborator of Ci2.ipt, the Smart Cities Research Center of the Polytechnic of Tomar. He is PhD in Computer Science by the University of Porto (U. Porto), being also a former student of the Universidade da Beira Interior (UBI). He has more than 10 years of experience in Information Retrieval (IR) and Natural Language Processing (NLP), period during which his research has been recognized with multiple awards in international conferences and scientific competitions. He is the leading author of the highly impactful YAKE! keyword extractor toolkit, of the Tell me Stories project and of the Arquivo Público, among other software. His current research focuses on developing methods concerned the process of narrative extraction from texts. He has participated in several research projects and is particularly interested in practical approaches regarding the relationship behind entities, events and temporal aspects, as a means to make sense of unstructured data. He is an editorial board member of the Information Processing and Management Journal (Elsevier), co-chaired international conferences and workshops, and is a regular member of the scientific committee of several international conferences. He is also a member of the Scientific Advisory Forum of the Portulan Clarin - Research Infrastructure for the Science and Technology of Language. For more info please click here.

Interest
Topics
Details

Details

  • Name

    Ricardo Campos
  • Cluster

    Computer Science
  • Role

    Senior Researcher
  • Since

    01st July 2012
001
Publications

2023

Public News Archive: A Searchable Sub-archive to Portuguese Past News Articles

Authors
Campos, R; Correia, D; Jatowt, A;

Publication
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Abstract
Over the past few decades, the amount of information generated turned the Web into the largest knowledge infrastructure existing to date. Web archives have been at the forefront of data preservation, preventing the losses of significant data to humankind. Different snapshots of the web are saved everyday enabling users to surf the past web and to travel through this overtime. Despite these efforts, many people are not aware that the web is being preserved, often finding these infrastructures to be unattractive or difficult to use, when compared to common search engines. In this paper, we give a step towards making use of this preserved information to develop “Public Archive” an intuitive interface that enables end-users to search and analyze a large-scale of 67,242 past preserved news articles belonging to a Portuguese reference newspaper (“Jornal Público”). The referred collection was obtained by scraping 10,976 versions of the homepage of the “Jornal Público” preserved by the Portuguese web archive infrastructure (Arquivo.pt) during the time-period of 2010 to 2021. By doing this, we aim, not only to mark a stand in what respects to make use of this preserved information, but also to come up with an easy-to-follow solution, the Public Archive python package, which creates the roots to be used (with minor adaptations) by other news source providers interested in offering their readers access to past news articles. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

2023

Text Mining and Visualization of Political Party Programs Using Keyword Extraction Methods: The Case of Portuguese Legislative Elections

Authors
Campos, R; Jatowt, A; Jorge, A;

Publication
Information for a Better World: Normality, Virtuality, Physicality, Inclusivity - 18th International Conference, iConference 2023, Virtual Event, March 13-17, 2023, Proceedings, Part I

Abstract

2023

Text2Storyline: Generating Enriched Storylines from Text

Authors
Gonçalves, F; Campos, R; Jorge, A;

Publication
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Abstract
In recent years, the amount of information generated, consumed and stored has grown at an astonishing rate, making it difficult for those seeking information to extract knowledge in good time. This has become even more important, as the average reader is not as willing to spare more time out of their already busy schedule as in the past, thus prioritizing news in a summarized format, which are faster to digest. On top of that, people tend to increasingly rely on strong visual components to help them understand the focal point of news articles in a less tiresome manner. This growing demand, focused on exploring information through visual aspects, urges the need for the emergence of alternative approaches concerned with text understanding and narrative exploration. This motivated us to propose Text2Storyline, a platform for generating and exploring enriched storylines from an input text, a URL or a user query. The latter is to be issued on the Portuguese Web Archive (Arquivo.pt), therefore giving users the chance to expand their knowledge and build up on information collected from web sources of the past. To fulfill this objective, we propose a system that makes use of the Time-Matters algorithm to filter out non-relevant dates and organize relevant content by means of different displays: ‘Annotated Text’, ‘Entities’, ‘Storyline’, ‘Temporal Clustering’ and ‘Word Cloud’. To extend the users’ knowledge, we rely on entity linking to connect persons, events, locations and concepts found in the text to Wikipedia pages, a process also known as Wikification. Each of the entities is then illustrated by means of an image collected from the Arquivo.pt. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

2022

The 5th International Workshop on Narrative Extraction from Texts: Text2Story 2022

Authors
Campos, R; Jorge, A; Jatowt, A; Bhatia, S; Litvak, M;

Publication
ADVANCES IN INFORMATION RETRIEVAL, PT II

Abstract

2022

Proceedings of Text2Story - Fifth Workshop on Narrative Extraction From Texts held in conjunction with the 44th European Conference on Information Retrieval (ECIR 2022), Stavanger, Norway, April 10, 2022

Authors
Campos, R; Jorge, AM; Jatowt, A; Bhatia, S; Litvak, M;

Publication
Text2Story@ECIR

Abstract