Cookies Policy
We use cookies to improve our site and your experience. By continuing to browse our site you accept our cookie policy. Find out More
Close
  • Menu
About
Download Photo HD

About

I am an Associate Professor with the Department of Informatics Engineering of FEUP, University of Porto, and a senior researcher at INESC TEC. I have collaborated on and been in charge of projects in the areas of cultural heritage, multimedia databases, information retrieval and combinatorial optimisation. I have supervised 9 PhD and 33 MSc dissertations. I was the technical leader of the SAPO/U.Porto extension laboratory for 5 years. My teaching activities include courses in the Informatics Engineering and Information Science programmes. Research Data Management is the core of my current research activity. I am the PI of TAIL (FCT/POCI), on research data management workflows for data publication, and I lead the DataPublication@U.Porto pilot in the EUDAT european initiative. I am a member of the Working Group for the National Policy on Open Science with SECTES. My research interests include information retrieval, digital preservation and knowledge representation.

Interest
Topics
Details

Details

006
Publications

2019

Data Deposit in a CKAN Repository: A Dublin Core-Based Simplified Workflow

Authors
Karimova, Y; Castro, JA; Ribeiro, C;

Publication
Communications in Computer and Information Science - Digital Libraries: Supporting Open Science

Abstract

2019

Interplay of Documents' Readability, Comprehension and Consumer Health Search Performance Across Query Terminology

Authors
Lopes, CT; Ribeiro, C;

Publication
Proceedings of the 2019 Conference on Human Information Interaction and Retrieval, CHIIR 2019, Glasgow, Scotland, UK, March 10-14, 2019

Abstract
Because of terminology mismatches, health consumers frequently face difficulties while searching the Web for health information. Difficulties arise in query formulation but also in understanding the retrieved documents. In this work we analyze how documents' readability affects users' comprehension and how both affect the retrieval performance, measured in different ways. In addition, we analyze how performance measures relate with each other. For this purpose we have conducted a laboratory user study with 40 participants. We found that readability is essential for a document to be at least partially relevant and that it becomes even more important if the document has medico-scientific terminology. Moreover, the relevance of a document to a specific user highly depends on its comprehension. In lay queries we found the medical accuracy of users' answers is related to the session's relevance assessments. This shows that users can, at least in part, relate their relevance assessments with the medical accuracy of the documents. On the other hand, this relationship does not exist with medico-scientific queries. © 2019 Copyright held by the owner/author(s). Publication rights licensed to Association for Computing Machinery.

2019

Hands-On Data Publishing with Researchers: Five Experiments with Metadata in Multiple Domains

Authors
Rodrigues, J; Castro, JA; da Silva, JR; Ribeiro, C;

Publication
Communications in Computer and Information Science

Abstract
The current requirements for open data in the EU are increasing the awareness of researchers with respect to data management and data publication. Metadata is essential in research data management, namely on data discovery and reuse. Current practices tend to either leave metadata definition to researchers, or to assign their creation to curators. The former typically results in ad-hoc descriptors, while the latter follows standards but lacks specificity. In this exploratory study, we adopt a researcher-curator collaborative approach in five data publication cases, involving researchers in data description and discussing the use of both generic and domain-oriented metadata. The study shows that researchers working on familiar datasets can contribute effectively to the definition of metadata models, in addition to the actual metadata creation. The cases also provide preliminary evidence of cross-disciplinary descriptor use. Moreover, the interaction with curators highlights the advantages of data management, making researchers more open to participate in the corresponding tasks. © Springer Nature Switzerland AG 2019.

2019

Ranking Dublin Core descriptor lists from user interactions: a case study with Dublin Core Terms using the Dendro platform

Authors
da Silva, JR; Ribeiro, C; Lopes, JC;

Publication
International Journal on Digital Libraries

Abstract
Dublin Core descriptors capture metadata in most repositories, and this includes recent repositories dedicated to datasets. DC descriptors are generic and are being adapted to the requirements of different communities with the so-called Dublin Core Application Profiles that rely on the agreement within user communities, taking into account their evolving needs. In this paper, we propose an automated process to help curators and users discover the descriptors that best suit the needs of a specific research group in the task of describing and depositing datasets. Our approach is supported on Dendro, a prototype research data management platform, where an experimental method is used to rank and present DC Terms descriptors to the users based on their usage patterns. User interaction is recorded and used to score descriptors. In a controlled experiment, we gathered the interactions of two groups as they used Dendro to describe datasets from selected sources. One of the groups viewed descriptors according to the ranking, while the other had the same list of descriptors throughout the experiment. Preliminary results show that (1) some DC Terms are filled in more often than others, with different distribution in the two groups, (2) descriptors in higher ranks were increasingly accepted by users in detriment of manual selection, (3) users were satisfied with the performance of the platform, and (4) the quality of description was not hindered by descriptor ranking. © 2018 Springer-Verlag GmbH Germany, part of Springer Nature

2019

Optimality in nesting problems: New constraint programming models and a new global constraint for non-overlap

Authors
Cherri, LH; Carravilla, MA; Ribeiro, C; Bragion Toledo, FMB;

Publication
Operations Research Perspectives

Abstract
In two-dimensional nesting problems (irregular packing problems) small pieces with irregular shapes must be packed in large objects. A small number of exact methods have been proposed to solve nesting problems, typically focusing on a single problem variant, the strip packing problem. There are however several other variants of the nesting problem which were identified in the literature and are very relevant in the industry. In this paper, constraint programming (CP) is used to model and solve all the variants of irregular cutting and packing problems proposed in the literature. Three approaches, which differ in the representation of the variable domains, in the way they deal with the core constraints and in the objective functions, are the basis for the three models proposed for each variant of the problem. The non-overlap among pieces, which must be enforced for all the problem variants, is guaranteed through the new global constraint NoOverlap in one of the proposed approaches. Taking the benchmark instances for the strip-packing problem, new instances were generated for each problem variant. Extensive computational experiments were run with these problem instances from the literature to evaluate the performance of each approach applied to each problem variant. The models based on the global constraint NoOverlap performed consistently better for all variants due to the increased propagation and to the low memory usage. The performance of the CP model for the strip packing problem with the global constraint NoOverlap was then compared with the Dotted Board with Rotations using larger instances from the literature. The experiments show that the CP model with global constraint NoOverlap can quickly find good quality solutions in shorter computational times even for large instances. © 2019

Supervised
thesis

2018

Automated analysis for process compliance

Author
Mariana Gaspar Oliveira

Institution
UP-FEUP

2017

Validação e Certificação digital de CV

Author
Joana Lopes Beleza

Institution
UP-FEUP

2017

SocialDendro: Aplicação de técnicas das redes sociais à gestão colaborativa de conjuntos de dados

Author
Nelson Miguel da Costa Martins Pereira

Institution
UP-FEUP

2017

Disseminação de conteúdos audiovisuais na web: uso de um perfil de aplicação para a gestão e agregação dos recursos da TVU

Author
Sara Catarina Pinheira de Oliveira

Institution
UP-FEUP

2017

Metadata gamification: Jogos sérios para melhoria de descrição de dados da investigação

Author
Bruno Coelho da Silva

Institution
UP-FEUP