Publications

Publications by João Rocha Silva

2016

Usage-Driven Dublin Core Descriptor Selection: A Case Study Using the Dendro Platform for Research Dataset Description

Authors
da Silva, JR; Ribeiro, C; Lopes, JC;

Publication
RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, TPDL 2016

Abstract
Dublin Core schemas are the core metadata models of most repositories, and this includes recent repositories dedicated to datasets. DC descriptors are generic and are being adapted to the needs of different communities with the so-called Dublin Core Application Profiles. DCAPs rely on the agreement within user communities, in a process mainly driven by their evolving needs. In this paper, we propose a complementary automated process, designed to help curators and users discover the descriptors that better suit the needs of a specific research group. We target the description of datasets, and test our approach using Dendro, a prototype research data management platform, where an experimental method is used to rank and present DC Terms descriptors to the users based on their usage patterns. In a controlled experiment, we gathered the interactions of two groups as they used Dendro to describe datasets from selected sources. One of the groups had descriptor ranking on, while the other had the same list of descriptors throughout the whole experiment. Preliminary results show that (1) some DC Terms are filled in more often than others, with different distributions in the two groups, (2) selected descriptors were increasingly accepted by users to the detriment of manual selection, and (3) users were satisfied with the performance of the platform, as demonstrated by a post-study survey.

2017

Social Dendro: Social Network Techniques Applied to Research Data Description

Authors
Pereira, N; da Silva, JR; Ribeiro, C;

Publication
RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES (TPDL 2017)

Abstract
Research data management has become an integral part of the research workflow. Currently, concern with data appears mainly at the very last stages of projects, rather than being present from the moment of data creation. The goal of this work is to make data easier to find, share and reuse through early metadata production and in-group review. The approach proposed in this paper, Social Dendro, introduces social network concepts such as posts, shares and comments, in Dendro, our research data management platform. The implementation follows the ontology-based architecture of the platform. Results of a preliminary user test have provided insights for future improvements.

2017

A comparison of research data management platforms: architecture, flexible metadata and interoperability

Authors
Amorim, RC; Castro, JA; da Silva, JR; Ribeiro, C;

Publication
UNIVERSAL ACCESS IN THE INFORMATION SOCIETY

Abstract
Research data management is rapidly becoming a regular concern for researchers, and institutions need to provide them with platforms to support data organization and preparation for publication. Some institutions have adopted institutional repositories as the basis for data deposit, whereas others are experimenting with richer environments for data description, in spite of the diversity of existing workflows. This paper is a synthetic overview of current platforms that can be used for data management purposes. Adopting a pragmatic view on data management, the paper focuses on solutions that can be adopted in the long tail of science, where investments in tools and manpower are modest. First, a broad set of data management platforms is presented (some designed for institutional repositories and digital libraries) to select a short list of the more promising ones for data management. These platforms are compared considering their architecture, support for metadata, existing programming interfaces, as well as their search mechanisms and community acceptance. In this process, the stakeholders' requirements are also taken into account. The results show that there is still plenty of room for improvement, mainly regarding the specificity of data description in different domains, as well as the potential for integration of the data management platforms with existing research management tools. Nevertheless, depending on the context, some platforms can meet all or part of the stakeholders' requirements.

2017

Description + annotation: semantic data publication workflow with Dendro and B2NOTE

Authors
Karimova, Y; Castro, JA; da Silva, JR; Pereira, N; Rodrigues, J; Ribeiro, C;

Publication
Int. J. Metadata Semant. Ontologies

Abstract
Metadata puts research data in their context, making data intelligible and apt to sustain technology evolution and to be reused, in compliance with the FAIR principles. The workflow proposed in this work includes metadata generation in the context of research projects, created with the Dendro platform, and metadata originated in the interaction of people with the deposited data, created with the B2NOTE service from EUDAT. In our experiments, datasets are prepared with Dendro, taking into consideration general-purpose descriptors and domain-specific ones, then transparently deposited in B2SHARE. After publication, B2NOTE provides an environment where authors, other researchers, and any interested party can enrich the description with less formal comments, tags or keywords. This work contributes with (a) a set of use cases in several domains, (b) details on the descriptors used by authors in each case, and (c) reflections on the use of data after publication, using the B2NOTE contributions. © Copyright 2017 Inderscience Enterprises Ltd.

2018

Supporting Description of Research Data: Evaluation and Comparison of Term and Concept Extraction Approaches

Authors
Monteiro, C; Lopes, CT; Silva, JR;

Publication
DIGITAL LIBRARIES FOR OPEN KNOWLEDGE, TPDL 2018

Abstract
The importance of research data management is widely recognized. Dendro is an ontology-based platform that allows researchers to describe datasets using generic and domain-specific descriptors from ontologies. Selecting or building the right ontologies for each research domain or group requires meetings between curators and researchers in order to capture the main concepts of their research. Envisioning a tool to assist curators through the automatic extraction of key concepts from research documents, we propose two concept extraction methods and compare them with a term extraction method. To compare the three approaches, we use as ground truth an ontology previously created by human curators.

2018

Grassroots Meets Grasstops: Integrated Research Data Management with EUDAT B2 Services, Dendro and LabTablet

Authors
da Silva, JR; Pereira, N; Dias, P; Barros, B;

Publication
DIGITAL LIBRARIES FOR OPEN KNOWLEDGE, TPDL 2018

Abstract
We present an integrated research data management (RDM) workflow that captures data from the moment of creation until its deposit. We integrated LabTablet, our electronic laboratory notebook, Dendro, our data organisation and description platform aimed at collaborative management of research data, and EUDAT's B2DROP and B2SHARE platforms. This approach combines the portability and automated metadata production abilities of LabTablet, Dendro as a collaborative RDM tool for dataset preparation, with the scalable storage of B2DROP and the long-term deposit of datasets in B2SHARE. The resulting workflow can be put to work in research groups where laboratorial or field work is central.
