Cookies Policy
We use cookies to improve our site and your experience. By continuing to browse our site you accept our cookie policy. Find out More
Close
  • Menu
About

About

Holds a Master degree in Information Science, by the University of Porto. Currently a Digital Media PhD student.

The main focus of interest is in the definition of domain-specific metadata modelos so researchers can describe the data they are creating.

Interest
Topics
Details

Details

  • Name

    João Aguiar Castro
  • Cluster

    Computer Science
  • Role

    Research Assistant
  • Since

    15th July 2013
Publications

2019

Data Deposit in a CKAN Repository: A Dublin Core-Based Simplified Workflow

Authors
Karimova, Y; Castro, JA; Ribeiro, C;

Publication
Communications in Computer and Information Science - Digital Libraries: Supporting Open Science

Abstract

2018

Research Data Management Tools and Workflows: Experimental Work at the University of Porto

Authors
Ribeiro, C; Rocha da Silva, J; Aguiar Castro, J; Carvalho Amorim, R; Correia Lopes, J; David, G;

Publication
IASSIST Quarterly

Abstract
Research datasets include all kinds of objects, from web pages to sensor data, and originate in every domain. Concerns with data generated in large projects and well-funded research areas are centered on their exploration and analysis. For data in the long tail, the main issues are still how to get data visible, satisfactorily described, preserved, and searchable. Our work aims to promote data publication in research institutions, considering that researchers are the core stakeholders and need straightforward workflows, and that multi-disciplinary tools can be designed and adapted to specific areas with a reasonable effort. For small groups with interesting datasets but not much time or funding for data curation, we have to focus on engaging researchers in the process of preparing data for publication, while providing them with measurable outputs. In larger groups, solutions have to be customized to satisfy the requirements of more specific research contexts. We describe our experience at the University of Porto in two lines of enquiry. For the work with long-tail groups we propose general-purpose tools for data description and the interface to multi-disciplinary data repositories. For areas with larger projects and more specific requirements, namely wind infrastructure, sensor data from concrete structures and marine data, we define specialized workflows. In both cases, we present a preliminary evaluation of results and an estimate of the kind of effort required to keep the proposed infrastructures running.  The tools available to researchers can be decisive for their commitment. We focus on data preparation, namely on dataset organization and metadata creation. For groups in the long tail, we propose Dendro, an open-source research data management platform, and explore automatic metadata creation with LabTablet, an electronic laboratory notebook. For groups demanding a domain-specific approach, our analysis has resulted in the development of models and applications to organize the data and support some of their use cases. Overall, we have adopted ontologies for metadata modeling, keeping in sight metadata dissemination as Linked Open Data.

2018

Research data management in the field of Ecology: An overview

Authors
Alves, C; Castro, JA; Ribeiro, C; Honrado, JP; Lomba, A;

Publication
Proceedings of the International Conference on Dublin Core and Metadata Applications

Abstract
The diversity of research topics and resulting datasets in the field of Ecology (the scientific study of ecological systems and their biodiversity) has grown in parallel with developments in research data management. Based on a meta-analysis performed on 93 scientific references, this paper presents a comprehensive overview of the use of metadata tools in the Ecology domain through time. Overall, 40 metadata tools were found to be either referred or used by the research community from 1997 to 2018. In the same period, 50 different initiatives in ecology and biodiversity research were conceptualized and implemented to promote effective data sharing in the community. A relevant concern that stems from this analysis is the need to establish simple methods to promote data interoperability and reuse, so far limited by the production of metadata according to different standards. With this study, we also highlight challenges and perspectives in research data management in the domain of Ecology towards best practice guidelines.

2017

Involving data creators in an ontology-based design process for metadata models

Authors
Castro, JA; Amorim, RC; Gattelli, R; Karimova, Y; Da Silva, JR; Ribeiro, C;

Publication
Developing Metadata Application Profiles

Abstract
Research data are the cornerstone of science and their current fast rate of production is disquieting researchers. Adequate research data management strongly depends on accurate metadata records that capture the production context of the datasets, thus enabling data interpretation and reuse. This chapter reports on the authors' experience in the development of the metadata models, formalized as ontologies, for several research domains, involving members from small research teams in the overall process. This process is instantiated with four case studies: vehicle simulation; hydrogen production; biological oceanography and social sciences. The authors also present a data description workflow that includes a research data management platform, named Dendro, where researchers can prepare their datasets for further deposit in external data repositories. © 2017, IGI Global.

2017

Promoting Semantic Annotation of Research Data by Their Creators: A Use Case with B2NOTE at the End of the RDM Workflow

Authors
Karimova, Y; Castro, JA; Da Silva, JR; Pereira, N; Ribeiro, C;

Publication
Metadata and Semantic Research - Communications in Computer and Information Science

Abstract

Supervised
thesis

2016

Vocabulários controlados na descrição de dados de investigação no Dendro

Author
Yulia Karimova

Institution
UP-FEUP