Publications

Publications by HumanISE

2023

Images as Metadata: A New Perspective for Describing Research Data

Authors
Rodrigues, J; Teixeira Lopes, C;

Publication
Journal of Library Metadata

Abstract
Indispensable in many contexts, images are fundamental in the tasks of representation and transmission of information. In the scientific context, images can be tools for researchers seeking to see their data properly managed. Research data management provides guidance in this direction, as it determines the necessary phases in the life cycle of projects. The description phase is fundamental as it is an essential means for data context, safeguarding, and reuse. The description often occurs through metadata models composed of descriptors capable of attributing context. However, there is one common aspect: the values associated with these descriptors are always textual or numeric. Through studies and work developed over the last few years, we propose a new approach to description, where images can have a preponderant role in the description of data, assuming the role of metadata. We present several pieces of evidence, point out their challenges and determine the opportunities this new perspective can have in research. Images have specific characteristics that can be leveraged in improving data description. Historical evidence establishes that images have always been used and produced in research, yet their representational ability has never been harnessed to describe data and give more context to the scientific process. © Joana Rodrigues and Carla Teixeira Lopes. Published with license by Taylor & Francis Group, LLC.

2023

Optimization of Image Processing Algorithms for Character Recognition in Cultural Typewritten Documents

Authors
Dias, M; Lopes, CT;

Publication
ACM Journal on Computing and Cultural Heritage

Abstract
Linked data is used in various fields as a new way of structuring and connecting data. Cultural heritage institutions have been using linked data to improve archival descriptions and facilitate the discovery of information. Most archival records have digital representations of physical artifacts in the form of scanned images that are non-machine-readable. Optical Character Recognition (OCR) recognizes text in images and translates it into machine-encoded text. This article evaluates the impact of image processing methods and parameter tuning in OCR applied to typewritten cultural heritage documents. The approach uses a multi-objective problem formulation to minimize Levenshtein edit distance and maximize the number of words correctly identified with a non-dominated sorting genetic algorithm (NSGA-II) to tune the methods' parameters. Evaluation results show that parameterization by digital representation typology benefits the performance of image pre-processing algorithms in OCR. Furthermore, our findings suggest that employing image pre-processing algorithms in OCR might be more suitable for typologies where the text recognition task without pre-processing does not produce good results. In particular, Adaptive Thresholding, Bilateral Filter, and Opening are the best-performing algorithms for the theater plays' covers, letters, and overall dataset, respectively, and should be applied before OCR to improve its performance.
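The two objectives named in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names are hypothetical, "words correctly identified" is assumed here to mean a bag-of-words overlap with a reference transcription, and NSGA-II itself is not implemented, only the dominance comparison that non-dominated sorting relies on.

```python
from collections import Counter


def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions, and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def correct_words(ocr_text: str, reference: str) -> int:
    """Assumed definition: size of the multiset overlap of whitespace tokens."""
    overlap = Counter(ocr_text.split()) & Counter(reference.split())
    return sum(overlap.values())


def objectives(ocr_text: str, reference: str) -> tuple[int, int]:
    """Return (edit distance to minimize, correct words to maximize)."""
    return levenshtein(ocr_text, reference), correct_words(ocr_text, reference)


def dominates(a: tuple[int, int], b: tuple[int, int]) -> bool:
    """True if a is no worse than b on both objectives and better on one."""
    no_worse = a[0] <= b[0] and a[1] >= b[1]
    strictly_better = a[0] < b[0] or a[1] > b[1]
    return no_worse and strictly_better
```

In such a setup, each candidate set of pre-processing parameters would be scored by running OCR on the processed image and passing the output to `objectives`; a multi-objective optimizer would then keep the candidates that no other candidate dominates.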

2023

Unveiling Archive Users: Understanding Their Characteristics and Motivations

Authors
Ponte, L; Koch, I; Lopes, CT;

Publication
Leveraging Generative Intelligence in Digital Libraries: Towards Human-Machine Collaboration, ICADL 2023, Part II

Abstract
An institution must understand its users to provide quality services, and archives are no exception. Over the years, archives have adapted to the technological world, and their users have also changed. To understand archive users' characteristics and motivations, we conducted a study in the context of the Portuguese Archives. For this purpose, we analysed a survey and complemented this analysis with information gathered in interviews with archivists. Based on the most frequent reasons for visiting the archives, we defined six main archival profiles (genealogical research, historical research, legal purposes, academic work, institutional purposes and publication purposes), later characterised using the results of the previous analysis. For each profile, we created a persona for a more visual and realistic representation of users.

2023

Linking Theory and Practice of Digital Libraries: 27th International Conference on Theory and Practice of Digital Libraries, TPDL 2023, Zadar, Croatia, September 26-29, 2023, Proceedings

Authors
Alonso, O; Cousijn, H; Silvello, G; Marrero, M; Lopes, CT; Marchesin, S;

Publication
TPDL

Abstract

2023

Linking Theory and Practice of Digital Libraries

Authors
Alonso, O; Cousijn, H; Silvello, G; Marrero, M; Teixeira Lopes, C; Marchesin, S;

Publication
Lecture Notes in Computer Science

Abstract

2023

Chatbots Scenarios for Education

Authors
Virkus, S; Mamede, HS; Ramos Rocio, VJ; Dickel, J; Zubikova, O; Butkiene, R; Vaiciukynas, E; Ceponiene, L; Gudoniene, D;

Publication
Information and Software Technologies - 29th International Conference, ICIST 2023, Kaunas, Lithuania, October 12-14, 2023, Proceedings

Abstract
Educational chatbots are digital tools designed to assist learners in various educational settings. These chatbots use natural language processing (NLP) and machine learning algorithms to simulate human conversation and respond to user queries in a way that facilitates learning. They can be integrated into various educational platforms such as learning management systems, educational apps, and websites to provide learners with a personalized and interactive learning experience. Our paper discusses different scenarios for educational purposes and proposes four scenarios in total for educational needs.
