Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

2025

PAP900: A dataset of semantic relationships between affective words in Portuguese

Autores
dos Santos, AF; Leal, JP; Alves, RA; Jacques, T;

Publicação
DATA IN BRIEF

Abstract
The PAP900 dataset centers on the semantic relationship between affective words in Portuguese. It contains 900 word pairs, each annotated by at least 30 human raters for both semantic similarity and semantic relatedness. In addition to the semantic ratings, the dataset includes the word categorization used to build the word pairs and detailed sociodemographic information about annotators, enabling the analysis of the influence of personal factors on the perception of semantic relationships. Furthermore, this article describes in detail the dataset construction process, from word selection to agreement metrics. Data was collected from Portuguese university psychology students, who completed two rounds of questionnaires. In the first round annotators were asked to rate word pairs on either semantic similarity or relatedness. The second round switched the relation type for most annotators, with a small percentage being asked to repeat the same relation. The instructions given emphasized the differences between semantic relatedness and semantic similarity, and provided examples of expected ratings of both. There are few semantic relations datasets in Portuguese, and none focusing on affective words. PAP900 is distributed in distinct formats to be easy to use for both researchers just looking for the final averaged values and for researchers looking to take advantage of the individual ratings, the word categorization and the annotator data. This dataset is a valuable resource for researchers in computational linguistics, natural language processing, psychology, and cognitive science. (c) 2025TheAuthors.

2025

Multi-domain indoor environmental quality and worker health, well-being, and productivity: Objective and subjective assessments in modern office buildings

Autores
Felgueiras, F; Mourao, Z; Moreira, A; Gabriel, MF;

Publicação
BUILDING AND ENVIRONMENT

Abstract
It is widely recognized that the well-being, health, and productivity of office workers can be influenced by indoor environmental quality (IEQ) conditions in the workplace. This study aimed to investigate associations between multi-domain IEQ in offices and workers' well-being, health, productivity, and perceived IEQ in 30 open office spaces (6 buildings) located in the urban area of Porto, Portugal. This cross-sectional study included 277 office workers and used a combination of methods to assess their perceptions and physiological responses. Data were collected through questionnaires (covering self-reported well-being, health, productivity, and IEQ satisfaction), pupillometry (autonomic nervous system activity), and concurrent monitoring of IEQ. Correlation, comparative, and regression methods were used to explore associations and differences between IEQ indicators and participants' outcomes. The findings showed that offices typically met acceptable IEQ standards. However, a higher prevalence of health problems and symptoms was observed in offices with higher levels of carbon dioxide (CO2), ozone (O3), particulate matter (PM10), and ultrafine particles (UFP). Interestingly, offices with higher COQ, PM2.5, and volatile organic compounds concentrations were linked to a reduced likelihood of participants reporting asthma, dry cough, and allergies. Additionally, thermal discomfort due to high temperatures, increased PM2.5, UFP, CO2, and O3, and low illuminance appear to reduce eye response in office workers. Higher CO2 and noise levels, and temperatures outside the comfortable range, were linked to lower productivity. The multi-domain analysis showed that perception of multiple IEQ factors significantly explained both self-reported productivity and overall satisfaction with work environment. Overall, ensuring proper IEQ and enhancing workers' satisfaction are essential for creating healthy and productive workplaces.

2025

Rebuilding the Past: Reconstructing Portuguese News Outlets with Web Archives

Autores
Silva, R; Campos, R;

Publicação
Advances in Information Retrieval - 47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, April 6-10, 2025, Proceedings, Part V

Abstract
Around 80% of websites change significantly or disappear altogether after the first year, resulting in the loss of invaluable information. In this volatile scenario, preserving online content is increasingly essential. This is especially critical for local news outlets, which produce a wealth of information within the unique context of their communities but often lack sufficient archiving resources. In this paper, we take a significant step forward by leveraging the information preserved by the Portuguese Web Archive, Arquivo.pt, to recreate the website of a local news outlet. This online demo grants users direct access to previously lost news articles, images, and front covers, thus contributing to preserving local digital heritage. An IR system was also implemented to ensure easy access, along with a recommendation system based on BERT embeddings to suggest related news articles and enhance user engagement. As a final contribution, we also provide a Python package, enabling others to replicate the process of collecting, processing, retrieving, and recreating websites for local news outlets in Portugal. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

2025

Data Access for Recommender Systems Research: leveraging the EU's Digital Services Act

Autores
Vinagre, J; Porcaro, L; Merisio, S; Purificato, E; Gómez, E;

Publicação
Proceedings of the Nineteenth ACM Conference on Recommender Systems, RecSys 2025, Prague, Czech Republic, September 22-26, 2025

Abstract

2025

Exploring Perceptions of Comfort, Security and Safety in Different Modes of Transport: A Comparative Study

Autores
Ferreira, MC; Dias, TG;

Publicação
TRANSPORT TRANSITIONS: ADVANCING SUSTAINABLE AND INCLUSIVE MOBILITY, TRA CONFERENCE, 2024

Abstract
This study seeks to comprehensively analyze the multidimensional determinants underlying perceptions of safety, security, and comfort in transport mode choice, specifically focusing on private transport, public transport and walking. The research begins with an extensive literature review to identify and delve into the factors influencing perceptions of safety, security, and comfort across various transport modes. This inquiry is further enhanced by organizing two focused group sessions. A total of 35 key factors were identified, forming the basis for subsequent investigation. The study then progressed to the development and administration of a survey aimed at capturing responses from a diverse audience, with the goal of exploring the factors influencing perceptions related to different transport modes. A total of 302 responses were collected and meticulously analyzed to discern the factors impacting various relationships and to identify consistent perceptions across diverse transport modes. Additionally, a factor analysis was conducted to validate the findings derived from the data. The outcomes of this research constitute a significant contribution to the existing literature, offering valuable insights that pave the way for a more holistic understanding of the factors guiding transport mode choices.

2025

Air Quality Data Analysis with Symbolic Principal Components

Autores
Loureiro, P; Oliveira, M; Brito, P; Oliveira, L;

Publicação
Springer Proceedings in Mathematics and Statistics

Abstract
Air pollution is a global challenge with deep implications in public health and environment. We examine air quality data from a monitoring station in Entrecampos, Lisbon, Portugal, using Symbolic Data Analysis. The dataset consists of hourly concentrations of nine pollutants during three years, which are logarithmically transformed and aggregated in intervals, taking the daily minimum and maximum values. The symbolic mean and variance are estimated for each variable through the method of moments, and the pairwise dependencies are captured using a bivariate copula. Symbolic principal component scores are obtained from the estimated covariance matrix and used to fit generalized extreme value distributions. Outlier maps, based on these distributions’ quantiles, are used to identify outlying observations. A comparative analysis with daily average-based outlier detection methods is conducted. The results show the relevance of Symbolic Data Analysis in revealing new insights into air quality. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

  • 61
  • 4353