2024
Autores
Tomaszewska, A; Silvano, P; Leal, A; Amorim, E;
Publicação
ISA 2024: 20th Joint ACL - ISO Workshop on Interoperable Semantic Annotation at LREC-COLING 2024, Workshop Proceedings
Abstract
The main objective of this study is to contribute to multilingual discourse research by employing ISO-24617 Part 8 (Semantic Relations in Discourse, Core Annotation Schema – DR-core) for annotating discourse relations. Centering around a parallel discourse relations corpus that includes English, Polish, and European Portuguese, we initiate one of the few ISO-based comparative analyses through a multilingual corpus that aligns discourse relations across these languages. In this paper, we discuss the project’s contributions, including the annotated corpus, research findings, and statistics related to the use of discourse relations. The paper further discusses the challenges encountered in complying with the ISO standard, such as defining the scope of arguments and annotating specific relation types like Expansion. Our findings highlight the necessity for clearer definitions of certain discourse relations and more precise guidelines for argument spans, especially concerning the inclusion of connectives. Additionally, the study underscores the importance of ongoing collaborative efforts to broaden the inclusion of languages and more comprehensive datasets, with the objective of widening the reach of ISO-guided multilingual discourse research. © 2024 ELRA Language Resource Association: CC BY-NC 4.0.
2024
Autores
Almeida, R; Amorim, E;
Publicação
Legal and Ethical Issues in Human Language Technologies 2024, LEGAL 2024 at LREC-COLING 2024 - Workshop Proceedings
Abstract
Recent advances in deep learning have promoted the advent of many computational systems capable of performing intelligent actions that, until then, were restricted to the human intellect. In the particular case of human languages, these advances allowed the introduction of applications like ChatGPT that are capable of generating coherent text without being explicitly programmed to do so. Instead, these models use large volumes of textual data to learn meaningful representations of human languages. Associated with these advances, concerns about copyright and data privacy infringements caused by these applications have emerged. Despite these concerns, the pace at which new natural language processing applications continued to be developed largely outperformed the introduction of new regulations. Today, communication barriers between legal experts and computer scientists motivate many unintentional legal infringements during the development of such applications. In this paper, a multidisciplinary team intends to bridge this communication gap and promote more compliant Portuguese NLP research by presenting a series of everyday NLP use cases, while highlighting the Portuguese legislation that may arise during its development. © 2024 ELRA Language Resource Association.
2024
Autores
Juliana Machado; Evelin Amorim;
Publicação
Anais do XXXIX Simpósio Brasileiro de Banco de Dados (SBBD 2024)
Abstract
2024
Autores
Barbosa, S; Silva, ME; Rousseau, DD;
Publicação
NONLINEAR PROCESSES IN GEOPHYSICS
Abstract
Palaeoclimate time series, reflecting the state of Earth's climate in the distant past, occasionally display very large and rapid shifts showing abrupt climate variability. The identification and characterisation of these abrupt transitions in palaeoclimate records is of particular interest as this allows for understanding of millennial climate variability and the identification of potential tipping points in the context of current climate change. Methods that are able to characterise these events in an objective and automatic way, in a single time series, or across two proxy records are therefore of particular interest. In our study the matrix profile approach is used to describe Dansgaard-Oeschger (DO) events, abrupt warmings detected in the Greenland ice core, and Northern Hemisphere marine and continental records. The results indicate that canonical events DO-19 and DO-20, occurring at around 72 and 76 ka, are the most similar events over the past 110 000 years. These transitions are characterised by matching transitions corresponding to events DO-1, DO-8, and DO-12. They are abrupt, resulting in a rapid shift to warmer conditions, followed by a gradual return to cold conditions. The joint analysis of the delta 18O and Ca2+ time series indicates that the transition corresponding to the DO-19 event is the most similar event across the two time series.
2024
Autores
Rodrigues, ARF; Silva, ME; Silva, VF; Maia, MRG; Cabrita, ARJ; Trindade, H; Fonseca, AJM; Pereira, JLS;
Publicação
SCIENCE OF THE TOTAL ENVIRONMENT
Abstract
Seasonal and daily variations of gaseous emissions from naturally ventilated dairy cattle barns are important figures for the establishment of effective and specific mitigation plans. The present study aimed to measure methane (CH4) and ammonia (NH3) emissions in three naturally ventilated dairy cattle barns covering the four seasons for two consecutive years. In each barn, air samples from five indoor locations were drawn by a multipoint sampler to a photoacoustic infrared multigas monitor, along with temperature and relative humidity. Milk production data were also recorded. Results showed seasonal differences for CH4 and NH3 emissions in the three barns with no clear trends within years. Globally, diel CH4 emissions increased in the daytime with high intra-hour variability. The average hourly CH4 emissions (g h-1 livestock unit- 1 (LU)) varied from 8.1 to 11.2 and 6.2 to 20.3 in the dairy barn 1, from 10.1 to 31.4 and 10.9 to 22.8 in the dairy barn 2, and from 1.5 to 8.2 and 13.1 to 22.1 in the dairy barn 3, respectively, in years 1 and 2. Diel NH3 emissions highly varied within hours and increased in the daytime. The average hourly NH3 emissions (g h-1 LU-1) varied from 0.78 to 1.56 and 0.50 to 1.38 in the dairy barn 1, from 1.04 to 3.40 and 0.93 to 1.98 in the dairy barn 2, and from 0.66 to 1.32 and 1.67 to 1.73 in the dairy barn 3, respectively, in years 1 and 2. Moreover, the emission factors of CH4 and NH3 were 309.5 and 30.6 (g day- 1 LU-1), respectively, for naturally ventilated dairy cattle barns. Overall, this study provided a detailed characterization of seasonal and daily gaseous emissions variations highlighting the need for future longitudinal emission studies and identifying an opportunity to better adequate the existing mitigation strategies according to season and daytime.
2024
Autores
Costa, EA; Silva, ME; Galvao, Ana Beatriz;
Publicação
SOCIO-ECONOMIC PLANNING SCIENCES
Abstract
Policymakers often have to make decisions based on incomplete economic data because of the usual delay in publishing official statistics. To circumvent this issue, researchers use data from Google Trends (GT) as an early indicator of economic performance. Such data have emerged in the literature as alternative and complementary predictors of macroeconomic outcomes, such as the unemployment rate, featuring readiness, public availability and no costs. This study deals with extensive daily GT data to develop a framework to nowcast monthly unemployment rates tailored to work with real-time data availability, resorting to Mixed Data Sampling (MIDAS) regressions. Portugal is chosen as a use case for the methodology since extracting GT data requires the selection of culturally dependent keywords. The nowcasting period spans 2019 to 2021, encompassing the time frame in which the coronavirus pandemic initiated. The findings indicate that using daily GT data with MIDAS provides timely and accurate insights into the unemployment rate, especially during the COVID-19 pandemic, showing accuracy gains even when compared to nowcasts obtained from typical monthly GT data via traditional ARMAX models.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.