2024
Authors
Abouelmaty, AM; Colaço, A; Fares, AA; Ramos, A; Costa, PA;
Publication
COMPUTERS AND GEOTECHNICS
Abstract
This study focuses on the assessment of ground vibrations due to pile driving activities. Given the likelihood of excessive vibration due to the driving process, it is imperative to predict vibration levels during the design phase. The primary goal of this work is to integrate machine learning techniques, specifically Extreme Gradient Boosting (XGBoost) and Artificial Neural Networks (ANNs), for real-time vibration prediction. The training dataset was generated using a validated numerical model, and the trained models were validated against experimental results. This validation process highlights the efficiency and accuracy of Extreme Gradient Boosting in predicting the free-field response of the ground.
2024
Authors
Brito, C; Ferreira, P; Paulo, J;
Publication
Abstract
2024
Authors
Ribeiro, R; Moraes, A; Moreno, M; Ferreira, PG;
Publication
MACHINE LEARNING
Abstract
Aging involves complex biological processes leading to the decline of living organisms. As population lifespan increases worldwide, the importance of identifying factors underlying healthy aging has become critical. Integration of multi-modal datasets is a powerful approach for the analysis of complex biological systems, with the potential to uncover novel aging biomarkers. In this study, we leveraged publicly available epigenomic, transcriptomic and telomere length data along with histological images from the Genotype-Tissue Expression project to build tissue-specific regression models for age prediction. Using data from two tissues, lung and ovary, we aimed to compare model performance across data modalities, as well as to assess the improvement resulting from integrating multiple data types. Our results demonstrate that methylation outperformed the other data modalities, with a mean absolute error of 3.36 and 4.36 in the test sets for lung and ovary, respectively. These models achieved lower error rates when compared with established state-of-the-art tissue-agnostic methylation models, emphasizing the importance of a tissue-specific approach. Additionally, this work has shown how the application of Hierarchical Image Pyramid Transformers for feature extraction significantly enhances age modeling using histological images. Finally, we evaluated the benefits of integrating multiple data modalities into a single model. Combining methylation data with other data modalities only marginally improved performance, likely due to the limited number of available samples. Combining gene expression with histological features yielded more accurate age predictions compared with the individual performance of these data types. Given these results, this study shows how machine learning applications can be extended to multi-modal aging research. Code used is available at https://github.com/zroger49/multi_modal_age_prediction.
2024
Authors
Sousa, B; Bessa, M; de Mendonca, FL; Ferreira, PG; Moreira, A; Pereira-Castro, I;
Publication
BIOINFORMATICS
Abstract
APAtizer is a tool designed to analyze alternative polyadenylation events on RNA-sequencing data. The tool handles different file formats, including BAM, htseq, and DaPars bedGraph files. It provides a user-friendly interface that allows users to generate informative visualizations, including Volcano plots, heatmaps, and gene lists. These outputs allow the user to retrieve useful biological insights such as the occurrence of polyadenylation events when comparing two biological conditions. In addition, it can perform differential gene expression, gene ontology analysis, visualization of Venn diagram intersections, and correlation analysis.
2024
Authors
Juliana Machado; Evelin Amorim;
Publication
Anais do XXXIX Simpósio Brasileiro de Banco de Dados (SBBD 2024)
Abstract
2024
Authors
Tomaszewska, A; Silvano, P; Leal, A; Amorim, E;
Publication
ISA 2024: 20th Joint ACL - ISO Workshop on Interoperable Semantic Annotation at LREC-COLING 2024, Workshop Proceedings
Abstract
The main objective of this study is to contribute to multilingual discourse research by employing ISO-24617 Part 8 (Semantic Relations in Discourse, Core Annotation Schema – DR-core) for annotating discourse relations. Centering around a parallel discourse relations corpus that includes English, Polish, and European Portuguese, we initiate one of the few ISO-based comparative analyses through a multilingual corpus that aligns discourse relations across these languages. In this paper, we discuss the project’s contributions, including the annotated corpus, research findings, and statistics related to the use of discourse relations. The paper further discusses the challenges encountered in complying with the ISO standard, such as defining the scope of arguments and annotating specific relation types like Expansion. Our findings highlight the necessity for clearer definitions of certain discourse relations and more precise guidelines for argument spans, especially concerning the inclusion of connectives. Additionally, the study underscores the importance of ongoing collaborative efforts to broaden the inclusion of languages and more comprehensive datasets, with the objective of widening the reach of ISO-guided multilingual discourse research. © 2024 ELRA Language Resource Association: CC BY-NC 4.0.