2025
Autores
Paiva, JC; Leal, JP; Figueira, A;
Publicação
ELECTRONICS
Abstract
Automated assessment tools for programming assignments have become increasingly popular in computing education. These tools offer a cost-effective and highly available way to provide timely and consistent feedback to students. However, when evaluating a logically incorrect source code, there are some reasonable concerns about the formative gap in the feedback generated by such tools compared to that of human teaching assistants. A teaching assistant either pinpoints logical errors, describes how the program fails to perform the proposed task, or suggests possible ways to fix mistakes without revealing the correct code. On the other hand, automated assessment tools typically return a measure of the program's correctness, possibly backed by failing test cases and, only in a few cases, fixes to the program. In this paper, we introduce a tool, AsanasAssist, to generate formative feedback messages to students to repair functionality mistakes in the submitted source code based on the most similar algorithmic strategy solution. These suggestions are delivered with incremental levels of detail according to the student's needs, from identifying the block containing the error to displaying the correct source code. Furthermore, we evaluate how well the automatically generated messages provided by AsanasAssist match those provided by a human teaching assistant. The results demonstrate that the tool achieves feedback comparable to that of a human grader while being able to provide it just in time.
2025
Autores
Alves, BA; Fontes, T; Rossetti, R;
Publicação
PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2024, PT II
Abstract
Traffic flow prediction is a critical component of intelligent transportation systems. This study introduces a Bidirectional Long Short-Term Memory (Bi-LSTM) neural network for predicting traffic flow. The model utilizes traffic, weather, and holiday data. To evaluate the model's performance, three experiments were assessed: E1, using all available inputs; E2, excluding weather conditions; and E3 excluding holiday information. The model was trained using the previous 3, 12, and 24 h of data to predict traffic flow for the next 12 h, and its performance was compared with a LSTM model. Traffic predictions benefit from having a large and diverse dataset. Bi-LSTM model can capture temporal patterns more effectively than the LSTM. The MAPE value is improved in around 1% when we increase the historical from 3h to 24 h, plus 1% if Bi-LSTM model is used. Better results are obtained when contextual information is provided. These results reinforce the potential that deep learning models have in the prediction of traffic conditions and the impact of a large and varied dataset in the accuracy of these predictions.
2025
Autores
Vaz, B; Figueira, A;
Publicação
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS
Abstract
This article focuses on the creation and evaluation of synthetic data to address the challenges of imbalanced datasets in machine learning (ML) applications, using fake news detection as a case study. We conducted a thorough literature review on generative adversarial networks (GANs) for tabular data, synthetic data generation methods, and synthetic data quality assessment. By augmenting a public news dataset with synthetic data generated by different GAN architectures, we demonstrate the potential of synthetic data to improve ML models' performance in fake news detection. Our results show a significant improvement in classification performance, especially in the underrepresented class. We also modify and extend a data usage approach to evaluate the quality of synthetic data and investigate the relationship between synthetic data quality and data augmentation performance in classification tasks. We found a positive correlation between synthetic data quality and performance in the underrepresented class, highlighting the importance of high-quality synthetic data for effective data augmentation.
2025
Autores
Ana Pires; André Santos; João Coutinho; Aaron Persad; André Dias; Rui Moura; José Almeida;
Publicação
IAF Space Exploration Symposium
Abstract
2025
Autores
Araújo, A; Jesus, Gd; Nunes, S;
Publicação
EPIA (2)
Abstract
Developing information retrieval (IR) systems that enable access across multiple languages is crucial in multilingual contexts. In Timor-Leste, where Tetun, Portuguese, English, and Indonesian are official and working languages, no cross-lingual information retrieval (CLIR) solutions currently exist to support information access across these languages. This study addresses that gap by investigating CLIR approaches tailored to the linguistic landscape of Timor-Leste. Leveraging an existing monolingual Tetun document collection and ad-hoc text retrieval baselines, we explore the feasibility of CLIR for Tetun. Queries were manually translated into Portuguese, English, and Indonesian to create a multilingual query set. These were then automatically translated back into Tetun using Google Translate and several large language models, and used to retrieve documents in Tetun. Results show that Google Translate is the most reliable tool for Tetun CLIR overall, and the Hiemstra LM consistently outperforms BM25 and DFR BM25 in cross-lingual retrieval performance. However, overall effectiveness remains up to 26.95% points lower than that of the monolingual baseline, underscoring the limitations of current translation tools and the challenges of developing an effective CLIR for Tetun. Despite these challenges, this work establishes the first CLIR baseline for Tetun ad-hoc text retrieval, providing a foundation for future research in this under-resourced setting. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
2025
Autores
Carreira, C; Mendes, A; Ferreira, JF; Christin, N;
Publicação
CoRR
Abstract
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.