2025
Autores
Sousa, H; Almeida, R; Silvano, P; Cantante, I; Campos, R; Jorge, A;
Publicação
THIRTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, AAAI-25, VOL 39 NO 24
Abstract
Recent advances in natural language processing have raised expectations for generative models to produce coherent text across diverse language varieties. In the particular case of the Portuguese language, the predominance of Brazilian Portuguese corpora online introduces linguistic biases in these models, limiting their applicability outside of Brazil. To address this gap and promote the creation of European Portuguese resources, we developed a cross-domain language variety identifier (LVI) to discriminate between European and Brazilian Portuguese. Motivated by the findings of our literature review, we compiled the PtBrVarId corpus, a cross-domain LVI dataset, and study the effectiveness of transformer-based LVI classifiers for cross-domain scenarios. Although this research focuses on two Portuguese varieties, our contribution can be extended to other varieties and languages. We open source the code, corpus, and models to foster further research in this task.
2025
Autores
Sousa, H; Almasian, S; Campos, R; Jorge, A;
Publicação
THIRTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, AAAI-25, VOL 39 NO 24
Abstract
Language models have become foundational to many widely used systems. However, these seemingly advantageous models are double-edged swords. While they excel in tasks related to resource-rich languages like English, they often lose the fine nuances of language forms, dialects, and varieties that are inherent to languages spoken in multiple regions of the world. Languages like European Portuguese are neglected in favor of their more popular counterpart, Brazilian Portuguese, leading to suboptimal performance in various linguistic tasks. To address this gap, we introduce the first open-source translation model specifically tailored for European Portuguese, along with a novel dataset specifically designed for this task. Results from automatic evaluations on two benchmark datasets demonstrate that our best model surpasses existing open-source translation systems for Portuguese and approaches the performance of industry-leading closed-source systems for European Portuguese. By making our dataset, models, and code publicly available, we aim to support and encourage further research, fostering advancements in the representation of underrepresented language varieties.
2025
Autores
da Silva, JP; Nogueira, AR; Pinto, J; Curral, M; Alves, AC; Sousa, R;
Publicação
EXPERT SYSTEMS
Abstract
Integrating Industry 4.0 and Quality 4.0 optimises manufacturing through IoT and ML, improving processes and product quality. The primary challenge involves identifying patterns in computer numerical control (CNC) machining time-series data to boost manufacturing quality control. The proposed solution involves an experimental study comparing one-class and binary classification algorithms. This study aims to classify time-series data from CNC turning machines, offering insight into monitoring and adjusting tool wear to maintain product quality. The methodology entails extracting spectral features from time-series data to train both one-class and binary classification algorithms, assessing their effectiveness and computational efficiency. Although certain models consistently outperform others, determining the best performing is not possible, as a trade-off between classification and computational performance is observed, with gradient boosting standing out for effectively balancing both aspects. Thus, the choice between one-class and binary classification ultimately relies on dataset's features and task objectives.
2025
Autores
Ferreira, A; Barroso, J; Reis, A; Gouveia, AJ;
Publicação
Smart Innovation, Systems and Technologies
Abstract
This article presents a systematic review of the most prevalent vulnerabilities plaguing web and mobile applications. By analyzing recent research, it identifies a core set of vulnerabilities, including injection flaws, broken authentication, cross-site scripting (XSS), and insecure direct object references. Recognizing the human element, the article acknowledges the role of social engineering in exploiting these technical weaknesses. The review delves deeper, exploring how these vulnerabilities manifest differently across web and mobile platforms, considering factors like server-side security and API access. The research concludes by advocating for a defense strategy, emphasizing the importance of secure coding practices, robust authentication, and user awareness training. This comprehensive approach paves the way for a more secure digital landscape where both web and mobile applications can thrive. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
2025
Autores
Castro, JT; Pinheiro, I; Marques, MN; Moura, P; dos Santos, FN;
Publicação
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Abstract
In nature, and particularly in agriculture, pollination is fundamental for the sustainability of our society. In this context, pollination is a vital process underlying crop yield quality and is responsible for the biodiversity and the standards of the flora. Bees play a crucial role in natural pollination; however, their populations are declining. Robots can help maintain pollination levels while humans work to recover bee populations. Swarm robotics approaches appear promising for robotic pollination. This paper proposes the cooperation between multiple Unmanned Aerial Vehicles (UAVs) and an Unmanned Ground Vehicle (UGV), leveraging the advantages of collaborative work for pollination, referred to as Pollinationbots. Pollinationbots is based in swarm behaviors and methodologies to implement more effective pollination strategies, ensuring efficient pollination across various scenarios. The paper presents the architecture of the Pollinationbots system, which was evaluated using the Webots simulator, focusing on path planning and follower behavior. Preliminary simulation results indicate that this is a viable solution for robotic pollination. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
2025
Autores
Sousa, TB; Ferreira, HS; Correia, FF;
Publicação
Transactions on Pattern Languages of Programming V
Abstract
Software businesses are continuously increasing their presence in the cloud. While cloud computing is not a new research topic, designing software for the cloud is still challenging, requiring engineers to invest in research to become proficient at working with it. Design patterns can be used to facilitate cloud adoption, as they provide valuable design knowledge and implementation guidelines for recurrent engineering problems. This work introduces a pattern language for designing software for the cloud. We believe developers can significantly reduce their R&D time by adopting these patterns to bootstrap their cloud architecture. The language comprises 10 patterns, organized into four categories: Automated Infrastructure Management, Orchestration and Supervision, Monitoring, and Discovery and Communication. © The Author(s), under exclusive license to Springer-Verlag GmbH, DE, part of Springer Nature 2025.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.