2024
Authors
Lopes, JM; Mota, LP; Mota, SM; Torres, JM; Moreira, RS; Soares, C; Pereira, I; Gouveia, FR; Sobral, P;
Publication
FUTURE INTERNET
Abstract
All types of sports are potential application scenarios for automatic and real-time visual object and event detection. In rink hockey, the popular roller skate variant of team hockey, it is of great interest to automatically track player movements, positions, and sticks, and also to make other judgments, such as being able to locate the ball. In this work, we present a real-time pipeline consisting of an object detection model specifically designed for rink hockey games, followed by a knowledge-based event detection module. Even in the presence of occlusions and fast movements, our deep learning object detection model effectively identifies and tracks important visual elements in real time, such as: ball, players, sticks, referees, crowd, goalkeeper, and goal. Using a curated dataset consisting of a collection of rink hockey videos containing 2525 annotated frames, we trained and evaluated the algorithm's performance and compared it to state-of-the-art object detection techniques. Our object detection model, based on YOLOv7, presents a global accuracy of 80% and, according to our results, good performance in terms of accuracy and speed, making it a good choice for rink hockey applications. In our initial tests, the event detection module successfully detected an important event type in rink hockey games, namely, the occurrence of penalties.
2024
Authors
Batista, A; Torres, JM; Sobral, PM; Moreira, RS; Soares, C; Pereira, I;
Publication
Progress in Artificial Intelligence - 23rd EPIA Conference on Artificial Intelligence, EPIA 2024, Viana do Castelo, Portugal, September 3-6, 2024, Proceedings, Part I
Abstract
Recommendation systems can play an important role in today’s digital content platforms by supporting the suggestion of relevant content in a personalised manner for each customer. Such content customisation has not been consistent across most media domains, and particularly on radio streaming and gaming aggregators, which are the two real-world application domains focused in this work. The challenges faced in these application areas are the dynamic nature of user preferences and the difficulty of generating recommendations for less popular content, due to the overwhelming choice and polarisation of available top content. We present the design and implementation of a Reinforcement Learning-based Recommendation System (RLRS) for web applications, using a Deep Deterministic Policy Gradient (DDPG) agent and, as a reward function, a weighted sum of the user Click Distribution (CD) across the recommended items and the Dwell Time (DT), a measure of the time users spend interacting with those items. Our system has been deployed in real production scenarios with preliminary but promising results. Several metrics are used to track the effectiveness of our approach, such as content coverage, category diversity, and intra-list similarity. In both scenarios tested, the system shows consistent improvement and adaptability over time, reinforcing its applicability. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
2024
Authors
Lopes, JM; Mota, LP; Mota, SM; Torres, JM; Moreira, RS; Soares, C; Pereira, I; Gouveia, F; Sobral, P;
Publication
Abstract
2024
Authors
Gouveia, F; Silva, V; Lopes, J; Moreira, RS; Torres, JM; Guerreiro, MS;
Publication
DISCOVER APPLIED SCIENCES
Abstract
Accurate and up-to-date information about the building stock is fundamental to better understand and mitigate the impact caused by catastrophic earthquakes, as seen recently in Turkey, Syria, Morocco and Afghanistan. Planning for such events is necessary to increase the resilience of the building stock and to minimize casualties and economic losses. Although in several parts of the world new constructions follow more strict compliance with modern seismic codes, a large proportion of existing building stock still demands a more detailed and automated vulnerability analysis. Hence, this paper proposes the use of computer vision deep learning models to automatically classify buildings and create large scale (city or region) exposure models. Such approach promotes the use of open databases covering most cities in the world (cf. OpenStreetMap, Google Street View, Bing Maps and satellite imagery), Therefore providing valuable geographical, topological and image data that may cheaply be used to extract valuable information to feed exposure models. Our previous work using deep learning models achieved, in line with the results from other projects, high classification accuracy concerning building materials and number of storeys. This paper extends the approach by: (i) implementing four CNN-based models to perform classification of three sets of different/extended buildings' characteristics; (ii) training and comparing the performance of the four models for each of the sets; (iii) comparing the risk assessment results based on data extracted from the best CNN-based model against the results obtained with traditional ground data. In brief, the best accuracy obtained with the three tested sets of buildings' characteristics is higher than 80%. Moreover, it is shown that the error resulting from using exposure models fed by automatic classification is not only acceptable, but also far outweighs the time and costs of obtaining a manual and specialised classification of building stocks. Finally, we recognize that automatic assessment of certain complex buildings' characteristics compares to similar limitations of traditional assessments performed by specialized civil engineers, typically related with the identification of the number of storeys and the construction material. However, the identified limitations do not show worse results when compared against the use of manual buildings' assessment. Implement an AI/ML framework for automating the collection of buildings' fa & ccedil;ades pictures annotated with several characteristics required by Exposure Models.Collect, process and filter a 4.239 pictures dataset of buildings' fa & ccedil;ades, which was made publicly available.Train, validate and test several Deep Learning models using 3 sets of building characteristics to produce exposure models with accuracies above 80%.Use heatmaps to show which image areas are more activated for a given prediction, thus helping to explain classification results.Compare simulation results using the predicted exposure model and a manually created exposure model, for the same set of buildings.
2024
Authors
Moita, S; Moreira, RS; Gouveia, F; Torres, JM; Gerreiro, MS; Ferreira, D; Sucena, S; Dinis, MA;
Publication
2024 INTERNATIONAL CONFERENCE ON SMART APPLICATIONS, COMMUNICATIONS AND NETWORKING, SMARTNETS-2024
Abstract
There is a widespread social awareness for the need of adequate accessibility (e.g. missing ramps at crosswalks, obstacles and potholes at sidewalks) in the planning of safe and inclusive city spaces for all citizens. Therefore, municipal authorities responsible for planning urban spaces could benefit from the use of tools for automating the identification of areas in need of accessibility improving interventions. This paper builds on the assumption that it is possible to use Machine Learning (ML) pipelines for automating the detection of accessibility constraints in public spaces, particularly on sidewalks. Those pipelines rely mostly on Deep Learning algorithms to automate the detection of common accessibility issues. Current literature approaches rely on the use of traditional classifiers focused on images' datasets containing single-labelled accessibility classes. We propose an alternative approach using object-detection models that provide a more generic and human-like mode, as it will look into wider city pictures to spot multiple accessibility problems at once. Hence, we evaluate and compare the results of a more generic YOLO model against previous results obtained by more traditional ResNet classification models. The ResNet models used in Project Sidewalk were trained and tested on per-city basis datasets of images crowd-labeled with accessibility attributes. By combining the use of the Project Sidewalk and Google Street View (GSV) service APIs, we re-assembled a world-cities-mix dataset used to train, validate and test the YOLO object-detection model, which exhibited precision and recall values above 84%. Our team of architects and civil engineers also collected a labeled image dataset from two central areas of Porto city, which was used to jointly train and test the YOLO model. The results show that training (even with a small dataset of Porto) the cities-mix-trained YOLO model, provides comparable precision values against the ones obtained by ResNet per-city classifiers. Furthermore, the YOLO approach offers a more human-like generic and efficient pipeline, thus justifying its future exploitation on automating cataloging accessibility mappings in cities.
2024
Authors
Oliveira, JM; Ramos, P;
Publication
MATHEMATICS
Abstract
This study investigates the effectiveness of Transformer-based models for retail demand forecasting. We evaluated vanilla Transformer, Informer, Autoformer, PatchTST, and temporal fusion Transformer (TFT) against traditional baselines like AutoARIMA and AutoETS. Model performance was assessed using mean absolute scaled error (MASE) and weighted quantile loss (WQL). The M5 competition dataset, comprising 30,490 time series from 10 stores, served as the evaluation benchmark. The results demonstrate that Transformer-based models significantly outperform traditional baselines, with Transformer, Informer, and TFT leading the performance metrics. These models achieved MASE improvements of 26% to 29% and WQL reductions of up to 34% compared to the seasonal Na & iuml;ve method, particularly excelling in short-term forecasts. While Autoformer and PatchTST also surpassed traditional methods, their performance was slightly lower, indicating the potential for further tuning. Additionally, this study highlights a trade-off between model complexity and computational efficiency, with Transformer models, though computationally intensive, offering superior forecasting accuracy compared to the significantly slower traditional models like AutoARIMA. These findings underscore the potential of Transformer-based approaches for enhancing retail demand forecasting, provided the computational demands are managed effectively.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.